Refining joint locations for human pose tracking in sports videos

The estimation of an athlete's pose in video footage enables the automation of athletic performance assessment, the prediction of motion kinematics and dynamics in sports videos and the possibility of technology-assisted, direct training feedback. Despite remarkable progress in the field of deep learning assisted human pose estimation, the performance of such systems decreases while noise and errors increase with the complexity of the scene. In this paper, we focus on aquatic training scenarios, where even novel pose estimators produce several types of orthogonal errors, including joint swaps and prediction outliers. In order to improve the estimation of an athlete's pose in swimming, we propose a graph partitioning problem that connects pose estimates over time and explicitly allows for joints to switch labels if their location better fits each other's trajectory. We optimize the problem using integer linear programming, which partitions the graph into the most probable joint trajectories. We show experimentally that our method of joint rectification improves the joint detection precision of swimmers in a swimming channel by 0.8%-4.8% PCK for anti-symmetrical motion and up to 1.8% PCK for symmetrical styles.
© Copyright 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE. Julkaistu Tekijä IEEE. Kaikki oikeudet pidätetään.

Aiheet: uinti asento analyysi suorituskyky datan syöttö virhe video seuranta
Aihealueet: kestävyys urheilu tekniset ja luonnontieteet
DOI: 10.1109/CVPRW.2019.00308
Julkaisussa: IEEE/CVF Conference on Computer Vision and Pattern Recognition
Julkaistu: Long Beach IEEE 2019
Sivuja: 2524-2532
Julkaisutyypit: artikkeli
Kieli: englanti (kieli)
Taso: kehittynyt