Computer Vision ; People Tracking; Surface Modeling; Pose Estimation
KatirciogluI., RhodinR., SpörriJ., SalzmannM., FuaPascal (2021), Self-Supervised Human Detection and Segmentation via Multi-View Consensus, in
International Conference on Computer Vision, OnlineIEEE, Online.
GuillardB., RemelliE., YvernayP., FuaPascal (2021), Sketch2mesh: Reconstructing and Editing 3D Shapes from Sketches, in
International Conference on Computer Vision, OnlineIEEE, Online.
BednarikJan, KimVladimir, ChaudhuriS., Parashar S., SalzmannM., FuaP., AigermanN. (2021), Temporally-Coherent Surface Reconstruction via Metric-Consistent Atlases, in
International Conference on Computer Vision, OnlineIEEE, Online.
Kiciroglu Sena, Rhodin Helge, Sinha Sudipta, Salzmann Mathieu, Fua Pascal (2020), ActiveMoCap: Optimized Viewpoint Selection for Active Human Motion Capture, in
CVPR, IEEE, Online.
Liu Weizhe, Salzmann Mathieu (2020), Estimating People Flows to Better Count Them in Crowded Scenes, in
ECCV, EUROPEAN CONFERENCE ON COMPUTER VISION, Online.
Remelli Edoardo, Lukoianov Artem, Richter Stephan, Guillard Benoit, Bagautdinov Timur, Baque Pierre, Fua Pascal (2020), MeshSDF: Differentiable Iso-Surface Extraction, in
NeurIPS, Neural Information Processing Systems Foundation, Online.
BednarikJan, ParasharShaifali, GundogduErhan, SalzmannMathieu (2020), Shape Reconstruction by Learning, in
CVPR, IEEE, Online.
Guillard Benoit, Remelli Edoardo, Fua Pascal (2020), UCLID-Net: Single View Reconstruction in Object Space, in
NeurIPS, Neural Information Processing Systems Foundation, Online.
Maksai Andrii, Fua Pascal (2019), Eliminating Exposure Bias and Metric Mismatch in Multiple Object Tracking, in
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USAIEEE, Long Beach, CA, USA.
DavydovA., Remizova A., ConstantinV., Honari S., SalzmannM., FuaPascal, Adversarial Parametric Pose Prior}, in
Computer Vision and Pattern Recognition, IEEE, New Orleans.
LiuW., DurasovN., FuaPascal, Leveraging Self-Supervision for Cross-Domain Crowd Counting, in
Computer Vision and Pattern Recognition, IEEE, New Orleans.
Capturing people's 3D shape and motion has always been central to Computer Vision. Today, there is great interest in doing so solely by analyzing video sequences, both because cameras are becoming ever cheaper and more prevalent and because there are so many potential applications. They include athletic training, surveillance, entertainment, and fashion design among others.Much progress has been made recently, in large part thanks to the advent of powerful Machine Learning techniques. However, the task remains challenging because of the many ambiguities inherent to monocular 3D reconstruction, which stem from the loss of depth information resulting from the projection from 3D to 2D. In earlier work, we have addressed these difficulties in the contexts of body pose estimation and surface reconstruction.While both of these approaches are effective on individual people or surfaces, they completely ignore the interactions both between people whose motion are often correlated as well as between people and their clothes, which are deformable surfaces whose motion is driven by that of the person wearing them. The goal of this project will therefore be to bring these two strands of research together so that we can ultimately model multiple clothed people interacting with each other, which is still beyond the state of the art.