Facebook Research Open Sources DensePose


Recent research in human understanding aims primarily at localizing a sparse set of joints, like the wrists, or elbows of humans. This may suffice for applications like gesture or action recognition, but it delivers a reduced image interpretation. We wanted to go further. Imagine trying new clothes on via a photo, or putting costumes on your friend’s photos. For these tasks, a more complete, surface-based image interpretation is required.

The article is very accessible, and DensePose seems to be quite impressive: 

Earlier works on this problem would require computation in the order of minutes. DensePose operates at multiple frames per second on a single GPU and can handle tens or even hundreds of humans simultaneously.


Want to receive more content like this in your inbox?