Weakly-supervised Disentangling with Recurrent Transformations for 3D View Synthesis

10 years 3 months ago

Download web.eecs.umich.edu

An important problem for both graphics and vision is to synthesize novel views of a 3D object from a single image. This is particularly challenging due to the partial observability inherent in projecting a 3D object onto the image space, and the ill-posedness of inferring object shape and pose. However, we can train a neural network to address the problem if we restrict our attention to speciﬁc object categories (in our case faces and chairs) for which we can gather ample training data. In this paper, we propose a novel recurrent convolutional encoder-decoder network that is trained end-to-end on the task of rendering rotated objects starting from a single image. The recurrent structure allows our model to capture long-term dependencies along a sequence of transformations. We demonstrate the quality of its predictions for human faces on the Multi-PIE dataset and for a dataset of 3D chair models, and also show its ability to disentangle latent factors of variation (e.g., identity and...

Jimei Yang, Scott Reed, Ming-Hsuan Yang, Honglak L

Real-time Traffic

CORR 2016 | Education |

claim paper

Post Info
More Details (n/a)

Added	31 Mar 2016
Updated	31 Mar 2016
Type	Journal
Year	2016
Where	CORR
Authors	Jimei Yang, Scott Reed, Ming-Hsuan Yang, Honglak Lee

Comments (0)

Sciweavers

Weakly-supervised Disentangling with Recurrent Transformations for 3D View Synthesis

CORR 2016 | Education |

Explore & Download

Productivity Tools

Sciweavers