Reconstructing and grounding narrated instructional videos in 3D

published by Dimitri Zhukov in 2021 in Informatics Engineering and research's language is English Download

Abstract in English

Narrated instructional videos often show and describe manipulations of similar objects, e.g., repairing a particular model of a car or laptop. In this work we aim to reconstruct such objects and to localize associated narrations in 3D. Contrary to the standard scenario of instance-level 3D reconstruction, where identical objects or scenes are present in all views, objects in different instructional videos may have large appearance variations given varying conditions a

Download