Reference Video Recording Instruction

The dataset to be built is to help the understanding of the "pointing" action. The goal is to record videos where you refer to a specific object in the scene to an imagined person (camera) with both sentences and pointing.

Setting: Referring to an object with both sentence and pointing:

Sample Video

Please watch the sample video with more detailed step-by-step instruction.