Towards Semantic Grounding via Video Data: The Case of push & pull

Date:

[Poster]