Papers of the day   All papers

VideoBERT: A Joint Model for Video and Language Representation Learning

Comments

Jun 27 2019 Yasser Souri

A short note on "VideoBERT: A Joint Model for Video and Language Representation Learning". https://arxiv.org/abs/1904.01766 By Chen Sun, Ausin Myers, @cvondrick, Kevin Murphy and Cordelia Schmid (1)
1 replies, 87 likes


Aug 13 2019 Paul Liang

some exciting recent work in self-supervised multimodal learning including VideoBERT (https://arxiv.org/abs/1904.01766), ViLBERT (https://arxiv.org/abs/1908.02265), and VisualBERT (https://arxiv.org/abs/1908.03557). for more papers in multimodal representation learning, check out https://github.com/pliang279/awesome-multimodal-ml https://t.co/8tCQ0Gg5Qo
0 replies, 84 likes


Sep 17 2019 DataScienceNigeria

AI powered by @GoogleAI VideoBERT can predict what will happen next in a video by learning visual-linguistic & visual representations from unlabeled videos. Self-supervised system that tackles proxy tasks to learn temporal representations Read more at https://arxiv.org/pdf/1904.01766.pdf https://t.co/1TV9yszY8m
0 replies, 35 likes


Aug 27 2019 William Wang

https://t.co/Cng1KgTMV0
1 replies, 12 likes


Jul 01 2019 Tuhin Chakrabarty

https://arxiv.org/pdf/1904.01766.pdf Nice paper using BERT for Video Captioning by Google :)
0 replies, 9 likes


Sep 17 2019 Christian Szegedy

More grounding for NLP, better visual interpretation: AI research at its best.
0 replies, 8 likes


Sep 25 2019 Xavier Giró🎗

Cordelia Schmid from @Inria overviews #ICCV19 VideoBERT, where cross-modal representations are learned from instructional cooking videos. #bmva https://arxiv.org/abs/1904.01766 https://t.co/6c1FaXiAtI
0 replies, 4 likes


Sep 13 2019 arXiv CS-CV

VideoBERT: A Joint Model for Video and Language Representation Learning http://arxiv.org/abs/1904.01766
0 replies, 2 likes


Aug 27 2019 Rogue 🌻. Bigham

https://t.co/OwqpzS1Yjx
0 replies, 2 likes


Content