Skip to content

Latest commit

 

History

History
23 lines (19 loc) · 2.22 KB

README.md

File metadata and controls

23 lines (19 loc) · 2.22 KB

awesomeCVpapers

Video Descriptor

Video Descriptor

(from Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri, Learning Spatiotemporal Features with 3D Convolutional Networks, ICCV15'. )

  • C3D, Facebook AI Research [Paper] [Project Page] Hao
  • Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri, Learning Spatiotemporal Features with 3D Convolutional Networks, ICCV15'.
  • CNN with VLAD, The University of Queensland [Paper] Hao
  • Zhongwen Xu, Yi Yang, Alexander G. Hauptmann, A Discriminative CNN Video Representation for Event Detection, CVPR15'.

Video Description Generation

Video Description

(from Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville, Describing Videos by Exploiting Temporal Structure, ICCV15'. )

  • HRNE, Zhejiang University [Paper] Hao
  • Pingbo Pan, Zhongwen Xu, Yi Yang, Fei Wu, Yueting Zhuang, Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning, arXiv:1511.03476.
  • LSTM with CNN+3DCNN & Attention Mechanism, Universite ́ de Montre ́al [Paper] Hao
  • Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville, Describing Videos by Exploiting Temporal Structure, ICCV15'.
  • Video caption via LSTM, UT Austin [Paper] [Project Page] Hao
  • Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, Kate Saenko, Translating Videos to Natural Language Using Deep Recurrent Neural Networks, ICCV15'.
  • LSTM-E, Microsoft Research, Beijing [Paper] Hao
  • Yingwei Pan, Tao Mei, Ting Yao, Houqiang Li, Yong Rui, Jointly Modeling Embedding and Translation to Bridge Video and Language, arXiv:1505.01861.