VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs
Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani
People
Research Scientist
I am a Research Scientist at Meta AI Research and Applied Machine Learning (AML). I am also an associate professor in the Computer Science Department at Dartmouth. I received a Laurea Degree in computer science with summa cum laude honors from the University of Milan (Italy) in 1996, and an MS and a PhD in computer science from Stanford University in 2001 and 2005, respectively. Prior to Meta, I have worked at several other industrial research labs, including Microsoft Research, Like.com, and Digital Persona. My research interests are in computer vision and deep learning. I am the recipient of a CVPR best student paper prize, a National Science Foundation CAREER Award, a Google Faculty Research Award, three Facebook Faculty Awards and a Fulbright US Scholar Award.
Computer vision, deep learning and artificial intelligence
Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani
Humam Alwassel, Dhruv Mahajan, Bruno Korbar, Lorenzo Torresani, Bernard Ghanem, Du Tran
Heng Wang, Du Tran, Lorenzo Torresani, Matt Feiszli
Gedas Bertasius, Lorenzo Torresani
Ruohan Gao, Tae-Hyun Oh, Kristen Grauman, Lorenzo Torresani
Gedas Bertasius, Christoph Feichtenhofer, Du Tran, Jianbo Shi, Lorenzo Torresani