
Vittorio Ferrari
@VittoFerrariCV
Followers
5K
Following
11
Media
42
Statuses
60
Principal Research Scientist at Meta Reality Labs
Zurich
Joined June 2020
Paper accepted to the “Multimodal Algorithmic Reasoning” NeurIPS workshop!. HAMMR: Hierarchical multimodal agents for handing many diverse VQA tasks in a single system. @LluisCastrejon @tejmensink @howardzzh @andrefaraujo @JRRU
1
0
7
Come to poster 354 at #CVPR2024's to see our work! 10:30am today, Arch 4A-E. "Grounding Everything: Emerging Localization Properties in Vision-Language Transformers". Paper: Demo:Code:
0
9
33
Our EXPRESS-1 AI model enables @Synthesiaio avatars to understand and adjust to the script automatically 💥. This is a big milestone, so tune in tomorrow for a pre-launch chat with @MattNiessner, @jnstrck, @vriparbelli and @AlexVoica. X Spaces event link:
1
1
12
RT @synthesiaIO: AI Avatars have learned to interpret text now. 😬. Our soon-to-be-public EXPRESS-1 AI model enables Synthesia avatars to un….
0
11
0
Introducing HAMMR: hierarchical multimodal agents that handle a broad range of VQA tasks within a single system (counting, spatial reasoning, OCR, visual pointing, external knowledge, and more). @LluisCastrejon @tejmensink @howardzzh @andrefaraujo @JRRU
1
2
12
Paper accepted to #CVPR2024!. Grounding Everything: Emerging Localization Properties in Vision-Language Transformers. Paper: Demo:Code: With @BousselhamWalid, @FHKPetersen, @HildeKuehne
1
19
126
Happy to share this filmed interview - if you want to join @SynthesiaIO, now is the perfect time!.
Will AI ever take over humanity? 🤔. We’ve got a theory about this in our Social Media team, but let’s double check with an actual expert. Introducing our Director of Science, @VittoFerrariCV, who recently joined Synthesia!. And btw, he’s already looking for new hires, here:
2
1
27
Three papers accepted to #NeurIPS.3/3. NAVI: a dataset of image collections of objects, along with high-quality 3D object scans, near-perfect 2D-3D alignments, and accurate camera parameters. With @jampani_varun, @kmaninis, others
0
16
105
Three papers accepted to #NeurIPS.2/3. "Estimating Generic 3D Room Structures from 2D Annotations". 3D room layouts annotations for 2246 videos (part of CAD-Estate dataset). With @DRozumnyi,@StefanPopovCV, @kmaninis, @MattNiessner
0
9
92
Three papers accepted to #NeurIPS.1/3. StoryBench: a new benchmark for text-to-video generation of stories to guide progress in assistive technology for filmmaking 🧑🎨. With @ebugliarello, @hhm, many others
Wouldn’t it be cool if AI could help us generate movies?🎬.We built a new benchmark to measure progress in this direction🍿. “StoryBench: A Multifaceted Benchmark for Continuous Story Visualization”. 📄 👩💻 📈
1
2
16
I am happy to share that I have joined Synthesia as Director of Science. Excited to start this new adventure!.
Our R&D team just got a major boost - please welcome @VittoFerrariCV, our new Director of Science! 👋. He joins us from Google, where he was a Principal Scientist leading research in computer vision and machine learning. Before that, he built and led teams at ETH Zurich and the
10
1
120
Check out CAD-Estate: a large dataset with 3D object and room layout annotations on RGB videos of complex multi-object scenes (101k objects in total!). With @StefanPopovCV, @kmaninis, @MattNiessner
0
13
64
RT @ebugliarello: Wouldn’t it be cool if AI could help us generate movies?🎬.We built a new benchmark to measure progress in this direction🍿….
0
25
0
Four papers accepted to #ICCV2023!.2/4. CAD-Estate: Large-scale CAD Model Annotation in RGB Videos. >100k 3D objects annotated on RGB videos of complex scenes. Dataset release coming soon!. @StefanPopovCV, @kmaninis, @MattNiessner
1
26
120
Four papers accepted to #ICCV2023!.4/4. Tracking by 3D Model Estimation of Unknown Objects in Videos. With @DRozumnyi, @matas_jiri, Martin R. Oswald, @mapo1
0
1
25
Four papers accepted to #ICCV2023!.3/4. Agile Modeling: From Concept to Classifier in Minutes. We empower any user to develop a classifier for a subjective visual concept in under 30 minutes. With O.Strech, E.Vendrow, and many others
0
2
12
Four papers accepted to #ICCV2023!.1/4. Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories. Dataset release coming soon!. With @tejmensink @JRRU @LluisCastrejon @goelarushi27 @eucadar @howardzzh @feishaAI, A.Araujo
1
2
27