Pavankumar Vasu @PavankumarVasu X Profile

Pavankumar Vasu

@PavankumarVasu

Followers

153

Following

36

Media

2

Statuses

35

Joined July 2013

Don't wanna be here? Send us removal request.

Pavankumar Vasu

@PavankumarVasu

4 months

Excited to share code & models for FastVLM — our blazing-fast Vision-Language Model appearing at #CVPR2025. Run it on-device with inference code optimized for Apple Silicon using #mlx. Code: Updated paper & results coming soon. Stay tuned! 👀

11

51

209

Pavankumar Vasu

@PavankumarVasu

10 days

RT @FartashFg: 🚨📅The submission deadline for #NeurIPS 2025 CCFM Workshop is just 8 days away on August 22. Get your papers in!.Submit your….

sites.google.com

Foundation models, despite their impressive capabilities, face a critical challenge: they naturally become outdated. Trained on vast datasets, frequently updating these models is expensive. Crucial...

0

1

0

Grok

@grok

5 days

Join millions who have switched to Grok.

257

289

2K

Pavankumar Vasu

@PavankumarVasu

10 days

RT @maxseitzer: Introducing DINOv3 🦕🦕🦕. A SotA-enabling vision foundation model, trained with pure self-supervised learning (SSL) at scale.….

0

139

0

Pavankumar Vasu

@PavankumarVasu

24 days

RT @andimarafioti: 🚀 We're thrilled to launch four new OCR datasets with 20M images: DoclingMatix, SynthFormulaNet, SynthCodeNet, and Synth….

0

78

0

Pavankumar Vasu

@PavankumarVasu

27 days

RT @teelinsan: Uncertainty quantification (UQ) is key for safe, reliable LLMs. but are we evaluating it correctly?. 🚨 Our ACL2025 paper f….

0

13

0

Pavankumar Vasu

@PavankumarVasu

1 month

RT @HPouransari: 🌟Explore key insights from the FastVLM project (real-time vision-language model) in this blog post:. .

machinelearning.apple.com

Vision Language Models (VLMs) enable visual understanding alongside textual inputs. They are typically built by passing visual tokens from a…

0

38

0

Pavankumar Vasu

@PavankumarVasu

1 month

RT @FartashFg: 📢Submissions are now open for #NeurIPS2025 CCFM workshop. Submission deadline: August 22, 2025, AoE. Website: https://t.co/….

openreview.net

Welcome to the OpenReview homepage for NeurIPS 2025 Workshop CCFM

0

6

0

Pavankumar Vasu

@PavankumarVasu

1 month

RT @MustafaShukor1: We propose new scaling laws that predict the optimal data mixture, for pretraining LLMs, native multimodal models and l….

0

47

0

Pavankumar Vasu

@PavankumarVasu

1 month

RT @RinMetcalfSusa: 📣 We are excited to present our work on inferring user preferences from writing samples at @icmlconf Poster Session 3 (….

0

3

0

Pavankumar Vasu

@PavankumarVasu

2 months

RT @FartashFg: 🚀Super excited to share TiC-LM (Oral at #ACL2025)!.How to keep FMs up-to-date over months/years? We have a benchmark and lot….

sites.google.com

Foundation models, despite their impressive capabilities, face a critical challenge: they naturally become outdated. Trained on vast datasets, frequently updating these models is expensive. Crucial...

0

2

0

Pavankumar Vasu

@PavankumarVasu

2 months

RT @thoma_gu: I will be attending #CVPR2025 and presenting our latest research at Apple MLR! Specifically, I will present our highlight pos….

0

22

0

Pavankumar Vasu

@PavankumarVasu

3 months

RT @ryan_hoque: Imitation learning has a data scarcity problem. Introducing EgoDex from Apple, the largest and most diverse dataset of de….

0

95

0

Pavankumar Vasu

@PavankumarVasu

4 months

Huge thanks to team and collaborators:.@HPouransari, David Koski, Christopher Webb, @FartashFg, @chunliang_tw, Cem, Nate, Albert, Gokul, James, Peter and @OncelTuzel.

0

8

Pavankumar Vasu

@PavankumarVasu

4 months

RT @cydhsieh: Excited to introduce FocalLens: an instruction tuning framework that turns existing VLMs/MLLMs into text-conditioned vision e….

0

8

0

Pavankumar Vasu

@PavankumarVasu

4 months

RT @MartinKlissarov: Here is an RL perspective on understanding LLMs for decision making. Are LLMs best used as: .policies / rewards / tra….

0

30

0

Pavankumar Vasu

@PavankumarVasu

5 months

RT @YizheZhangNLP: Excited to share our new paper on "Reversal Blessing" - where thinking BACKWARDS makes language models smarter on some m….

0

30

0

Pavankumar Vasu

@PavankumarVasu

7 months

RT @samira_abnar: 🚨 One question that has always intrigued me is the role of different ways to increase a model's capacity: parameters, par….

0

67

0

Pavankumar Vasu

@PavankumarVasu

8 months

RT @HPouransari: 📢📢📢. We released the code for dataset-decomposition [NeurIPS 2024]: a simple method to speed up LLM pre-training using seq….

github.com

Official repo of dataset-decomposition paper [NeurIPS 2024] - apple/ml-dataset-decomposition

0

6

0

Pavankumar Vasu

@PavankumarVasu

8 months

RT @ryan_hoque: 🚨 New research from my team at Apple - real-time augmented reality robot feedback with just your hands + Vision Pro! . Pape….

0

41

0

Pavankumar Vasu

@PavankumarVasu

8 months

RT @HPouransari: What matters for runtime optimization in Vision Language Models (VLMs)? Vision encoder latency 🤔? Image resolution 🤔? Numb….

0

21

0