Pavankumar Vasu Profile
Pavankumar Vasu

@PavankumarVasu

Followers
153
Following
36
Media
2
Statuses
35

Joined July 2013
Don't wanna be here? Send us removal request.
@PavankumarVasu
Pavankumar Vasu
4 months
Excited to share code & models for FastVLM — our blazing-fast Vision-Language Model appearing at #CVPR2025. Run it on-device with inference code optimized for Apple Silicon using #mlx. Code: Updated paper & results coming soon. Stay tuned! 👀
11
51
209
@grok
Grok
5 days
Join millions who have switched to Grok.
257
289
2K
@PavankumarVasu
Pavankumar Vasu
10 days
RT @maxseitzer: Introducing DINOv3 🦕🦕🦕. A SotA-enabling vision foundation model, trained with pure self-supervised learning (SSL) at scale.….
0
139
0
@PavankumarVasu
Pavankumar Vasu
24 days
RT @andimarafioti: 🚀 We're thrilled to launch four new OCR datasets with 20M images: DoclingMatix, SynthFormulaNet, SynthCodeNet, and Synth….
0
78
0
@PavankumarVasu
Pavankumar Vasu
27 days
RT @teelinsan: Uncertainty quantification (UQ) is key for safe, reliable LLMs. but are we evaluating it correctly?. 🚨 Our ACL2025 paper f….
0
13
0
@PavankumarVasu
Pavankumar Vasu
1 month
RT @HPouransari: 🌟Explore key insights from the FastVLM project (real-time vision-language model) in this blog post:. .
Tweet card summary image
machinelearning.apple.com
Vision Language Models (VLMs) enable visual understanding alongside textual inputs. They are typically built by passing visual tokens from a…
0
38
0
@PavankumarVasu
Pavankumar Vasu
1 month
RT @FartashFg: 📢Submissions are now open for #NeurIPS2025 CCFM workshop. Submission deadline: August 22, 2025, AoE. Website:  https://t.co/….
openreview.net
Welcome to the OpenReview homepage for NeurIPS 2025 Workshop CCFM
0
6
0
@PavankumarVasu
Pavankumar Vasu
1 month
RT @MustafaShukor1: We propose new scaling laws that predict the optimal data mixture, for pretraining LLMs, native multimodal models and l….
0
47
0
@PavankumarVasu
Pavankumar Vasu
1 month
RT @RinMetcalfSusa: 📣 We are excited to present our work on inferring user preferences from writing samples at @icmlconf Poster Session 3 (….
0
3
0
@PavankumarVasu
Pavankumar Vasu
2 months
RT @thoma_gu: I will be attending #CVPR2025 and presenting our latest research at Apple MLR! Specifically, I will present our highlight pos….
0
22
0
@PavankumarVasu
Pavankumar Vasu
3 months
RT @ryan_hoque: Imitation learning has a data scarcity problem. Introducing EgoDex from Apple, the largest and most diverse dataset of de….
0
95
0
@PavankumarVasu
Pavankumar Vasu
4 months
Huge thanks to team and collaborators:.@HPouransari, David Koski, Christopher Webb, @FartashFg, @chunliang_tw, Cem, Nate, Albert, Gokul, James, Peter and @OncelTuzel.
0
0
8
@PavankumarVasu
Pavankumar Vasu
4 months
RT @cydhsieh: Excited to introduce FocalLens: an instruction tuning framework that turns existing VLMs/MLLMs into text-conditioned vision e….
0
8
0
@PavankumarVasu
Pavankumar Vasu
4 months
RT @MartinKlissarov: Here is an RL perspective on understanding LLMs for decision making. Are LLMs best used as: .policies / rewards / tra….
0
30
0
@PavankumarVasu
Pavankumar Vasu
5 months
RT @YizheZhangNLP: Excited to share our new paper on "Reversal Blessing" - where thinking BACKWARDS makes language models smarter on some m….
0
30
0
@PavankumarVasu
Pavankumar Vasu
7 months
RT @samira_abnar: 🚨 One question that has always intrigued me is the role of different ways to increase a model's capacity: parameters, par….
0
67
0
@PavankumarVasu
Pavankumar Vasu
8 months
RT @HPouransari: 📢📢📢. We released the code for dataset-decomposition [NeurIPS 2024]: a simple method to speed up LLM pre-training using seq….
Tweet card summary image
github.com
Official repo of dataset-decomposition paper [NeurIPS 2024] - apple/ml-dataset-decomposition
0
6
0
@PavankumarVasu
Pavankumar Vasu
8 months
RT @ryan_hoque: 🚨 New research from my team at Apple - real-time augmented reality robot feedback with just your hands + Vision Pro! . Pape….
0
41
0
@PavankumarVasu
Pavankumar Vasu
8 months
RT @HPouransari: What matters for runtime optimization in Vision Language Models (VLMs)? Vision encoder latency 🤔? Image resolution 🤔? Numb….
0
21
0