Yue Yang

@YueYangAI

Followers
578
Following
155
Media
31
Statuses
108

Research scientist @allen_ai | PhD @upennnlp | Vision and Language

Philadelphia, PA
Joined July 2018
@YueYangAI
Yue Yang
14 days
RT @allen_ai: 🤖✨ What if models that take action in the physical world could think through your instructions? Meet MolmoAct, our new fully….
0
73
0
@YueYangAI
Yue Yang
3 months
RT @jeffrey_ch0: 🤖💬 Herding instincts… in AIs? Yes, even LLMs can follow the crowd! • 📉 Conformity ↑ when agents lack confidence but trust….
0
7
0
@YueYangAI
Yue Yang
3 months
Successfully defended my PhD thesis and got hooded this week! Thanks to all the friends who supported me throughout this incredible journey! Excited to join PRIOR at @allen_ai next and continue exploring open vision-language research!
16
4
154
@YueYangAI
Yue Yang
3 months
🎉 CoSyn is accepted to ACL 2025!
0
0
6
@YueYangAI
Yue Yang
4 months
RT @AnnieFeng6: #ICLR2025 Oral. LLMs often struggle with reliable and consistent decisions under uncertainty 😵‍💫 — largely because they can….
0
39
0
@YueYangAI
Yue Yang
6 months
RT @artemispng: Exciting news! 🎉 Our paper “ViUniT: Visual Unit Tests for More Robust Visual Programming” got accepted at #CVPR2025.
0
2
0
@YueYangAI
Yue Yang
6 months
RT @codezakh: ✨ Introducing MutaGReP (Mutation-guided Grounded Repository Plan Search) - an approach that uses LLM-guided tree search to fi….
0
38
0
@YueYangAI
Yue Yang
6 months
This work was done during my great summer internship at @allen_ai with my awesome collaborators: Ajay Patel, @mattdeitke, @tanmay2099, @LucaWeihs, @drewmikehead, @yatskar, Chris Callison-Burch, @RanjayKrishna, @anikembhavi, Christopher Clark.
0
0
4
@YueYangAI
Yue Yang
6 months
We also show we can create synthetic pointing data to improve the click accuracy of VLMs in GUI agent tasks. On the ScreenSpot click prediction benchmark, our model trained on synthetic pointing data can outperform existing methods with much less training data.
1
0
5
@YueYangAI
Yue Yang
6 months
We notice open VLMs struggle with novel out-of-domain tasks like interpreting nutrition labels. However, CoSyn’s controllable data generation can create targeted synthetic data for task-specific fine-tuning, achieving strong zero-shot performance with significantly less data.
1
0
4
@YueYangAI
Yue Yang
6 months
On 7 text-rich benchmarks (e.g., ChartQA, DocVQA), our model trained on synthetic data outperforms competitive open and proprietary VLMs. Our zero-shot model, trained without benchmark examples, beats most baselines, proving the generalizability of training on synthetic data.
2
0
4
@YueYangAI
Yue Yang
6 months
CoSyn uses code as the intermediate representation to build synthetic multimodal datasets. We prompt a text-only LLM to generate code that renders images, and then we use code as context to create instruction-tuning data, such as QA pairs, for fine-tuning VLMs.
1
0
5
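The two-step idea described in the post above (generate rendering code, then reuse that code as context for question-answer pairs) can be sketched roughly as follows. This is a toy illustration, not the CoSyn implementation: both "LLM" steps are replaced by hypothetical stand-in functions, the canned chart code and every function name are assumptions, and the image is a plain SVG string so the example needs only the Python standard library.

```python
"""Toy sketch of a code-guided synthetic data pipeline (all names hypothetical)."""

# Hypothetical rendering code that a text-only LLM might return for a chart request.
_CANNED_CODE = '''
title = "__TOPIC__"
data = {"apples": 3, "pears": 5, "plums": 2}
bars = "".join(
    f'<rect x="{i * 40}" y="{100 - v * 10}" width="30" height="{v * 10}"/>'
    for i, v in enumerate(data.values())
)
svg = f'<svg xmlns="http://www.w3.org/2000/svg" width="120" height="100">{bars}</svg>'
'''


def generate_code(topic: str) -> str:
    """Stand-in for prompting a text-only LLM; returns canned code (assumption)."""
    return _CANNED_CODE.replace("__TOPIC__", topic)


def run_code(code: str) -> dict:
    """Execute generated code and return its variables. Only do this with trusted code."""
    namespace: dict = {}
    exec(code, namespace)
    return namespace


def generate_qa(code: str) -> list[dict]:
    """Stand-in for a second LLM call that reads the code as context.

    The answers come from the code's own `data` variable, so they match the
    rendered image without inspecting any pixels.
    """
    data = run_code(code)["data"]
    top = max(data, key=data.get)
    first = next(iter(data))
    return [
        {"question": "Which category has the largest value?", "answer": top},
        {"question": f"What is the value for {first}?", "answer": str(data[first])},
    ]


def build_example(topic: str) -> dict:
    """Bundle the rendered image, its source code, and the QA pairs into one record."""
    code = generate_code(topic)
    return {"image_svg": run_code(code)["svg"], "code": code, "qa_pairs": generate_qa(code)}


if __name__ == "__main__":
    example = build_example("fruit sales")
    print(example["qa_pairs"])
```

The point of the sketch is the data flow: because the code fully determines the image, text derived from the code can serve as ground truth for questions about the image.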
@YueYangAI
Yue Yang
6 months
Our CoSyn framework integrates 11 rendering tools for 20 robust generation pipelines, which support the creation of diverse text-rich images, including charts, documents, diagrams, tables, and even music sheets 🎼, and many more!
1
0
5
@YueYangAI
Yue Yang
6 months
We share Code-Guided Synthetic Data Generation: using LLM-generated code to create multimodal datasets for text-rich images, such as charts📊, documents📄, etc., to enhance Vision-Language Models. Website: Dataset: Paper:
6
47
196
@YueYangAI
Yue Yang
7 months
RT @LongLeRobot: Articulate Anything has just been accepted to @iclr_conf #ICLR2025 ! Looking forward to seeing everyone in Singapore 🇸🇬 🙀….
0
8
0
@YueYangAI
Yue Yang
9 months
RT @Ai2Prior: 📢Applications are open for summer'25 internships at the PRIOR (computer vision) team @allen_ai. Come join us in building la….
0
13
0
@YueYangAI
Yue Yang
10 months
RT @cmalaviya11: Excited to share ✨ Contextualized Evaluations ✨! Benchmarks like Chatbot Arena contain underspecified queries, which can….
0
31
0
@YueYangAI
Yue Yang
10 months
RT @veronica3207: 🤔What model explanation method should you use? How to ensure it reflects the model’s true reasoning? 🌟 In our CL survey,….
0
8
0
@YueYangAI
Yue Yang
10 months
RT @cmalaviya11: ✨Updates✨: • Dolomites was accepted to TACL: - Our data is now also up on HuggingFace: https://t.….
0
11
0