Yusen Zhang Profile
Yusen Zhang

@YusenZhangNLP

Followers
412
Following
287
Media
28
Statuses
111

PhD Candidate @PennStateEECS | NLP Lab @NLP_PennState #NLProc | Prev Research Intern @MSFTResearch, @AmazonScience @GoogleAI

State College, PA
Joined November 2022
Don't wanna be here? Send us removal request.
@YusenZhangNLP
Yusen Zhang
3 months
🚀 How Far Are VLMs from Effective High-Resolution Image Understanding?.👉 We found: Still far. 🆕 Introducing HRScene Benchmark:.📸 25 Real-world Scenes + 🧪 2 Diagnostic NIAH Tests.🏙️ 8 Categories: Daily, Paper, Urban Planning, etc. 🖼️ Resolution: 1,024 × 1,024 ➡️ 35,503 ×
Tweet media one
Tweet media two
1
5
14
@YusenZhangNLP
Yusen Zhang
15 days
RT @RyoKamoi: Our paper VisOnlyQA has been accepted to @COLM_conf #COLM2025! See you in Montreal🍁.We find that even recent Vision Language….
0
9
0
@YusenZhangNLP
Yusen Zhang
18 days
RT @RyoKamoi: We updated our VisOnlyQA paper for #COLM2025!.* LVLMs exhibit weak geometric perception even on geometric shapes with 2–3 lin….
0
2
0
@YusenZhangNLP
Yusen Zhang
1 month
HRScene got accepted at #ICCV2025!. HRScene is a novel unified benchmark for high-resolution image understanding with 25 scenes and 2 NIAH tests. Home page: (Sorry, EvalAI for submission does not work currently. ). My PhD research began with long text
Tweet media one
@YusenZhangNLP
Yusen Zhang
3 months
🚀 How Far Are VLMs from Effective High-Resolution Image Understanding?.👉 We found: Still far. 🆕 Introducing HRScene Benchmark:.📸 25 Real-world Scenes + 🧪 2 Diagnostic NIAH Tests.🏙️ 8 Categories: Daily, Paper, Urban Planning, etc. 🖼️ Resolution: 1,024 × 1,024 ➡️ 35,503 ×
Tweet media one
Tweet media two
0
3
7
@YusenZhangNLP
Yusen Zhang
2 months
RT @GptMaestro: Vision Language Models display a peculiar blind spot: their ability to process image content declines in a U-shaped pattern….
0
1
0
@YusenZhangNLP
Yusen Zhang
2 months
RT @jackqqwang: NeuroGen: We explored a training-free idea—using prompts to guide large models to generate neural net parameters for downst….
0
1
0
@YusenZhangNLP
Yusen Zhang
2 months
RT @RyoKamoi: 📢 New paper!.FoVer enhances PRMs for step-level verification of LLM reasoning w/o human annotation 🚀.We synthesize training d….
0
25
0
@YusenZhangNLP
Yusen Zhang
3 months
RT @iScienceLuvr: HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?. "we introduce HRScene, a novel unified ben….
0
11
0
@YusenZhangNLP
Yusen Zhang
3 months
RT @ruizhang_nlp: As Vision Language Models treat images as tokens, high-resolution images create long sequences, similar to long-context c….
0
5
0
@YusenZhangNLP
Yusen Zhang
3 months
RT @vipul_1011: Ever wondered how much you can trust a benchmark?. We did too - so we built SMART to make them smarter!. I will be presenti….
0
7
0
@YusenZhangNLP
Yusen Zhang
3 months
RT @snigdhac25: Want to learn about fairness in summarization? @HaoyuanLi9 will present our work on fairness in multidocument summarization….
0
2
0
@YusenZhangNLP
Yusen Zhang
3 months
RT @ruizhang_nlp: This work is led by my first PhD student Yusen @YusenZhangNLP, who is graduating soon and actively seeking a postdoc posi….
0
5
0
@YusenZhangNLP
Yusen Zhang
3 months
RT @ruizhang_nlp: 🎉Our paper on fairness of multidoc summarization has received an SAC award at NAACL 2025! 🥳 We appreciate the recognition….
0
5
0
@YusenZhangNLP
Yusen Zhang
3 months
I will be at NAACL this week. Welcome to discuss with me if you have any thoughts on this project and all the other research topics!.
@YusenZhangNLP
Yusen Zhang
3 months
🚀 How Far Are VLMs from Effective High-Resolution Image Understanding?.👉 We found: Still far. 🆕 Introducing HRScene Benchmark:.📸 25 Real-world Scenes + 🧪 2 Diagnostic NIAH Tests.🏙️ 8 Categories: Daily, Paper, Urban Planning, etc. 🖼️ Resolution: 1,024 × 1,024 ➡️ 35,503 ×
Tweet media one
Tweet media two
0
0
3
@YusenZhangNLP
Yusen Zhang
3 months
✍️ Authors:. Yusen Zhang @YusenZhangNLP, Wenliang Zheng, Aashrith Madasu, Peng Shi, Ryo Kamoi @RyoKamoi, Hao Zhou @hao_zhh, Zhuoyang Zou, Shu Zhao, Sarkar Snigdha Sarathi Das @sarkarssdas, Vipul Gupta @vipul_1011, Xiaoxin Lu, Nan Zhang @NanZhangNLP, Ranran Haoran Zhang.
0
0
1
@YusenZhangNLP
Yusen Zhang
3 months
🤔 Why do models fail to perform well?. Our Diagnostic Datasets give some insights!. 📍 Regional Defect:.Measures the gap between the highest performing region and the mean performance across all regions. Gemini 2.0 Flash shows 39.85% divergence on 10×10 grids. Meaning: If the
Tweet media one
Tweet media two
1
0
1