Haoyi Qiu

@HaoyiQiu

Followers
972
Following
1K
Media
33
Statuses
182

Research intern @SFResearch ☁️ PhD student @UCLANLP 🧸 BS in CS&Math @UMich 〽️ #NLP #Multimodal #Safety 🌷

Los Angeles, CA
Joined October 2018
@HaoyiQiu
Haoyi Qiu
3 months
🌏 How culturally safe are large vision-language models? 👉 LVLMs often miss the mark. We introduce CROSS, a benchmark of 1,284 image-query pairs across 16 countries & 14 languages, revealing how LVLMs violate cultural norms in context. ⚖️ Evaluation via CROSS-EVAL. 🧨 Safety
5
21
65
@HaoyiQiu
Haoyi Qiu
11 days
RT @linxins2: Thank you so much Caiming! We show that involving coding as a new type of action apart from GUI action for CUA can signific…
0
4
0
@HaoyiQiu
Haoyi Qiu
17 days
RT @SFResearch: 🌟 Happy National Intern Day! Today we celebrate the brilliant minds and diverse perspectives that our interns bring to @SF…
0
11
0
@HaoyiQiu
Haoyi Qiu
17 days
RT @qiancheng1231: 🤝 Can LLM agents really understand us? We introduce UserBench: a user-centric gym environment for benchmarking how well…
0
34
0
@HaoyiQiu
Haoyi Qiu
24 days
RT @Yihe__Deng: 🙌 We've released the full version of our paper, OpenVLThinker: Complex Vision-Language Reasoning via Iterative SFT-RL Cycle…
0
41
0
@HaoyiQiu
Haoyi Qiu
25 days
RT @alexfabbri4: Excited to share MultiNRC, a new SEAL Leaderboard at Scale AI! MultiNRC is a challenging multilingual reasoning benchmark…
huggingface.co
0
7
0
@HaoyiQiu
Haoyi Qiu
2 months
RT @ManlingLi_: Can VLMs build Spatial Mental Models like humans? Reasoning from limited views? Reasoning from partial observations? Reaso…
0
58
0
@HaoyiQiu
Haoyi Qiu
2 months
RT @QiyueGao123: 🤔 Have @OpenAI o3, Gemini 2.5, Claude 3.7 formed an internal world model to understand the physical world, or just align p…
0
44
0
@HaoyiQiu
Haoyi Qiu
2 months
RT @ziqiao_ma: Can we scale 4D pretraining to learn general space-time representations that reconstruct an object from a few views at any t…
0
41
0
@HaoyiQiu
Haoyi Qiu
2 months
RT @victor__li__: Glad to be part of the team! It's been a great pleasure working with so many talented people at Tesla (both in and out o…
0
3
0
@HaoyiQiu
Haoyi Qiu
2 months
RT @tparekh97: 🚨 New work: LLMs still struggle at Event Detection due to poor long-context reasoning and inability to follow task constrain…
0
19
0
@HaoyiQiu
Haoyi Qiu
2 months
RT @yikewang_: LLMs are helpful for scientific research, but will they continuously be helpful? Introducing 🔍 ScienceMeter: current knowle…
0
55
0
@HaoyiQiu
Haoyi Qiu
3 months
RT @steeve__huang: 🚨 The Business AI Plot Thickens 🚨 CRMArena set the stage for business AI evaluation in realistic environments. Now we'r…
0
10
0
@HaoyiQiu
Haoyi Qiu
3 months
RT @StellaLisy: 🤯 We cracked RLVR with Random Rewards?! Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by: - Rando…
0
348
0
@HaoyiQiu
Haoyi Qiu
3 months
RT @YungSungChuang: 🚨 Do passage rerankers really need explicit reasoning? 🤔 Maybe Not! Our findings: ⚖️ Standard rerankers outperform those…
0
18
0
@HaoyiQiu
Haoyi Qiu
3 months
Safety data construction by re-purposing the CVQA dataset 🔧
0
0
2
@HaoyiQiu
Haoyi Qiu
3 months
Quantitative comparison of cultural safety performance (English / multilingual) ⚖️
0
0
2
@HaoyiQiu
Haoyi Qiu
3 months
Multi-dimensional categorization of data in CROSS ⬇️
0
0
3
@HaoyiQiu
Haoyi Qiu
3 months
RT @steeve__huang: Cultural safety in AI isn't just nice-to-have, it's essential ✅ Our new paper reveals that leading VLMs struggle with c…
0
1
0
@HaoyiQiu
Haoyi Qiu
3 months
Grateful for the incredible team at UCLA Plus Lab, Salesforce AI Research, and Google DeepMind: @VioletNPeng, Ruichen Zheng, @steeve__huang, and @sunjiao123sun_! 🥳
0
1
4