Adhiraj Ghosh ✈️ ACL 2025 Profile
Adhiraj Ghosh ✈️ ACL 2025

@adhiraj_ghosh98

Followers
258
Following
371
Media
8
Statuses
57

ELLIS PhD @uni_tue | vision-language & data-centric ML @bethgelab 🦋: https://t.co/Q03vvJFIPw

Tübingen, Deutschland
Joined April 2024
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
3 months
🏆ONEBench accepted to ACL main! ✨ Stay tuned for the official leaderboard and real-time personalised benchmarking release! If you’re attending ACL or are generally interested in the future of foundation model benchmarking, happy to talk! #ACL2025NLP #ACL2025 @aclmeeting
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
9 months
🚨Looking to test your foundation model on an arbitrary and open-ended set of capabilities, not explicitly captured by static benchmarks? Check out ONEBench, part of my MS thesis, where we show how sample-level evaluation is the solution. 🔎 🧵👇
1
1
10
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
16 days
RT @benno_krojer: This is precisely why we released MVPBench to measure genuine video understanding with minimal video pairs. Otherwise we…
arxiv.org
Existing benchmarks for assessing the spatio-temporal understanding and reasoning abilities of video language models are susceptible to score inflation due to the presence of shortcut solutions...
0
2
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
19 days
RT @Hu_Hsu: Truly appreciate the authors of Molmo @Molmo_AI (from @allen_ai and @UW) for promoting open research and adopting MetaCLIP. T…
0
8
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
30 days
RT @YungSungChuang: Scaling CLIP on English-only data is outdated now… 🌍We built CLIP data curation pipeline for 300+ languages. 🇬🇧We train…
0
80
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
1 month
RT @sbdzdz: If you're in Vienna for ACL, @adhiraj_ghosh98 and I will be presenting our work on benchmarking language and vision-language mo…
bethgelab.github.io
ONEBench: a new paradigm for open-ended benchmarking and evaluation of foundation models, aggregating sample-level tests across datasets.
0
4
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
1 month
Excited to be in Vienna for #ACL2025🇦🇹! You'll find @sbdzdz and me by our ONEBench poster, so do drop by! 🗓️Wed, July 30, 11-12:30 CET 📍Hall 4/5. I’m also excited to talk about lifelong and personalised benchmarking, data curation and vision-language in general! Let’s connect!
0
4
14
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
2 months
RT @KarelDoostrlnck: The fine folks at HuggingFace used our APO-zero algorithm to train a very efficient reasoning LM. Awesome 🙌!
0
6
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
2 months
RT @thao_nguyen26: Web data, the “fossil fuel of AI”, is being exhausted. What’s next?🤔 We propose Recycling the Web to break the data wall…
0
62
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
4 months
RT @DBahdanau: Adam deserves the award, but in Singapore everyone still uses SGD.
0
63
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
5 months
RT @AmyPrb: 🚨 New paper! Exciting progress in GRPO variants, smarter training strategies, and curated datasets showing impressive improvem…
0
4
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
5 months
RT @debsarkar_sayan: 🏆 CrossOver is accepted as a 𝗛𝗶𝗴𝗵𝗹𝗶𝗴𝗵𝘁 at @CVPR #CVPR2025! ✨ 💻 Fully open-sourced code with all pre-trained checkpoint…
github.com
[CVPR 2025, Highlight] CrossOver: 3D Scene Cross-Modal Alignment - GradientSpaces/CrossOver
0
4
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
6 months
RT @fededagos: 🚨 New paper alert! 🚨 We’ve just launched openretina, an open-source framework for collaborative retina modeling across datas…
0
9
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
6 months
RT @AmyPrb: LMs excel at solving problems (~48% success) but falter at debunking them (<9% counterexample rate)! Could form an AI Brandol…
0
1
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
6 months
If you needed a reason to visit Nashville in June🔥
@debsarkar_sayan
Sayan Deb Sarkar
6 months
🎉 Excited to share our latest work, CrossOver: 3D Scene Cross-Modal Alignment, accepted to #CVPR2025 🌐✨ We learn a unified, modality-agnostic embedding space, enabling seamless scene-level alignment across multiple modalities, with no semantic annotations needed!🚀
0
0
1
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
8 months
Valuable multimodal training data contribution and very thorough experimentation🎉 Interesting that fine-tuning the visual encoder doesn’t help in such tasks, which seems to be aligned with common wisdom in the LMM world.
@JiaruiZ58876329
Jiarui Zhang (Jerry)
8 months
[1/11] Many recent studies have shown that current multimodal LLMs (MLLMs) struggle with low-level visual perception (LLVP), the ability to precisely describe the fine-grained/geometric details of an image. How can we do better? Introducing Euclid, our first study at improving
0
0
3
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
8 months
RT @AmyPrb: 🚨Looking for open problems in machine unlearning for AI safety? We provide a deep dive into the nuances of removing harmful know…
0
2
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
8 months
RT @JieyuZhang20: Excited to share my intern project at Salesforce Research! Huge thanks to everyone on the team!!
0
15
0
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
8 months
Used LMMs-Eval to build ONEBench(: fantastic team who have diligently helped me with queries and access to data over the past 6 months. Glad to see it gain so much traction!
arxiv.org
Traditional fixed test sets fall short in evaluating open-ended capabilities of foundation models. To address this, we propose ONEBench (OpeN-Ended Benchmarking), a new testing paradigm that...
@liuziwei7
Ziwei Liu
8 months
🚀LMMs-Eval🚀 has reached 2.2K stars with 60+ contributors from the community! - Repo: - Dataset @huggingface : * Join us to build a standardized evaluation toolkit for large multimodal models (image, video and audio)🤩
0
2
9
@adhiraj_ghosh98
Adhiraj Ghosh ✈️ ACL 2025
8 months
No surprises whatsoever but congratulations!
@KarelDoostrlnck
Karel D’Oosterlinck
8 months
🫡 accepted to TACL.
0
0
1