roeiherzig Profile Banner
Roei Herzig Profile
Roei Herzig

@roeiherzig

Followers
1K
Following
8K
Media
120
Statuses
2K

Researcher @IBMResearch. Postdoc @berkeley_ai. PhD @TelAvivUni. Working on Compositionality, Multimodal Foundation Models, and Structured Physical Intelligence.

Berkeley, CA
Joined March 2017
Don't wanna be here? Send us removal request.
@roeiherzig
Roei Herzig
7 months
What happens when vision๐Ÿค robotics meet? Happy to share our new work on Pretraining Robotic Foundational Models!๐Ÿ”ฅ ARM4R is an Autoregressive Robotic Model that leverages low-level 4D Representations learned from human video data to yield a better robotic model. @berkeley_ai๐Ÿ˜Š
9
73
437
@roeiherzig
Roei Herzig
24 days
Big personal news: I've traded my "extraordinary alien"๐Ÿ‘ฝ status for a much cooler one--just a very standard permanent resident! Yay!๐ŸŽ‰๐Ÿฅณ My green card has officially arrived! ๐Ÿ’š ๐Ÿ‡บ๐Ÿ‡ธ๐Ÿ—ฝ
Tweet media one
5
0
24
@roeiherzig
Roei Herzig
29 days
We are working on that direction. Stay tuned ๐Ÿ˜‰
@TairanHe99
Tairan He
29 days
GPT-5 Example: โ€œHow many fingers per hand?โ€ We need vision-centric VLM โ€” and even more, vision-action-centric VLA. Language is a powerful tool, but itโ€™s hard to bend it to the needs of vision and robotics.
Tweet media one
0
0
2
@roeiherzig
Roei Herzig
1 month
๐Ÿš€ Excited to speak at the Agentic AI Summit NOW! ๐Ÿ“Frontier Stage - Session 4: Foundations of Agents Iโ€™ll give a 5-min spotlight on my research: ๐‘บ๐’•๐’“๐’–๐’„๐’•๐’–๐’“๐’†๐’… ๐‘ท๐’‰๐’š๐’”๐’Š๐’„๐’‚๐’ ๐‘ฐ๐’๐’•๐’†๐’๐’๐’Š๐’ˆ๐’†๐’๐’„๐’† โ€” and why it matters for the future of vision & robotics
@dawnsongtweets
Dawn Song
1 month
Really excited for the Agentic AI Summit 2025 at @UCBerkeleyโ€”2K+ in-person attendees and ~10K online! Building on our 25K+ LLM Agents MOOC community, this is the premier global forum for advancing #AgenticAI. ๐Ÿ‘€ Livestream starts at 9:15 AM PT on August 2โ€”tune in!
Tweet media one
0
3
14
@roeiherzig
Roei Herzig
1 month
๐Ÿš€ Excited to speak tomorrow at the Agentic AI Summit! Iโ€™ll give a 5-min spotlight on my research: ๐‘บ๐’•๐’“๐’–๐’„๐’•๐’–๐’“๐’†๐’… ๐‘ท๐’‰๐’š๐’”๐’Š๐’„๐’‚๐’ ๐‘ฐ๐’๐’•๐’†๐’๐’๐’Š๐’ˆ๐’†๐’๐’„๐’† โ€” and why it matters for the future of vision & robotics ๐Ÿค–๐Ÿง  ๐ŸŽฏ https://t.co/4Fyv90sThN #AgenticAI #AI #Robotics
Tweet media one
0
1
5
@roeiherzig
Roei Herzig
1 month
ื–ื” ื›ืžื•ื‘ืŸ ืœื ื ื›ื•ืŸ. ื”ืชืงืจืช ื–ื›ื•ื›ื™ืช ืงื™ื™ืžืช ื‘ื›ืœ ื”ืžืงืจื™ ืงืฆื”, ืฉื‘ื”ื ืื™ืŸ ื•ืœื ื™ื”ื™ื” ืœื ื• ืžืกืคื™ืง ื“ื™ืื˜ื”. ื‘ื™ืŸ ืื ืžื“ื•ื‘ืจ ื‘ื‘ืขื™ื•ืช ืฉืœื ื ื™ืชืŸ ืœื”ืฉื™ื’ ื‘ื”ื ื“ืื˜ื” (ืœืžืฉืœ ืจื•ื‘ื•ื˜ื™ื), ืื• ื‘ื™ืŸ ืื ืžื“ื•ื‘ืจ ืขืœ ื‘ืขื™ื•ืช ืžืจืืฉ ืฉืžืขื•ืœื ืœื ื™ื”ื™ื” ืœื ื• ืžืœื ื“ื’ื™ืžื•ืช. ืœืžืฉืœ ืœื’ื ืจื˜ ื—ื™ืคื•ืฉื™ืช ื ื“ื™ืจื” ืžื”ืืžื–ื•ื ืก.
@ReemSherman
Reem Sherman
1 month
ืžืขื‘ืจ ืœื›ืœ ื”ืคื•ื ืงืฆื™ื ืœื™ื•ืช ื”ืžื“ื”ื™ืžื”, ืขืœ ื’ื‘ื•ืœ ื”ืงืกื, ืฉืœ ืžื•ื“ืœื™ ื”ื•ื™ื“ืื•/ืชืžื•ื ื” ื˜ืžื•ืŸ ืฉื™ื ื•ื™ ื˜ื›ื ื•ืœื•ื’ื™ ืžืจื’ืฉ ื›ืœ ื›ืš, ื•ื–ื” ื”ืžืขื‘ืจ ืžืจืฉืชื•ืช ืงื•ื ื‘ื•ืœื•ืฆื™ื” (CNNื™ื) ืœืžื•ื“ืœื™ ืชืžื•ื ื” ืžื‘ื•ืกืกื™ ื˜ืจื ืกืคื•ืจืžืจื™ื, ืฉื”ืชื‘ืกืกื• ืขืœ ืื•ืชื• ืžืืžืจ ื”ื™ืกื˜ื•ืจื™ ืž-2017. ื‘ืกื•ืฃ, ืžื” ืฉืฆืจื™ืš ืœื”ื‘ื™ืŸ ื–ื” ืฉื”ื“ืจืš ื”ืงื•ื“ืžืช ืœืืžืŸ AI ื”ื™ื™ืชื” ื—ืกื•ืžื”. ื‘ืฉืœื‘ ืžืกื•ื™ื, ื›ื›ืœ
1
0
1
@roeiherzig
Roei Herzig
1 month
Awesome idea! Congrats folks๐Ÿš€
@davidrmcall
David McAllister
1 month
Excited to share Flow Matching Policy Gradients: expressive RL policies trained from rewards using flow matching. Itโ€™s an easy, drop-in replacement for Gaussian PPO on control tasks.
0
0
4
@roeiherzig
Roei Herzig
2 months
My son?๐Ÿ˜‚
@2prime_PKU
Yiping Lu
2 months
Anyone knows adam?
Tweet media one
0
0
1
@roeiherzig
Roei Herzig
2 months
Thanks @IlirAliu_ for highlighting our work!๐Ÿ™Œ ๐ŸŒ Project page: https://t.co/gmPJqMLGNE ๐Ÿ”— Code: https://t.co/7XgQVSPDhI More exciting projects on the wayโ€”stay tuned!๐Ÿค–
Tweet card summary image
github.com
Official Implementation of ARM4R ICML 2025. Contribute to Dantong88/arm4r development by creating an account on GitHub.
@IlirAliu_
Ilir Aliu - eu/acc
2 months
Robots usually need tons of labeled data to learn precise actions. What if they could learn control skills directly from human videosโ€ฆ no labels needed? Robotics pretraining just took a BIG jump forward. A new Autoregressive Robotic Model, learns low-level 4D representations
0
1
7
@roeiherzig
Roei Herzig
2 months
๐Ÿš€ Our code for ARM4R is now released! Check it out here ๐Ÿ‘‰
Tweet card summary image
github.com
Official Implementation of ARM4R ICML 2025. Contribute to Dantong88/arm4r development by creating an account on GitHub.
@roeiherzig
Roei Herzig
7 months
What happens when vision๐Ÿค robotics meet? Happy to share our new work on Pretraining Robotic Foundational Models!๐Ÿ”ฅ ARM4R is an Autoregressive Robotic Model that leverages low-level 4D Representations learned from human video data to yield a better robotic model. @berkeley_ai๐Ÿ˜Š
0
14
70
@roeiherzig
Roei Herzig
2 months
๐Ÿš€ Excited that our ARM4R paper will be presented next week at #ICML2025! If youโ€™re into 4D representations and robotic particle-based representations, donโ€™t miss it! ๐Ÿค–โœจ I wonโ€™t be there in person, but make sure to stop by and chat with Yuvan! ๐Ÿ™Œ
@roeiherzig
Roei Herzig
7 months
What happens when vision๐Ÿค robotics meet? Happy to share our new work on Pretraining Robotic Foundational Models!๐Ÿ”ฅ ARM4R is an Autoregressive Robotic Model that leverages low-level 4D Representations learned from human video data to yield a better robotic model. @berkeley_ai๐Ÿ˜Š
1
13
60
@roeiherzig
Roei Herzig
2 months
Love the core message here! Predictions โ‰  World Models. Predictions are task-specific, but world models can generalize across many tasks.
@keyonV
Keyon Vafa
2 months
Can an AI model predict perfectly and still have a terrible world model? What would that even mean? Our new ICML paper formalizes these questions One result tells the story: A transformer trained on 10M solar systems nails planetary orbits. But it botches gravitational laws ๐Ÿงต
1
0
3
@tsunghan_wu
Tsung-Han (Patrick) Wu
2 months
Final chances to #ICCV2025 ๐ŸŒด๐ŸŒบ Submit your best work to the MMFM @ ICCV workshop on all things multimodal: vision, language, audio and more. ๐Ÿ—“๏ธ Deadline: July 1 ๐Ÿ”—
openreview.net
Welcome to the OpenReview homepage for ICCV 2025 Workshop MMFM4
@roeiherzig
Roei Herzig
2 months
๐Ÿšจ Rough luck with your #ICCV2025 submission? Weโ€™re organizing the 4th Workshop on Whatโ€™s Next in Multimodal Foundation Models at @ICCVConference in Honolulu ๐ŸŒบ๐ŸŒด Send us your work on vision, language, audio & more! ๐Ÿ—“๏ธ Deadline: July 1, 2025 ๐Ÿ”—
0
1
4
@roeiherzig
Roei Herzig
2 months
๐Ÿšจ Rough luck with your #ICCV2025 submission? Weโ€™re organizing the 4th Workshop on Whatโ€™s Next in Multimodal Foundation Models at @ICCVConference in Honolulu ๐ŸŒบ๐ŸŒด Send us your work on vision, language, audio & more! ๐Ÿ—“๏ธ Deadline: July 1, 2025 ๐Ÿ”—
sites.google.com
News: ๐Ÿˆ Submissions are on OpenReview (Open Now!): https://openreview.net/group?id=thecvf.com/ICCV/2025/Workshop/MMFM4 MMFM has two paper tracks. More information under Call for Papers. Proceedings...
2
8
24
@roeiherzig
Roei Herzig
2 months
Re "vision researchers move to robotics"-theyโ€™re just returning to the fieldโ€™s roots. Computer vision began as "robotic vision", focused on agents perceiving & interacting within the world. The shift to "internet vision" came later with the rise of online 2D data. Org CV book๐Ÿ‘‡
Tweet media one
@YuXiang_IRVL
Yu Xiang
3 months
โ€why are so many vision / learning researchers moving to robotics?โ€ Keynote from @trevordarrell #RSS2025
Tweet media one
Tweet media two
Tweet media three
Tweet media four
0
0
6
@CSProfKGD
Kosta Derpanis
2 months
Overall, I think the move from CMT to OpenReview was a great decision. Now if only we can improve the paper-reviewer matching system!
2
4
20
@roeiherzig
Roei Herzig
2 months
๐Ÿš€ Excited to share that our latest work on Sparse Attention Vectors (SAVs) has been accepted to @ICCVConference โ€” see you all in Hawaii! ๐ŸŒธ๐ŸŒด ๐ŸŽ‰ SAVs is a finetuning-free method leveraging sparse attention heads in LMMs as powerful representations for VL classification tasks.
@chancharikm
Chancharik Mitra
8 months
๐ŸŽฏ Introducing Sparse Attention Vectors (SAVs): A breakthrough method for extracting powerful multimodal features from Large Multimodal Models (LMMs). SAVs enable SOTA performance on discriminative vision-language tasks (classification, safety alignment, etc.)! Links in replies!
0
5
32
@roeiherzig
Roei Herzig
3 months
Yes!๐Ÿฅณ
@LerrelPinto
Lerrel Pinto
3 months
It was nice engaging with the CV community on ways to stand out in the crowd. My answer was simple: work on robotics. There are so many unanswered problems and open pastures for research if you are a new researcher. Below are 6 problems I focussed on in my talk.
1
0
0
@roeiherzig
Roei Herzig
3 months
Honored to be named an ๐Ž๐ฎ๐ญ๐ฌ๐ญ๐š๐ง๐๐ข๐ง๐  ๐‘๐ž๐ฏ๐ข๐ž๐ฐ๐ž๐ซ for ๐‚๐•๐๐‘ ๐Ÿ๐ŸŽ๐Ÿ๐Ÿ“ !๐ŸŽ‰ Grateful to contribute to the community and support the high standards of the conference. Maybe it's time to start thinking about AC-ing? ๐Ÿ™ƒ #CVPR2025 @CVPR
1
0
12
@roeiherzig
Roei Herzig
3 months
Got a big, bold question? Let me know! Open to your questionsโ€”ambitious ones especially! ๐Ÿค–๐Ÿ’ฌ
@roeiherzig
Roei Herzig
3 months
๐Ÿšจ Our panel kicks off at 11:30 AM in Room 207 Aโ€“D (Level 2)! Don't miss an amazing discussion with: Ludwig Schmidt, Andrew Owens, Arsha Nagrani, and Ani Kembhavi ๐Ÿ”ฅ
0
1
4
@roeiherzig
Roei Herzig
3 months
0
0
1