rohit_malh5 Profile Banner
Rohit Malhotra Profile
Rohit Malhotra

@rohit_malh5

Followers
92
Following
3K
Media
10
Statuses
142

Openhands Maintainer | Ex-CTO @sitewizai | NLP @ CMU | Primarily interested in Agents | Secondary interests in creative design

Joined July 2018
Don't wanna be here? Send us removal request.
@rohit_malh5
Rohit Malhotra
11 days
RT @rbren_dev: Nothing more frustrating than seeing "private scaffold" on public benchmark results. I love that model providers like Qwen a….
0
7
0
@rohit_malh5
Rohit Malhotra
11 days
RT @Alibaba_Qwen: >>> Qwen3-Coder is here! ✅. We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to….
0
1K
0
@rohit_malh5
Rohit Malhotra
12 days
FWIW I think this applies to human developers too. I just see a lot of exceptions being made for agents under the guise of velocity.
0
0
1
@rohit_malh5
Rohit Malhotra
12 days
I believe the scope of a task you delegate to an agent should match the scope you’re willing to review in a PR. IDE agents are tactical—you keep prompting and refine until the code looks good. But autonomous code agents like OpenHands are more hands-off. So how much work you.
1
0
2
@rohit_malh5
Rohit Malhotra
14 days
OpenHands is so general-purpose that I now think of leveraging it with workflow-driven prompting. Also stating constraints works well for me. Examples:. • Examine the existing architecture, read docs for Y, plan how to implement X, then do it.→ Instead of: "Implement feature.
0
0
3
@rohit_malh5
Rohit Malhotra
23 days
RT @MistralAI: Introducing Devstral Small and Medium 2507! This latest update offers improved performance and cost efficiency, perfectly su….
0
333
0
@rohit_malh5
Rohit Malhotra
1 month
RT @gneubig: What will software development look like in 2026?. With coding agents rapidly improving, dev roles may look quite different. M….
0
16
0
@rohit_malh5
Rohit Malhotra
2 months
PSA for engineering leadership exploring software agent solutions 🚨. This post nails the difference between agentic and agentless approaches — and why it actually matters for real software tasks, beyond SWE-Bench scores!.
@allhands_ai
All Hands AI
2 months
Congratulations to Moonshot AI on their release of Kimi-Dev-72B, an open-weights model that achieves a great score of 60.4% on SWE-Bench Verified!. Our community tried it in OpenHands, but it didn't work well, only 17% accuracy. Is this surprising? Actually not really! 🧵
Tweet media one
0
1
5
@rohit_malh5
Rohit Malhotra
2 months
Some users click with code agents. Others struggle. Why? Agents are flexible and creative - just like their users! It's confusing!. Agents should understand, educate, and adapt to users. Even personalize. If the agent isn’t willing to grow, the user likely won’t either.
@rohit_malh5
Rohit Malhotra
2 months
Agents like OpenHands are flexible and have common sense, adjusting to insensible or illogical user demands. Traditional software, however, follows rigid rules of behavior, which is then designed for imaginative users. "Good" design will change as software becomes more flexible.
0
0
1
@rohit_malh5
Rohit Malhotra
2 months
We really do believe in open source, but it wouldn't be possible to sustain it unless someone took over these complexities. Shoutout @rbren_dev.
0
0
2
@rohit_malh5
Rohit Malhotra
2 months
Open source is powerful. Building a company around it? Complicated. Strong insights on how to stay rooted in open source while juggling many stakeholder needs.
@allhands_ai
All Hands AI
2 months
At @allhands_ai we are dedicated to:. 1. Making OpenHands the best agent for software development.2. Doing it in an open, community-driven way. To make this sustainable, we formed a company around the project. @rbren_dev explains our philosophy in our new blog.
Tweet media one
1
0
3
@rohit_malh5
Rohit Malhotra
2 months
RT @allhands_ai: What if we could have *trustworthy* agents that don't just write code, but also do research, understand multimodal content….
0
27
0
@rohit_malh5
Rohit Malhotra
2 months
I believe “execution” and “evaluation” are two major challenges to adoption of code agents from a user perspective. Users must learn out how to leverage the agent effectively, and how to evaluate its work (asap). Could determine whether good agents also delivers great experiences.
0
0
2
@rohit_malh5
Rohit Malhotra
2 months
Used OpenHands to build a speculative spatial tracking museum experience in Unity, despite no Unity experience. It was pure "vibe coding" with no tests or reviews. Challenges: adding features without breaking existing ones and managing a messy codebase. Review your code folks!
2
2
8
@rohit_malh5
Rohit Malhotra
2 months
Agents like OpenHands are flexible and have common sense, adjusting to insensible or illogical user demands. Traditional software, however, follows rigid rules of behavior, which is then designed for imaginative users. "Good" design will change as software becomes more flexible.
0
1
2
@rohit_malh5
Rohit Malhotra
2 months
When OpenHands gets confused about file locations or alters code unrelated to the (relatively simple) task, I now see it as a sign of excessive code complexity. I used to blame agent quality before, but more often than not, it's an issue of poor code quality.
0
0
2
@rohit_malh5
Rohit Malhotra
2 months
Initial results look great 🚀.
@allhands_ai
All Hands AI
2 months
We tested out Claude 4 in OpenHands, and yeah, it's good. 70.4% on SWE-Bench Verified out of the box, +10 points over 3.7 Sonnet. Our vibe checks also passed, so it's available in OpenHands and the OpenHands Cloud today.
Tweet media one
0
0
3
@rohit_malh5
Rohit Malhotra
3 months
RT @gneubig: Some people have said that OpenAI achieved state of the art results on the SWE-Bench Verified leaderboard with their codex mod….
0
28
0