
Patel Maitreya
@patelmaitreya
Followers
358
Following
3K
Media
53
Statuses
523
Research Intern @Adobe | PhD at @ApgAsu @ASU | Vision & Language | T2I Diffusion Modeling | Prev. @SonyAI_global @Adobe
Tempe, AZ
Joined November 2012
🚀 Introducing FlowChef: "Steering Rectified Flow Models in the Vector Field for Controlled Image Generation"! 🌌✨. 💡 Key Highlights:. - Perform image editing, solve inverse problems, and more. - Achieved inversion-free, gradient-free, & training-free inference time steering!
2
4
21
RT @patelmaitreya: Thinking of kicking off a new open research project… can’t decide which direction to take. Both have some very spicy ide….
0
1
0
RT @tunahansalih: Combining multiple LoRAs is tricky. One concept often dominates, killing the composition. Our paper, CLoRA, accepted as….
0
17
0
Do not ask me to use your model if it cannot at least do this. It seems that video models are finally becoming truly useful. Exciting!.
Introducing Runway Aleph, a new way to edit, transform and generate video. Aleph is a state-of-the-art in-context video model, setting a new frontier for multi-task visual generation, with the ability to perform a wide range of edits on an input video such as adding, removing
1
0
0
First time I’m not feeling FOMO over the OpenAI agent — I’ve got early access to @yutori_ai and it’s way, way better 😁. The team has absolutely nailed the use case. Not surprised — they’re brilliant.
0
2
20
I mean, the Windsurf/ScaleAI saga was wild to watch… but honestly, it feels unfair to all the early employees who took the biggest risks. If this keeps happening, why would anyone join a startup early? 🚩 Something’s gotta change.
0
0
5
I have lost confidence in all papers only using Qwen for alignment improvements. How can we trust that these improvements are not influenced by spurious biases or methodological changes?. Interesting work though.
🤯 We cracked RLVR with. Random Rewards?!.Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:.- Random rewards: +21%.- Incorrect rewards: +25%.- (FYI) Ground-truth rewards: + 28.8%.How could this even work⁉️ Here's why: 🧵.Blogpost:
0
0
3
I agree with this. There are still many great papers that predate the recent diffusion LLM hype that are being overlooked. Hint: At least check the #ICLR2025 papers.
@arankomatsuzaki Tbh, there are many good papers coming out of academic labs too, for example diffusion LLMs, but they didn’t really get attention until Google released a paper on it. Perhaps you should also tweet a bit about academic research :).
0
0
2
RT @jasonbaldridge: Veo 3 is here, and in addition to better visuals, it makes noises and speaks! This was a massive effort made possible b….
0
34
0
Whoa… just got access to the model—and it’s phenomenal. Solves many of the classic LLM problems right out of the gate. Time to put it through its paces. Wish I knew the parameter count for fair comparison, but I’ll benchmark it soon enough.
We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
0
0
3
I’m seriously getting FOMO.
We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
0
0
4