patelmaitreya Profile Banner
Patel Maitreya Profile
Patel Maitreya

@patelmaitreya

Followers
358
Following
3K
Media
53
Statuses
523

Research Intern @Adobe | PhD at @ApgAsu @ASU | Vision & Language | T2I Diffusion Modeling | Prev. @SonyAI_global @Adobe

Tempe, AZ
Joined November 2012
Don't wanna be here? Send us removal request.
@patelmaitreya
Patel Maitreya
9 months
🚀 Introducing FlowChef: "Steering Rectified Flow Models in the Vector Field for Controlled Image Generation"! 🌌✨. 💡 Key Highlights:. - Perform image editing, solve inverse problems, and more. - Achieved inversion-free, gradient-free, & training-free inference time steering!
2
4
21
@patelmaitreya
Patel Maitreya
9 days
RT @patelmaitreya: Thinking of kicking off a new open research project… can’t decide which direction to take. Both have some very spicy ide….
0
1
0
@grok
Grok
1 day
Join millions who have switched to Grok.
20
20
159
@patelmaitreya
Patel Maitreya
10 days
Thinking of kicking off a new open research project… can’t decide which direction to take. Both have some very spicy ideas I think the community will love 👀. 1️⃣ Text-to-image: runs any hardware pre-distill/quant.2️⃣ Flexible unified autoregressive model. Both prod-ready from day.
0
1
2
@patelmaitreya
Patel Maitreya
21 days
RT @tunahansalih: Combining multiple LoRAs is tricky. One concept often dominates, killing the composition. Our paper, CLoRA, accepted as….
0
17
0
@patelmaitreya
Patel Maitreya
26 days
Also this 👇.
@GoogleLabs
Google Labs
27 days
We just discovered the 🔥 COOLEST 🔥 trick in Flow that we have to share:. Instead of wordsmithing the perfect prompt, you can just. draw it. Take the image of your scene, doodle what you'd like on it (through any editing app), and then briefly describe what needs to happen
0
0
0
@patelmaitreya
Patel Maitreya
26 days
Do not ask me to use your model if it cannot at least do this. It seems that video models are finally becoming truly useful. Exciting!.
@runwayml
Runway
27 days
Introducing Runway Aleph, a new way to edit, transform and generate video. Aleph is a state-of-the-art in-context video model, setting a new frontier for multi-task visual generation, with the ability to perform a wide range of edits on an input video such as adding, removing
1
0
0
@patelmaitreya
Patel Maitreya
1 month
First time I’m not feeling FOMO over the OpenAI agent — I’ve got early access to @yutori_ai and it’s way, way better 😁. The team has absolutely nailed the use case. Not surprised — they’re brilliant.
0
2
20
@patelmaitreya
Patel Maitreya
1 month
I mean, the Windsurf/ScaleAI saga was wild to watch… but honestly, it feels unfair to all the early employees who took the biggest risks. If this keeps happening, why would anyone join a startup early? 🚩 Something’s gotta change.
@ns123abc
NIK
1 month
🚨BREAKING: GOOGLE TO PAY $2.4 BILLION FOR WINDSURF STAFF AND IP . we are so back.
Tweet media one
Tweet media two
0
0
5
@patelmaitreya
Patel Maitreya
2 months
RT @alifarhat79: Just do things
Tweet media one
0
623
0
@patelmaitreya
Patel Maitreya
2 months
Only in the Bay Area:.Had a full convo on the context limitations of generative vision models with a random guy on Caltrain. No intros, no names — just shared frustration with 224x224 and short memories. 🤝. #GenAI #BayArea.
0
0
2
@patelmaitreya
Patel Maitreya
2 months
Okay… diffusability is a real concern. 🙁 .Any post-hoc solution without equivariance training?.
0
0
2
@patelmaitreya
Patel Maitreya
3 months
I have been wondering about this for months now. This makes total sense. Gotta change my scripts. 🏃‍➡️.
@_arohan_
rohan anil
3 months
Tweet media one
0
0
3
@patelmaitreya
Patel Maitreya
3 months
I'm glad to see that someone finally wrote a paper on this low hanging fruit, which I shared on X back in December and also included in invited talks. (check this out)
@_akhaliq
AK
3 months
FlowMo. Variance-Based Flow Guidance for Coherent Motion in Video Generation
0
0
0
@patelmaitreya
Patel Maitreya
3 months
Lately, so many incredible projects are being released—it’s inspiring and a little overwhelming. Some days I wake up excited to be part of this space. Other days, I wonder if what I’m building will still matter tomorrow. That back-and-forth is constant.
0
0
7
@patelmaitreya
Patel Maitreya
3 months
I have lost confidence in all papers only using Qwen for alignment improvements. How can we trust that these improvements are not influenced by spurious biases or methodological changes?. Interesting work though.
@StellaLisy
Stella Li
3 months
🤯 We cracked RLVR with. Random Rewards?!.Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:.- Random rewards: +21%.- Incorrect rewards: +25%.- (FYI) Ground-truth rewards: + 28.8%.How could this even work⁉️ Here's why: 🧵.Blogpost:
Tweet media one
0
0
3
@patelmaitreya
Patel Maitreya
3 months
I agree with this. There are still many great papers that predate the recent diffusion LLM hype that are being overlooked. Hint: At least check the #ICLR2025 papers.
@gowthami_s
Gowthami
3 months
@arankomatsuzaki Tbh, there are many good papers coming out of academic labs too, for example diffusion LLMs, but they didn’t really get attention until Google released a paper on it. Perhaps you should also tweet a bit about academic research :).
0
0
2
@patelmaitreya
Patel Maitreya
3 months
RT @jasonbaldridge: Veo 3 is here, and in addition to better visuals, it makes noises and speaks! This was a massive effort made possible b….
0
34
0
@patelmaitreya
Patel Maitreya
3 months
Whoa… just got access to the model—and it’s phenomenal. Solves many of the classic LLM problems right out of the gate. Time to put it through its paces. Wish I knew the parameter count for fair comparison, but I’ll benchmark it soon enough.
@GoogleDeepMind
Google DeepMind
3 months
We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
0
0
3
@patelmaitreya
Patel Maitreya
3 months
I’m seriously getting FOMO.
@GoogleDeepMind
Google DeepMind
3 months
We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO
0
0
4
@patelmaitreya
Patel Maitreya
3 months
Just had a Final Destination-type moment on my way to the Bay Area. Life really threw a plot twist today. Luckily, I did not get hurt and just got a big logistical task on my to-do list. 😮‍💨.
2
0
3