vcossss Profile Banner
Vikash Yadav Profile
Vikash Yadav

@vcossss

Followers
81
Following
93
Media
20
Statuses
27

AI SuperBuilder @ Trilogy Innovations | IIT Jodhpur'24

Joined January 2025
Don't wanna be here? Send us removal request.
@vinayak_sachdev
Vinayak Sachdeva
2 months
@zarazhangrui Love seeing innovation in this space! 🙌 We're tackling a similar problem with LearnLens ( https://t.co/prJu5qbAPb) — a Chrome extension that takes a different approach: ✅ Lives directly in YouTube (side-by-side panel, zero friction) ✅ Knowledge checks that test retention as
Tweet card summary image
learnerslens.ai
AI-powered Chrome extension that turns any YouTube video into an interactive learning experience.
1
8
14
@vcossss
Vikash Yadav
4 months
Not all hours behind a screen are learning hours💻 Some build knowledge, others slip into distraction. TimeBack makes the difference crystal clear! Fix your anti-patterns, and let every minute count⏳ Been cooking this for weeks—can’t wait to ship 🚀
2
1
15
@vcossss
Vikash Yadav
6 months
Know any Gemini video hacks I missed? 🙋 Drop them below 📝👇🏻
0
0
1
@vcossss
Vikash Yadav
6 months
Rule of Thumb 📌 1️⃣ Just one video, per prompt. Don’t pass multiple videos at once for best results. (Even though the limit is 10) 2️⃣ Place video before the prompt. Gemini works better when the video is the first element in the prompt, before any questions or instructions.
1
0
1
@vcossss
Vikash Yadav
6 months
Flash caveats 🚨 ⚖️ It’s not as instant as old Flash (because it now "thinks"). Turn off the 'thinking' mode if speed is a priority. 🏆 Pro still outperforms for complex reasoning. Consider trying both. If Flash performs well enough, switch. You might not notice the difference,
1
0
1
@vcossss
Vikash Yadav
6 months
Still using the 'pro' model for everything? Think again! ⚡ Gemini Flash was once the "fast but shallow" model, but no more! Since Gemini 2.5, 'Flash' has also gained a "reasoning" mode! 🧠 It often delivers comparable results to pro at a fraction of the cost. 🚀 🧩 Reasoning
1
0
1
@vcossss
Vikash Yadav
6 months
Want to query specific parts of the video? 📐 ❌ Don’t manually trim, no ffmpeg overhead. ✅ Just use the 'start_offset' and 'end_offset' params. Upload once via Files API, then reference the clip with offsets. No need to re-upload segments — just point to different time ranges
1
0
1
@vcossss
Vikash Yadav
6 months
Re-querying the same video or parts of the same video? ♻️ 🚫 Don’t use the inline video passing method. 📁 Use the Files API. Upload once, reuse in all queries. Why this is better? • 🔁 No re-upload: Files are kept up to 48 h, saving subsequent upload times. • ⚡ Faster:
1
0
1
@vcossss
Vikash Yadav
6 months
Struggling with fast motion clips? 🏃🏻 Most people don’t know Gemini samples all your videos at 1 FPS by default. So the action in between frames (1 second apart) is lost. 😱 What should we do? Earlier, slowing down or speeding up the video was the way to go. ⏪⏩ But now
1
0
1
@vcossss
Vikash Yadav
6 months
Dealing with timestamps? ⏲️ Use the "MM:SS" format ✅ (Like '1:28' instead of '88s') This keeps it consistent with what Gemini internally prefers, hence: • 📍 Improved timestamp accuracy • 🐛 Reduced parsing errors
1
0
1
@vcossss
Vikash Yadav
6 months
Analyzing long videos? 📹 Chunk them ✂️ One call per chunk. Why chunk? • 🎯 Smaller context window → less hallucination • ⌚ Avoids the slight drift in timestamps observed beyond 1 hour • 💰 Keeps you in the cheaper Gemini Pro pricing tier (<=200k tokens) Bonus: Overlap
1
0
1
@vcossss
Vikash Yadav
6 months
Using Gemini for video analysis?🎥 Want faster, cheaper, better results?✨ Here are 8 pro tips you probably didn't know 🧵👇🏻
1
2
7
@vcossss
Vikash Yadav
6 months
This has many use cases: - User activity monitoring - Automated UI Testing - Video content analysis - Attention tracking - Security Systems Has compression ever tripped you up? Let’s talk 👇
0
0
2
@vcossss
Vikash Yadav
6 months
The system is now precise and intelligent! Key Learnings: 🔍Always visualize — numbers alone can lie 🫧Video compression in common formats creates invisible differences
1
0
1
@vcossss
Vikash Yadav
6 months
But wait—what about non-user changes? Clocks, blinking cursors, loading animations... They do create differences, but they’re not user input Solution: Now that we have clean diffs, we can feed them to vision models 🧠 Let them decide the type of change.
1
0
0
@vcossss
Vikash Yadav
6 months
After a bit of threshold tuning: BOOM! 💥 Clean difference detection. Compression artifacts? Gone✅ Real changes? Captured✅ Magic? Nah — just better math.
1
0
1
@vcossss
Vikash Yadav
6 months
I needed a better way to detect real activity — not compression noise. New plan: Take absolute pixel difference b/w frames → then denoise 🥅 Step1️⃣Filter out small differences below a threshold. Result -> Weird leftover dots. Step 2⃣ Filter out tiny regions by area. (those
2
0
0
@vcossss
Vikash Yadav
6 months
The culprit? 🎭Compression artifacts Most common video formats like .mp4, .mov, and .webm use lossy compression. They don’t preserve exact pixels, just “good enough” approximations. So even if nothing visibly changes, the frames (pixels) aren't identical.
1
0
0
@vcossss
Vikash Yadav
6 months
So I plotted the SSIM difference maps. Boom. Weird patches showed up. Pixel changes where there shouldn't be any.
1
0
0
@vcossss
Vikash Yadav
6 months
Sometimes the SSIM score was lower even when nothing visibly changed. Lower than when something actually changed ! 🤯 This made zero sense ! What kind of visual sorcery was I dealing with?
1
0
0