DataScienceHarp Profile Banner
harpreet Profile
harpreet

@DataScienceHarp

Followers
7K
Following
27K
Media
969
Statuses
6K

🤖 👨🏽‍💻 Hacker-in-residence @voxel51| ❤️open source deep learning | VLMs| Visual AI| Learn. Hack. Write. Teach. Repeat. 🪯

I ship daily
Joined April 2020
Don't wanna be here? Send us removal request.
@DataScienceHarp
harpreet
2 days
RT @Voxel51: In Part 1, @datascienceharp will walk through:. 👉 Why standard vision models fail catastrophically on GUI tasks.👉 The annotati….
0
1
0
@DataScienceHarp
harpreet
3 days
Check out the dataset shown here: Here's the LeRobot dataset importer for FiftyOne: Listen to the podcast epiosode here:
0
0
1
@DataScienceHarp
harpreet
3 days
I was literally inspired to action listening to.@Redpoint 's Unsupervised Learning podcast . They had the guys from @physical_int @hausman_k.talking about the need for tools to understand massive robotics datasets. They wanted to spot labeling errors across millions of samples,
3
0
3
@DataScienceHarp
harpreet
4 days
RT @Voxel51: Here’s a peek at what’s on the agenda. ➡️ “Foundational models for generalist computer agents” – Raghav Kapoor, @Adobe . ➡️….
0
1
0
@DataScienceHarp
harpreet
15 days
Labeling data has got to be the most time consuming task I’ve ever done in deep learning.
6
0
4
@DataScienceHarp
harpreet
18 days
I'm teaching a series of workshops in August that go deep into Visual Agents (specifically GUI Agents). The last session is fine-tuning a VLM for this task. A huge challenge is building a dataset for fine-tuning. I want annotations in COCO Format, but nothing exists for this,
4
0
2
@DataScienceHarp
harpreet
22 days
Older. 3.1 on a 386.
@GamewithDave
Dave
22 days
Are you this old?
Tweet media one
1
0
4
@DataScienceHarp
harpreet
22 days
Will have to integrate this into FiftyOne asap!.
@mervenoyann
merve
22 days
all modality RAG 🔥. ColQwen-Omni is a new multimodal retrieval model that can retrieve anything (videos, audios, documents and more!). use with transformers 🤗.here's a smol demo on video retrieval ↙️
1
0
3
@DataScienceHarp
harpreet
22 days
Had an awesome time visiting @huggingface office in Paris! Thank you @mervenoyann for the invite. Good you finally meet you and the legend @reach_vb in person. Looking forward to the next time. Cheers!
Tweet media one
Tweet media two
Tweet media three
Tweet media four
2
2
46
@DataScienceHarp
harpreet
24 days
So pumped to hear @NielsRogge speak at the Brussels meetup!
Tweet media one
0
1
14
@DataScienceHarp
harpreet
25 days
Packed house for @tuanacelik’s talk about RAG with @llama_index at the Amsterdam meetup!
Tweet media one
2
4
22
@DataScienceHarp
harpreet
27 days
Does no one remember what happened with Inflection?.
@bwinterrose
britton winterrose
27 days
I’m sorry but an executive founder leaving their company seems like an obvious fucking breach of fiduciary duty and massive fucking conflict of interest.
1
0
0
@DataScienceHarp
harpreet
1 month
RT @Voxel51: The era of single-purpose vision models is ending. Introducing agglomerative vision models: a fundamental shift from how we’ve….
0
2
0
@DataScienceHarp
harpreet
1 month
RT @Voxel51: If you're in Germany, come out and join us for a hands-on computer vision workshop on Advanced Car Damage Detection with Fifty….
0
1
0
@DataScienceHarp
harpreet
1 month
Resources:. ⭐️ the repo on GitHub: 👨🏽‍💻 Notebook to get started:
Tweet card summary image
github.com
Integrating OS-Atlas Base into FiftyOne as a Remote Source Zoo Model - harpreetsahota204/os_atlas
0
0
3
@DataScienceHarp
harpreet
1 month
OS Atlas 7B is a solid vision model that will localize UI elements reliably, even when you deviate from their suggested prompts. Here's what I learned after two days of experimentation ⇣. 1) OS Atlas 7B reliably localizes UI elements even with prompt variations. • The model
1
1
5
@DataScienceHarp
harpreet
1 month
Two days with Nemotron Nano VL taught me that it can spot a left leg in a crowd but can't find a button on a screen. It's surprisingly capable at natural images but completely breaks on UI tasks. Here are my main takeaways. 1. It's surprisingly good at natural images, despite
1
0
3