
PyImageSearch
@PyImageSearch
Followers
25K
Following
10K
Media
1K
Statuses
12K
Leading deep learning educator. Tutorials and courses on #DeepLearning, #ComputerVision, #LLMs, #GenAI, #OpenCV, #Keras, #TensorFlow, #PyTorch, and many more.
USA
Joined January 2014
π§ Unconditional + prompt-guided captions.β‘ Hash-based caching on @Redisinc = instant repeats.π οΈ Served via @FastAPI, @salesforce BLIP model from @huggingface.
0
0
0
New tutorial! π. BLIP Captioning API + Redis Cache w/ FastAPI.π.Author: Vikram Singh.#BLIP #ImageCaptioning #FastAPI #Redis #HuggingFace #MLOps #Python
3
0
4
RT @KalshiSports: Game by the numbers. Kalshi volume: $26.6m.Spit ejections: 1.Rizzler commercials: 1.Weather delays: 1.AJ Brown catches: 1.
0
30
0
New tutorial! π. From CNN+RNN β BLIP. π§© @salesforce BLIP on @huggingface = vision + language fused.βοΈ Practical, scalable, deployment-ready. π. Author: Vikram Singh. #BLIP #ImageCaptioning #VLM #AI #HuggingFace #PyImageSearch
0
2
13
βοΈ Qwen Model from @alibaba_cloud decides which model gives better VQA answers β @salesforce BLIP vs @Google PaliGemma.π No humans, just pure AI judgment.π₯ Final result: a polished VQA @huggingface dataset.
0
0
1
New tutorial! π.Synthetic Data Generation Using the VLM-as-Judge Method.π.Author: Piyush Thakur.#VQA #SyntheticData #Qwen #BLIP #PaliGemma #AI #MachineLearning #HuggingFace.
pyimagesearch.com
Use Qwen as a judge to compare BLIP and PaliGemma outputs, creating a high-quality synthetic VQA dataset automatically.
1
0
10
@cosmo3769 Generated 85.7k+ synthetic VQA answers using VLMs π€―. πΌοΈ Models: @salesforce BLIP + @Google PaliGemma.π¦ Format: @huggingface Datasets.π No labeling needed!.
0
1
2
π New tutorial just dropped!.Synthetic Data Generation Using the BLIP and PaliGemma Models.π Read the full tutorial: βοΈ Author: @cosmo3769 .#AI #VisionLanguageModels #SyntheticData #VQA #BLIP #PaliGemma #MachineLearning #HuggingFace #OpenSourceAI
2
3
7
New tutorial! π.Fine-tuning @huggingface SmolVLM with Direct Preference Optimization π.π§ Just pure alignment using human preferences!.Built with LoRA + PEFT + TRL.π Author: @puneet2k.#SmolVLM #DPO #AIAlignment #HuggingFace #RLHF #LoRA #Transformers.
pyimagesearch.com
Fine tune SmolVLM using DPO β maximize human alignment with streamlined, effective preference optimization.
0
0
5
π€ Master Vision-Language Models (VLMs).π§ Chat with documents.πΉ Generate Video Highlight Reels .π Turn sketches into HTML/CSS.
0
0
1
Want your tactical tools seen on national TVβnot just social?.Our tactical shows air on cable, satellite & streaming across the U.S. Make customers see your brand like never before. See how it works.
0
0
1
π "Build Your Own AI Vision Bot" Kickstarter launches Friday!. β‘ Early Bird = 50% off. π Join the pre-launch list now:. #AI #VLM #Kickstarter
1
0
6
πΆοΈ Skip the backend, run object detection fully in your browser!.π§ Powered by @onnxai and @onnxruntime + @ultralytics YOLOv8 magic.π οΈ Built with @nextjs + @tailwindcss, real-time inference with WebAssembly!.
0
1
1
New tutorial! π.Run YOLO in the Browser with ONNX, WebAssembly, and Next.js.πΆοΈ Skip the backend, run object detection fully in your browser!.π.Author: Vikram Singh.#YOLO #ONNX #WebAssembly #ComputerVision #Nextjs #OpenCVjs #BrowserAI #MLinJS.
pyimagesearch.com
Run YOLO object detection models directly in the browser using ONNX, WebAssembly, and Next.js β no server or GPU needed. Fast, private, and interactive.
1
2
2
π³οΈ Build an AI that doesnβt just find potholes β it tells you how bad they are.π€ Fine-tuned with @ultralytics YOLOv12 and powered by @roboflow .π¦ Deployed with @Docker + interactive @Gradio app for both image & video inputs.
0
0
1
New tutorial! π§.Training YOLOv12 for Detecting Pothole Severity Using a Custom Dataset.π.Author: Vikram Singh.#ComputerVision #YOLOv12 #PotholeDetection #Gradio #Docker #Ultralytics #Roboflow #AI4Infrastructure.
pyimagesearch.com
Train YOLOv12 to detect and classify potholes by severity. Run it anywhere using a Gradio interface for image and video input.
1
2
6
π― Learn how to count people in and out of a space using just a video feed and some Python magic.π§ Powered by the attention-equipped YOLOv12 model from @ultralytics .π οΈ Built using OpenCV (@OpenCVUniverse) + a custom Centroid Tracker (lightweight and perfect for real-time apps).
0
0
0
New tutorial! πΆβοΈπΆβοΈπ.People Tracker with YOLOv12 and Centroid Tracker.π.Author: Vikram Singh.#YOLOv12 #ComputerVision #PeopleCounting #Python #OpenCV #Ultralytics #ObjectTracking.
pyimagesearch.com
Learn how to monitor a real-time people tracker with YOLOv12 and Centroid Tracker for efficient entry and exit counting.
1
1
2
π€― Say goodbye to CNNs β YOLOv12 just added attention and didn't break a sweat!.π§ Powered by Area Attention + FlashAttention, this model is fast, smart, and ready for real-time action. βοΈ Built with @ultralytics, optimized for @nvidia A100, and ready to @Gradio your socks off.
0
0
3
π¨ New tutorial drop! π¨.Breaking the CNN Mold: YOLOv12 Brings Attention to Real-Time Object Detection.π.Author: Vikram Singh.#YOLOv12 #ObjectDetection #FlashAttention #Ultralytics #ComputerVision #RealTimeAI #MachineLearning.
pyimagesearch.com
Discover how YOLOv12 breaks free from CNNs by integrating attention for real-time object detection, achieving top accuracy without sacrificing speed.
1
2
8
@cosmo3769 ποΈ Turn long videos into punchy highlight reels β automatically!.π§ Powered by the compact SmolVLM2 model from @huggingface .βοΈ Built with @Gradio, FFmpeg, and a splash of FlashAttention magic π₯.
0
0
0
New tutorial! π¬β¨.Generating Video Highlights Using the SmolVLM2 Model.ποΈ Turn long videos into punchy highlight reelsβautomatically!..Author: @cosmo3769 .#VideoAI #SmolVLM2 #Gradio #VisionLanguageModel #HuggingFace #AItools #MultimodalAI #PyImageSearch
1
0
8