learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp

No version for distro humble. Known supported distros are highlighted in the buttons above.

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp

No version for distro jazzy. Known supported distros are highlighted in the buttons above.

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp

No version for distro kilted. Known supported distros are highlighted in the buttons above.

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp

No version for distro rolling. Known supported distros are highlighted in the buttons above.

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp carla_msgs pcl_conversions pcl_ros perception_pcl vehicle_ctrl slam lego_loam_sr cloud_msgs

Repository Summary

Description	Learn OpenCV : C++ and Python Examples
Checkout URI	https://github.com/spmallick/learnopencv.git
VCS Type	git
VCS Version	master
Last Updated	2025-12-09
Dev Status	UNKNOWN
Released	UNRELEASED
Tags	opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp
Contributing	Help Wanted (-) Good First Issues (-) Pull Requests to Review (-)

Packages

Name	Version
carla_msgs	1.3.0
pcl_conversions	2.6.1
pcl_ros	2.6.1
perception_pcl	2.6.1
vehicle_ctrl	0.0.0
slam	0.0.0
lego_loam_sr	1.0.0
cloud_msgs	1.0.0

README

LearnOpenCV

This repository contains code for Computer Vision, Deep learning, and AI research articles shared on our blog LearnOpenCV.com.

Want to become an expert in AI? AI Courses by OpenCV is a great place to start.

</a>

List of Blog Posts

Blog Post	Code
SAM 3D: Foundation Model for Single-Image 3D Reconstruction
SAM-3: What’s New, How It Works, and Why It Matters	Code
Image-GS: Adaptive Image Reconstruction using 2D Gaussians	Code
Ultimate Guide to Vector Databases and RAG Pipeline	Code
What Makes DeepSeek OCR So Powerful	Code
2D Gaussian Splatting: Geometrically Accurate Radiance Field Reconstruction	Code
TRM: Tiny Recursive Models	Code
Deploying ML Models on Arduino: From Blink to Think	Code
VideoRAG: Redefining Long-Context Video Comprehension
AI Agent in Action: Automating Desktop Tasks with VLMs	Code
Top VLM Evaluation Metrics for Optimal Performance Analysis	Code
Getting Started with VLM on Jetson Nano	Code
VLM on Edge: Worth the Hype or Just a Novelty?	Code
AnomalyCLIP : Harnessing CLIP for Weakly-Supervised Video Anomaly Recognition	Code
AI_for_Video_Understanding_From_Content_Moderation_to_Summarization	Code
Video-RAG: Training-Free Retrieval for Long-Video LVLMs	Code
Object Detection and Spatial Understanding with VLMs ft. Qwen2.5-VL	Code
LangGraph: Building Self-Correcting RAG Agent for Code Generation	Code
Inside Sinusoidal Position Embeddings: A Sense of Order	Code
Inside RoPE: Rotary Magic into Position Embeddings	Code
SimLingo-Vision-Language-Action-Model-for-Autonomous-Driving	Code
FineTuning Gemma 3n for Medical VQA on ROCOv2	Code
SmolLM3 Blueprint: SOTA 3B-Parameter LLM
LangGraph-A-Visual-Automation-and-Summarization-Pipeline	Code
Fine-Tuning AnomalyCLIP: Class-Agnostic Zero-Shot Anomaly Detection	Code
SigLIP 2: DeepMind’s Multilingual Vision-Language Model
MedGemma: Google’s Medico VLM for Clinical QA, Imaging, and More	Code
Nanonets-OCR-s: Enabling Rich, Structured Markdown for Document Understanding
Optimizing VJEPA-2: Tackling Latency & Context in Real-Time Video Classification Scripts	Code
V-JEPA 2: Meta’s Breakthrough in AI for the Physical World	Code
NVIDIA Cosmos Reason1: Video Understanding	Code
GR00T N1.5 Explained
LLaVA	Code
SmolVLA: Affordable & Efficient VLA Robotics on Consumer GPUs	Code
Fine-Tuning Grounding DINO: Open-Vocabulary Object Detection	Code
Getting Started with Qwen3 – The Thinking Expert	Code
Inside the GPU: A Comprehensive Guide to Modern Graphics Architecture
Distributed Parallel Training: PyTorch	Code
MONAI: The Definitive Framework for Medical Imaging Powered by PyTorch
SANA-Sprint: The One-Step Revolution in High-Quality AI Image Synthesis
FramePack-Video-Diffusion-but-feels-like-Image-Diffusion	Code
Model Weights File Formats in Machine Learning
Unsloth: A Guide from Basics to Fine-Tuning Vision Models	Code
Iterative Closest Point (ICP) Algorithm Explained	Code
MedSAM2 Explained: One Prompt to Segment Anything in Medical Imaging	Code
Batch Normalization and Dropout as Regularizers
DINOv2_by_Meta_A_Self-Supervised_foundational_vision_model	Code
Beginner’s Guide to Embedding Models
MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors	Code
Google’s A2A Protocol
Nvidia SANA : Faster Image Generation
Fine-tuning RF-DETR	Code
Qwen2.5-Omni: A Real-Time Multimodal AI
Vision Language Action Models: Robotic Control	Code
Fine-Tuning Gemma 3 VLM using QLoRA for LaTeX-OCR Dataset	Code
ComfyUI	Code
Gemma-3: A Comprehensive Introduction
YOLO11 on Raspberry Pi: Optimizing Object Detection for Edge Devices	Code
VGGT: Visual Geometry Grounded Transformer – For Dense 3D Reconstruction	Code
DDIM: The Faster, Improved Version of DDPM for Efficient AI Image Generation	Code
Introduction to Model Context Protocol (MCP)
MASt3R and MASt3R-SfM Explanation: Image Matching and 3D Reconstruction	Code
MatAnyone Explained: Consistent Memory for Better Video Matting	Code
GraphRAG: For Medical Document Analysis	Code
OmniParser: Vision Based GUI Agent
Fine-Tuning-YOLOv12-Comparison-With-YOLOv11-And-YOLOv7-Based-Darknet	Code
FineTuning RetinaNet for Wildlife Detection with PyTorch: A Step-by-Step Tutorial	Code
DUSt3R: Geometric 3D Vision Made Easy : Explanation and Results	Code
YOLOv12: Attention Meets Speed	Code
Video Generation: A Diffusion based approach	Code
Agentic AI: A Comprehensive Introduction	Code
Finetuning SAM2 for Leaf Disease Segmentation	Code
Object Insertion in Gaussian Splatting: Paper Explained and Training Code for MCMC and Bilateral Grid	Code
Depth Pro: Sharp Monocular Metric Depth	Code
Fine-tuning-Stable-Diffusion-3_5-UI-images	Code
SimSiam: Streamlining SSL with Stop-Gradient Mechanism	Code
Image Captioning using ResNet and LSTM	Code
Molmo VLM: Paper Explanation and Demo	Code
3D Gaussian Splatting Paper Explanation: Training Custom Datasets with NeRF-Studio Gsplats	Code
FLUX Image Generation: Experimenting with the Parameters	Code
Contrastive-Learning-SimCLR-and-BYOL(With Code Example)	Code
The Annotated NeRF : Training on Custom Dataset from Scratch in Pytorch	Code
Stable Diffusion 3 and 3.5: Paper Explanation and Inference	Code

File truncated at 100 lines see the full file

CONTRIBUTING

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp

No version for distro galactic. Known supported distros are highlighted in the buttons above.

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp

No version for distro iron. Known supported distros are highlighted in the buttons above.

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp

No version for distro melodic. Known supported distros are highlighted in the buttons above.

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp

No version for distro noetic. Known supported distros are highlighted in the buttons above.