Repo symbol

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp
Repo symbol

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp
Repo symbol

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp
Repo symbol

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp
Repo symbol

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp carla_msgs pcl_conversions pcl_ros perception_pcl vehicle_ctrl slam lego_loam_sr cloud_msgs

Repository Summary

Description Learn OpenCV : C++ and Python Examples
Checkout URI https://github.com/spmallick/learnopencv.git
VCS Type git
VCS Version master
Last Updated 2025-07-30
Dev Status UNKNOWN
Released UNRELEASED
Tags opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp
Contributing Help Wanted (-)
Good First Issues (-)
Pull Requests to Review (-)

Packages

Name Version
carla_msgs 1.3.0
pcl_conversions 2.6.1
pcl_ros 2.6.1
perception_pcl 2.6.1
vehicle_ctrl 0.0.0
slam 0.0.0
lego_loam_sr 1.0.0
cloud_msgs 1.0.0

README

LearnOpenCV

This repository contains code for Computer Vision, Deep learning, and AI research articles shared on our blog LearnOpenCV.com.

Want to become an expert in AI? AI Courses by OpenCV is a great place to start.

</a>

List of Blog Posts

Blog Post Code
LangGraph: Building Self-Correcting RAG Agent for Code Generation Code
Inside Sinusoidal Position Embeddings: A Sense of Order Code
Inside RoPE: Rotary Magic into Position Embeddings Code
SimLingo-Vision-Language-Action-Model-for-Autonomous-Driving Code
FineTuning Gemma 3n for Medical VQA on ROCOv2 Code
SmolLM3 Blueprint: SOTA 3B-Parameter LLM  
LangGraph-A-Visual-Automation-and-Summarization-Pipeline Code
Fine-Tuning AnomalyCLIP: Class-Agnostic Zero-Shot Anomaly Detection Code
SigLIP 2: DeepMind’s Multilingual Vision-Language Model  
MedGemma: Google’s Medico VLM for Clinical QA, Imaging, and More Code
Nanonets-OCR-s: Enabling Rich, Structured Markdown for Document Understanding  
Optimizing VJEPA-2: Tackling Latency & Context in Real-Time Video Classification Scripts Code
V-JEPA 2: Meta’s Breakthrough in AI for the Physical World Code
NVIDIA Cosmos Reason1: Video Understanding Code
GR00T N1.5 Explained  
LLaVA Code
SmolVLA: Affordable & Efficient VLA Robotics on Consumer GPUs Code
Fine-Tuning Grounding DINO: Open-Vocabulary Object Detection Code
Getting Started with Qwen3 – The Thinking Expert Code
Inside the GPU: A Comprehensive Guide to Modern Graphics Architecture  
Distributed Parallel Training: PyTorch Code
MONAI: The Definitive Framework for Medical Imaging Powered by PyTorch  
SANA-Sprint: The One-Step Revolution in High-Quality AI Image Synthesis  
FramePack-Video-Diffusion-but-feels-like-Image-Diffusion Code
Model Weights File Formats in Machine Learning  
Unsloth: A Guide from Basics to Fine-Tuning Vision Models Code
Iterative Closest Point (ICP) Algorithm Explained Code
MedSAM2 Explained: One Prompt to Segment Anything in Medical Imaging Code
Batch Normalization and Dropout as Regularizers  
DINOv2_by_Meta_A_Self-Supervised_foundational_vision_model Code
Beginner’s Guide to Embedding Models  
MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors Code
Google’s A2A Protocol  
Nvidia SANA : Faster Image Generation  
Fine-tuning RF-DETR Code
Qwen2.5-Omni: A Real-Time Multimodal AI  
Vision Language Action Models: Robotic Control Code
Fine-Tuning Gemma 3 VLM using QLoRA for LaTeX-OCR Dataset Code
ComfyUI Code
Gemma-3: A Comprehensive Introduction  
YOLO11 on Raspberry Pi: Optimizing Object Detection for Edge Devices Code
VGGT: Visual Geometry Grounded Transformer – For Dense 3D Reconstruction Code
DDIM: The Faster, Improved Version of DDPM for Efficient AI Image Generation Code
Introduction to Model Context Protocol (MCP)  
MASt3R and MASt3R-SfM Explanation: Image Matching and 3D Reconstruction Code
MatAnyone Explained: Consistent Memory for Better Video Matting Code
GraphRAG: For Medical Document Analysis Code
OmniParser: Vision Based GUI Agent  
Fine-Tuning-YOLOv12-Comparison-With-YOLOv11-And-YOLOv7-Based-Darknet Code
FineTuning RetinaNet for Wildlife Detection with PyTorch: A Step-by-Step Tutorial Code
DUSt3R: Geometric 3D Vision Made Easy : Explanation and Results Code
YOLOv12: Attention Meets Speed Code
Video Generation: A Diffusion based approach Code
Agentic AI: A Comprehensive Introduction Code
Finetuning SAM2 for Leaf Disease Segmentation Code
Object Insertion in Gaussian Splatting: Paper Explained and Training Code for MCMC and Bilateral Grid Code
Depth Pro: Sharp Monocular Metric Depth Code
Fine-tuning-Stable-Diffusion-3_5-UI-images Code
SimSiam: Streamlining SSL with Stop-Gradient Mechanism Code
Image Captioning using ResNet and LSTM Code
Molmo VLM: Paper Explanation and Demo Code
3D Gaussian Splatting Paper Explanation: Training Custom Datasets with NeRF-Studio Gsplats Code
FLUX Image Generation: Experimenting with the Parameters Code
Contrastive-Learning-SimCLR-and-BYOL(With Code Example) Code
The Annotated NeRF : Training on Custom Dataset from Scratch in Pytorch Code
Stable Diffusion 3 and 3.5: Paper Explanation and Inference Code
LightRAG - Legal Document Analysis Code
NVIDIA AI Summit 2024 – India Overview  
Introduction to Speech to Speech: Most Efficient Form of NLP Code
Training 3D U-Net for Brain Tumor Segmentation (BraTS-GLI) Code
DETR: Overview and Inference Code
YOLO11: Faster Than You Can Imagine! Code
Exploring DINO: Self-Supervised Transformers for Road Segmentation with ResNet50 and U-Net Code
Sapiens: Foundation for Human Vision Models by Meta Code
Multimodal RAG with ColPali and Gemini Code
Building Autonomous Vehicle in Carla: Path Following with PID Control & ROS 2 Code
Handwritten Text Recognition using OCR Code
Training CLIP from Sratch for Image Retrieval Code
Introduction to LiDAR SLAM: LOAM and LeGO-LOAM Paper and Code Explanation with ROS 2 Implementation Code
Recommendation System using Vector Search Code
Fine Tuning Whisper on Custom Dataset Code
SAM 2 – Promptable Segmentation for Images and Videos Code
Introduction to Feature Matching Using Neural Networks Code

File truncated at 100 lines see the full file

Repo symbol

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp
Repo symbol

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp
Repo symbol

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp
Repo symbol

learnopencv repository

opencv machine-learning deep-neural-networks ai computer-vision deep-learning deeplearning opencv-library opencv-python computervision opencv3 opencv-tutorial opencv-cpp