LearnOpenCV

https://github.com/spmallick/learnopencv?tab=readme-ov-file
This repository contains code for Computer Vision, Deep learning, and AI research articles shared on our blog LearnOpenCV.com.
Contains more than 400 projects!
Such as:
Pasted image 20250312223612.png|300
The full list of topics available at the moment:
OmniParser: Vision Based GUI Agent
Fine-Tuning-YOLOv12-Comparison-With-YOLOv11-And-YOLOv7-Based-Darknet
FineTuning RetinaNet for Wildlife Detection with PyTorch: A Step-by-Step Tutorial
DUSt3R: Geometric 3D Vision Made Easy :  Explanation and Results
YOLOv12: Attention Meets Speed
Video Generation: A Diffusion based approach
Agentic AI: A Comprehensive Introduction
Finetuning SAM2 for Leaf Disease Segmentation
Object Insertion in Gaussian Splatting: Paper Explained and Training Code for MCMC and Bilateral Grid
Depth Pro: Sharp Monocular Metric Depth
Fine-tuning-Stable-Diffusion-3_5-UI-images
SimSiam: Streamlining SSL with Stop-Gradient Mechanism
Image Captioning using ResNet and LSTM
Molmo VLM: Paper Explanation and Demo
3D Gaussian Splatting Paper Explanation: Training Custom Datasets with NeRF-Studio Gsplats
FLUX Image Generation: Experimenting with the Parameters
Contrastive-Learning-SimCLR-and-BYOL(With Code Example)
The Annotated NeRF : Training on Custom Dataset from Scratch in Pytorch
Stable Diffusion 3 and 3.5: Paper Explanation and Inference
LightRAG - Legal Document Analysis
NVIDIA AI Summit 2024 – India Overview
Introduction to Speech to Speech: Most Efficient Form of NLP
Training 3D U-Net for Brain Tumor Segmentation (BraTS-GLI)
DETR: Overview and Inference
YOLO11: Faster Than You Can Imagine!
Exploring DINO: Self-Supervised Transformers for Road Segmentation with ResNet50 and U-Net
Sapiens: Foundation for Human Vision Models by Meta
Multimodal RAG with ColPali and Gemini
Building Autonomous Vehicle in Carla: Path Following with PID Control & ROS 2
Handwritten Text Recognition using OCR
Training CLIP from Scratch for Image Retrieval
Introduction to LiDAR SLAM: LOAM and LeGO-LOAM Paper and Code Explanation with ROS 2 Implementation
Recommendation System using Vector Search
Fine Tuning Whisper on Custom Dataset
SAM 2 – Promptable Segmentation for Images and Videos
Introduction to Feature Matching Using Neural Networks
Introduction to ROS2 (Robot Operating System 2): Tutorial on ROS2 Working, DDS, ROS1 RMW, Topics, Nodes, Publisher, Subscriber in Python
CVPR 2024 Research Papers - Part- 2
CVPR 2024: An Overview and Key Papers
Object Detection on Edge Device - OAK-D-Lite
Fine-Tuning YOLOv10 Models on Custom Dataset
ROS2 and Carla Setup Guide for Ubuntu 22.04
Understanding Visual SLAM for Robotics Perception: Building Monocular SLAM from Scratch in Python
Enhancing Image Segmentation using U2-Net: An Approach to Efficient Background Removal
YOLOv10: The Dual-Head OG of YOLO Series
Fine-tuning Faster R-CNN on Sea Rescue Dataset
Mastering Recommendation System: A Complete Guide
Automatic Speech Recognition with Diarization : Speech-to-Text
Building MobileViT Image Classification Model from Scratch In Keras 3
SDXL Inpainting: Fusing Image Inpainting with Stable Diffusion
YOLOv9 Instance Segmentation on Medical Dataset
A Comprehensive Guide to Robotics
Integrating Gradio with OpenCV DNN
Fine-Tuning YOLOv9 on Custom Dataset
Dreambooth using Diffusers
Introduction to Hugging Face Diffusers
Introduction to Ultralytics Explorer API
YOLOv9: Advancing the YOLO Legacy
Fine-Tuning LLMs using PEFT
Depth Anything: Accelerating Monocular Depth Perception
Deciphering LLMs: From Transformers to Quantization
YOLO Loss Function Part 2: GFL and VFL Loss
YOLOv8-Object-Tracking-and-Counting-with-OpenCV
Stereo Vision in ADAS: Pioneering Depth Perception Beyond LiDAR
YOLO Loss Function Part 1: SIoU and Focal Loss
Moving Object Detection with OpenCV
Integrating ADAS with Keypoint Feature Pyramid Network for 3D LiDAR Object Detection
Mastering All YOLO Models from YOLOv1 to YOLO-NAS: Papers Explained (2024)
GradCAM: Enhancing Neural Network Interpretability in the Realm of Explainable AI
Text Summarization using T5: Fine-Tuning and Building Gradio App
3D LiDAR Visualization using Open3D: A Case Study on 2D KITTI Depth Frames for Autonomous Driving
Fine Tuning T5: Text2Text Transfer Transformer for Building a Stack Overflow Tag Generator
SegFormer 🤗 : Fine-Tuning for Improved Lane Detection in Autonomous Vehicles
Fine-Tuning BERT using Hugging Face Transformers
YOLO-NAS Pose
BERT: Bidirectional Encoder Representations from Transformers
Comparing KerasCV YOLOv8 Models on the Global Wheat Data 2020
Top 5 AI papers of September 2023
Empowering Drivers: The Rise and Role of Advanced Driver Assistance Systems
Semantic Segmentation using KerasCV DeepLabv3+
Object Detection using KerasCV YOLOv8
Fine-tuning YOLOv8 Pose Models for Animal Pose Estimation
Top 5 AI papers of August 2023
Fine Tuning TrOCR - Training TrOCR to Recognize Curved Text
TrOCR - Getting Started with Transformer Based OCR
Facial Emotion Recognition
Object Keypoint Similarity in Keypoint Detection
Real Time Deep SORT with Torchvision Detectors
Top 5 AI papers of July 2023
Medical Image Segmentation
Weighted Boxes Fusion in Object Detection: A Comparison with Non-Maximum Suppression
Medical Multi-label Classification with PyTorch & Lightning
Getting Started with PaddlePaddle: Exploring Object Detection, Segmentation, and Keypoints
Drone Programming With Computer Vision A Beginners Guide
How to Build a Pip Installable Package & Upload to PyPi
IoU Loss Functions for Faster & More Accurate Object Detection
Exploring Slicing Aided Hyper Inference for Small Object Detection
Advancements in Face Recognition Models, Toolkit and Datasets
Train YOLO NAS on Custom Dataset
Train YOLOv8 Instance Segmentation on Custom Data
YOLO-NAS: New Object Detection Model Beats YOLOv6 & YOLOv8
Segment Anything – A Foundation Model for Image Segmentation
Build a Video to Slides Converter Application using the Power of Background Estimation and Frame Differencing in OpenCV
A Closer Look at CVAT: Perfecting Your Annotations
ControlNet - Achieving Superior Image Generation Results
InstructPix2Pix - Edit Images With Prompts
NVIDIA Spring GTC 2023 Day 4: Ending on a High Note with Top Moments from the Finale!
NVIDIA Spring GTC 2023 Day 3: Digging deeper into Deep Learning, Semiconductors & more!
NVIDIA Spring GTC 2023 Day 2: Jensen’s keynote & the iPhone moment of AI is here!
NVIDIA Spring GTC 2023 Day 1: Welcome to the future!
NVIDIA GTC Spring 2023 Curtain Raiser
Stable Diffusion - A New Paradigm in Generative AI
OpenCV Face Recognition – Does Face Recognition Work on AI-Generated Images?
An In-Depth Guide to Denoising Diffusion Probabilistic Models – From Theory to Implementation
From Pixels to Paintings: The Rise of Midjourney AI Art
Mastering DALL·E 2: A Breakthrough in AI Art Generation
Top 10 AI Art Generation Tools using Diffusion Models
The Future of Image Recognition is Here: PyTorch Vision Transformer
Understanding Attention Mechanism in Transformer Neural Networks
Deploying a Deep Learning Model using Hugging Face Spaces and Gradio
Train YOLOv8 on Custom Dataset – A Complete Tutorial
Introduction to Diffusion Models for Image Generation
Building An Automated Image Annotation Tool: PyOpenAnnotate
Ultralytics YOLOv8: State-of-the-Art YOLO Models
Getting Started with YOLOv5 Instance Segmentation
The Ultimate Guide To DeepLabv3 - With PyTorch Inference
AI Fitness Trainer using MediaPipe: Squats Analysis
YoloR - Paper Explanation & Inference - An In-Depth Analysis
Roadmap To an Automated Image Annotation Tool Using Python
Performance Comparison of YOLO Object Detection Models – An Intensive Study
FCOS - Anchor Free Object Detection Explained
YOLOv6 Custom Dataset Training – Underwater Trash Detection
What is EXIF Data in Images?
t-SNE: T-Distributed Stochastic Neighbor Embedding Explained
CenterNet: Objects as Points – Anchor-free Object Detection Explained
YOLOv7 Pose vs MediaPipe in Human Pose Estimation
YOLOv6 Object Detection – Paper Explanation and Inference
YOLOX Object Detector Paper Explanation and Custom Training
Driver Drowsiness Detection Using Mediapipe In Python
GTC 2022 Big Bang AI announcements: Everything you need to know
NVIDIA GTC 2022 : The most important AI event this Fall
Object Tracking and Reidentification with FairMOT
What is Face Detection? – The Ultimate Guide for 2022
Document Scanner: Custom Semantic Segmentation using PyTorch-DeepLabV3
Fine Tuning YOLOv7 on Custom Dataset
Center Stage for Zoom Calls using MediaPipe
Mean Average Precision (mAP) in Object Detection
YOLOv7 Object Detection Paper Explanation and Inference
Pothole Detection using YOLOv4 and Darknet
Automatic Document Scanner using OpenCV
Demystifying GPU architectures for deep learning: Part 2
Demystifying GPU Architectures For Deep Learning
Intersection-over-Union(IoU)-in-Object-Detection-and-Segmentation
Understanding Multiple Object Tracking using DeepSORT
Optical Character Recognition using PaddleOCR
Gesture Control in Zoom Call using Mediapipe
A Deep Dive into Tensorflow Model Optimization
DepthAI Pipeline Overview: Creating a Complex Pipeline
TensorFlow Lite Model Maker: Create Models for On-Device Machine Learning
TensorFlow Lite: Model Optimization for On Device Machine Learning
Object detection with depth measurement using pre-trained models with OAK-D
Custom Object Detection Training using YOLOv5
Object Detection using Yolov5 and OpenCV DNN (C++/Python)
Create Snapchat/Instagram filters using Mediapipe
AUTOSAR C++ compliant deep learning inference with TensorRT
NVIDIA GTC 2022 Day 4 Highlights: Meet the new Jetson Orin
NVIDIA GTC 2022 Day 3 Highlights: Deep Dive into Hopper architecture
NVIDIA GTC 2022 Day 2 Highlights: Jensen’s Keynote
NVIDIA GTC 2022 Day 1 Highlights: Brilliant Start
Automatic License Plate Recognition using Python
Building a Poor Body Posture Detection and Alert System using MediaPipe
Introduction to MediaPipe
Disparity Estimation using Deep Learning
How to build Chrome Dino game bot using OpenCV Feature Matching
Top 10 Sources to Find Computer Vision and AI Models
Multi-Attribute and Graph-based Object Detection
Plastic Waste Detection with Deep Learning
Ensemble Deep Learning-based Defect Classification and Detection in SEM Images
Building Industrial embedded deep learning inference pipelines with TensorRT
Transfer Learning for Medical Images
Stereo Vision and Depth Estimation using OpenCV AI Kit
Introduction to OpenCV AI Kit and DepthAI
WeChat QR Code Scanner in OpenCV
AI behind the Diwali 2021 ‘Not just a Cadbury ad’
Model Selection and Benchmarking with Modelplace.AI
Real-time style transfer in a zoom meeting
Introduction to OpenVino Deep Learning Workbench
Running OpenVino Models on Intel Integrated GPU
Post Training Quantization with OpenVino Toolkit
Introduction to Intel OpenVINO Toolkit
Human Action Recognition using Detectron2 and LSTM
Pix2Pix:Image-to-Image Translation in PyTorch & TensorFlow
Conditional GAN (cGAN) in PyTorch and TensorFlow
Deep Convolutional GAN in PyTorch and TensorFlow
Introduction to Generative Adversarial Networks (GANs)
Human Pose Estimation using Keypoint RCNN in PyTorch
Non Maximum Suppression: Theory and Implementation in PyTorch
MRNet – The Multi-Task Approach
Generative and Discriminative Models
Playing Chrome's T-Rex Game with Facial Gestures
Variational Autoencoder in TensorFlow
Autoencoder in TensorFlow 2: Beginner’s Guide
Deep Learning with OpenCV DNN Module: A Definitive Guide
Depth perception using stereo camera (Python/C++)
Contour Detection using OpenCV (Python/C++)
Super Resolution in OpenCV
Improving Illumination in Night Time Images
Video Classification and Human Activity Recognition
How to use OpenCV DNN Module with Nvidia GPU on Windows
How to use OpenCV DNN Module with NVIDIA GPUs
Code OpenCV in Visual Studio
Install OpenCV on Windows – C++ / Python
Face Recognition with ArcFace
Background Subtraction with OpenCV and BGS Libraries
RAFT: Optical Flow estimation using Deep Learning
Making A Low-Cost Stereo Camera Using OpenCV
Optical Flow in OpenCV (C++/Python)
Introduction to Epipolar Geometry and Stereo Vision
Classification With Localization: Convert any keras Classifier to a Detector
Photoshop Filters in OpenCV
Tetris Game using OpenCV Python
Image Classification with OpenCV for Android
Image Classification with OpenCV Java
PyTorch to Tensorflow Model Conversion
Snake Game with OpenCV Python
Stanford MRNet Challenge: Classifying Knee MRIs
Experiment Logging with TensorBoard and wandb
Understanding Lens Distortion
Image Matting with state-of-the-art Method “F, B, Alpha Matting”
Bag Of Tricks For Image Classification - Let's check if it is working or not
Getting Started with OpenCV CUDA Module
Training a Custom Object Detector with DLIB & Making Gesture Controlled Applications
How To Run Inference Using TensorRT C++ API
Using Facial Landmarks for Overlaying Faces with Medical Masks
Tensorboard with PyTorch Lightning
Otsu's Thresholding with OpenCV
PyTorch-to-CoreML-model-conversion
Playing Rock, Paper, Scissors with AI
CNN Receptive Field Computation Using Backprop with TensorFlow
CNN Fully Convolutional Image Classification with TensorFlow
How to convert a model from PyTorch to TensorRT and speed up inference
Efficient image loading
Graph Convolutional Networks: Model Relations In Data
Getting Started with Federated Learning with PyTorch and PySyft
Creating a Virtual Pen & Eraser
Getting Started with PyTorch Lightning
Multi-Label Image Classification with PyTorch: Image Tagging
Funny Mirrors Using OpenCV
t-SNE for ResNet feature visualization
Multi-Label Image Classification with Pytorch
CNN Receptive Field Computation Using Backprop
Augmented Reality using AruCo Markers in OpenCV(C++ and Python)
Fully Convolutional Image Classification on Arbitrary Sized Image
Camera Calibration using OpenCV
Geometry of Image Formation
Ensuring Training Reproducibility in Pytorch
Gaze Tracking
Simple Background Estimation in Videos Using OpenCV
Applications of Foreground-Background separation with Semantic Segmentation
EfficientNet: Theory + Code
PyTorch for Beginners: Mask R-CNN Instance Segmentation with PyTorch
PyTorch for Beginners: Faster R-CNN Object Detection with PyTorch
PyTorch for Beginners: Semantic Segmentation using torchvision
PyTorch for Beginners: Comparison of pre-trained models for Image Classification
PyTorch for Beginners: Basics
PyTorch Model Inference using ONNX and Caffe2
Image Classification Using Transfer Learning in PyTorch
Hangman: Creating games in OpenCV
Image Inpainting with OpenCV (C++/Python)
Hough Transform with OpenCV (C++/Python)
Xeus-Cling: Run C++ code in Jupyter Notebook
Gender & Age Classification using OpenCV Deep Learning ( C++/Python )
Invisibility Cloak using Color Detection and Segmentation with OpenCV
Fast Image Downloader for Open Images V4 (Python)
Deep Learning based Text Detection Using OpenCV (C++/Python)
Video Stabilization Using Point Feature Matching in OpenCV
Training YOLOv3 : Deep Learning based Custom Object Detector
Using OpenVINO with OpenCV
Duplicate Search on Quora Dataset
Shape Matching using Hu Moments (C++/Python)
Install OpenCV 4 on CentOS (C++ and Python)
Install OpenCV 3.4.4 on CentOS (C++ and Python)
Install OpenCV 3.4.4 on Red Hat (C++ and Python)
Install OpenCV 4 on Red Hat (C++ and Python)
Install OpenCV 4 on macOS (C++ and Python)
Install OpenCV 3.4.4 on Raspberry Pi
Install OpenCV 3.4.4 on macOS (C++ and Python)
OpenCV QR Code Scanner (C++ and Python)
Install OpenCV 3.4.4 on Windows (C++ and Python)
Install OpenCV 3.4.4 on Ubuntu 16.04 (C++ and Python)
Install OpenCV 3.4.4 on Ubuntu 18.04 (C++ and Python)
Universal Sentence Encoder
Install OpenCV 4 on Raspberry Pi
Install OpenCV 4 on Windows (C++ and Python)
Face Detection – Dlib, OpenCV, and Deep Learning ( C++ / Python )
Hand Keypoint Detection using Deep Learning and OpenCV
Deep learning based Object Detection and Instance Segmentation using Mask R-CNN in OpenCV (Python / C++)
Install OpenCV 4 on Ubuntu 18.04 (C++ and Python)
Install OpenCV 4 on Ubuntu 16.04 (C++ and Python)
Multi-Person Pose Estimation in OpenCV using OpenPose
Heatmap for Logo Detection using OpenCV (Python)
Deep Learning based Object Detection using YOLOv3 with OpenCV ( Python / C++ )
Convex Hull using OpenCV in Python and C++
MultiTracker : Multiple Object Tracking using OpenCV (C++/Python)
Convolutional Neural Network based Image Colorization using OpenCV
SVM using scikit-learn
GOTURN: Deep Learning based Object Tracking
Find the Center of a Blob (Centroid) using OpenCV (C++/Python)
Support Vector Machines (SVM)
Batch Normalization in Deep Networks
Deep Learning based Character Classification using Synthetic Dataset
Image Quality Assessment : BRISQUE
Understanding AlexNet
Deep Learning based Text Recognition (OCR) using Tesseract and OpenCV
Deep Learning based Human Pose Estimation using OpenCV ( C++ / Python )
Number of Parameters and Tensor Sizes in a Convolutional Neural Network (CNN)
How to convert your OpenCV C++ code into a Python module
CV4Faces : Best Project Award 2018
Facemark : Facial Landmark Detection using OpenCV
Image Alignment (Feature Based) using OpenCV (C++/Python)
Barcode and QR code Scanner using ZBar and OpenCV
Keras Tutorial : Fine-tuning using pre-trained models
OpenCV Transparent API
Face Reconstruction using EigenFaces (C++/Python)
Eigenface using OpenCV (C++/Python)
Principal Component Analysis
Keras Tutorial : Transfer Learning using pre-trained models
Keras Tutorial : Using pre-trained Imagenet models
Technical Aspects of a Digital SLR
Using Harry Potter interactive wand with OpenCV to create magic
Install OpenCV 3 and Dlib on Windows ( Python only )
Image Classification using Convolutional Neural Networks in Keras
Understanding Autoencoders using Tensorflow (Python)
Best Project Award : Computer Vision for Faces
Understanding Activation Functions in Deep Learning
Image Classification using Feedforward Neural Network in Keras
Exposure Fusion using OpenCV (C++/Python)
Understanding Feedforward Neural Networks
High Dynamic Range (HDR) Imaging using OpenCV (C++/Python)
Deep learning using Keras – The Basics
Selective Search for Object Detection (C++ / Python)
Installing Deep Learning Frameworks on Ubuntu with CUDA support
Parallel Pixel Access in OpenCV using forEach
cvui: A GUI lib built on top of OpenCV drawing primitives
Install Dlib on Windows
Install Dlib on Ubuntu
Install OpenCV3 on Ubuntu
Read, Write and Display a video using OpenCV ( C++/ Python )
Install Dlib on MacOS
Install OpenCV 3 on MacOS
Install OpenCV 3 on Windows
Get OpenCV Build Information ( getBuildInformation )
Color spaces in OpenCV (C++ / Python)
Neural Networks : A 30,000 Feet View for Beginners
Alpha Blending using OpenCV (C++ / Python)
User stories : How readers of this blog are applying their knowledge to build applications
How to select a bounding box ( ROI ) in OpenCV (C++/Python) ?
Automatic Red Eye Remover using OpenCV (C++ / Python)
Bias-Variance Tradeoff in Machine Learning
Embedded Computer Vision: Which device should you choose?
Object Tracking using OpenCV (C++/Python)
Handwritten Digits Classification : An OpenCV ( C++ / Python ) Tutorial
Training a better Haar and LBP cascade based Eye Detector using OpenCV
Deep Learning Book Gift Recipients
Minified OpenCV Haar and LBP Cascades
Deep Learning Book Gift
Histogram of Oriented Gradients
Image Recognition and Object Detection : Part 1
Head Pose Estimation using OpenCV and Dlib
Live CV : A Computer Vision Coding Application
Approximate Focal Length for Webcams and Cell Phone Cameras
Configuring Qt for OpenCV on OSX
Rotation Matrix To Euler Angles
Speeding up Dlib’s Facial Landmark Detector
Warp one triangle to another using OpenCV ( C++ / Python )
Average Face : OpenCV ( C++ / Python ) Tutorial
Face Swap using OpenCV ( C++ / Python )
Face Morph Using OpenCV — C++ / Python
Deep Learning Example using NVIDIA DIGITS 3 on EC2
NVIDIA DIGITS 3 on EC2
Homography Examples using OpenCV ( Python / C ++ )
Filling holes in an image using OpenCV ( Python / C++ )
How to find frame rate or frames per second (fps) in OpenCV ( Python / C++ ) ?
Delaunay Triangulation and Voronoi Diagram using OpenCV ( C++ / Python)
OpenCV (C++ vs Python) vs MATLAB for Computer Vision
Facial Landmark Detection
Why does OpenCV use BGR color format ?
Computer Vision for Predicting Facial Attractiveness
applyColorMap for pseudocoloring in OpenCV ( C++ / Python )
Image Alignment (ECC) in OpenCV ( C++ / Python )
How to find OpenCV version in Python and C++ ?
Baidu banned from ILSVRC 2015
How Computer Vision Solved the Greatest Soccer Mystery of All Time
Embedded Vision Summit 2015
Read an Image in OpenCV ( Python, C++ )
Non-Photorealistic Rendering using OpenCV ( Python, C++ )
Seamless Cloning using OpenCV ( Python , C++ )
OpenCV Threshold ( Python , C++ )
Blob Detection Using OpenCV ( Python, C++ )
Turn your OpenCV Code into a Web API in under 10 minutes — Part 1
How to compile OpenCV sample Code ?
Install OpenCV 3 on Yosemite ( OSX 10.10.x )
P.S. I got this list by asking the AI assistant in the Google Chrome DevTools console instead of writing jQuery!
Pasted image 20250312224945.png|1000
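For anyone who would rather script the extraction than use the browser console, here is a minimal Python sketch. It assumes the README lists each post as a markdown link in the first column of a table, which may not match the real file exactly. The function name `extract_titles` and the `example.com` URLs in the sample are made up for illustration.

```python
import re

# Assumption: each table row starts with a markdown link, e.g.
# | [Post title](post-url) | [Code](code-url) |
ROW_LINK = re.compile(r"^\|\s*\[([^\]]+)\]\(([^)]+)\)")


def extract_titles(readme_text: str) -> list[str]:
    """Return the link text from the first column of every table row."""
    titles = []
    for line in readme_text.splitlines():
        match = ROW_LINK.match(line.strip())
        if match:
            titles.append(match.group(1).strip())
    return titles


# Hypothetical sample that mimics the assumed README layout.
sample = """\
| Blog Post | Code |
| --- | --- |
| [YOLOv12: Attention Meets Speed](https://example.com/yolov12) | [Code](https://example.com/a) |
| [DETR: Overview and Inference](https://example.com/detr) | [Code](https://example.com/b) |
"""

print(extract_titles(sample))
```

The header and separator rows are skipped because they do not start with a link. To run this against the real README, read the downloaded file into a string and pass it to `extract_titles`.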
#courses #repositories #lectures #code #learn #github #openCV #deepLearning #AI