Computer Vision
Computer vision libraries and models for image understanding, image generation, OCR, and object detection.
Repositories
ultralytics / yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
ultralytics / ultralytics
Ultralytics YOLO 🚀
facebookresearch / segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
ageitgey / face_recognition
The world's simplest facial recognition API for Python and the command line
tesseract-ocr / tesseract
Tesseract Open Source OCR Engine (main repository)
PaddlePaddle / PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
deepfakes / faceswap
Deepfakes Software For All
hacksider / Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
opencv / opencv
Open Source Computer Vision Library
CompVis / stable-diffusion
A latent text-to-image diffusion model
AUTOMATIC1111 / stable-diffusion-webui
Stable Diffusion web UI