Repository navigation
ade20k
- Website
- Wikipedia
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset
Official PyTorch implementation of SegFormer
CVNets: A library for training computer vision networks
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
Add bisenetv2. My implementation of BiSeNet
SOTA Semantic Segmentation Models in PyTorch
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
TensorFlow-based implementation of "ICNet for Real-Time Semantic Segmentation on High-Resolution Images".
TensorFlow-based implementation of "Pyramid Scene Parsing Network".
The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"
[NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation
Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network
Indoor segmentation for robot navigating, which is based on deeplab model in TensorFlow.
[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling
Semantic segmentation task for ADE20k & cityscapse dataset, based on several models.
Semantic Segmentation using Tensorflow on popular Datasets like Ade20k, Camvid, Coco, PascalVoc