mscoco

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python
14659
9 months ago

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Jupyter Notebook
1446
2 years ago
Python
531
4 years ago

This repository contains the source code of our work on designing efficient CNNs for computer vision

Python
412
9 months ago

VarifocalNet: An IoU-aware Dense Object Detector

Python
353
4 years ago

The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"

Python
274
2 years ago

Video Platform for Action Recognition and Object Detection in PyTorch

Python
221
3 years ago

Semantic Propositional Image Caption Evaluation

Java
140
2 years ago

High-resolution Networks for the Fully Convolutional One-Stage Object Detection (FCOS) algorithm

Python
125
5 years ago

Generate captions for images using a CNN-RNN model trained on the Microsoft Common Objects in COntext (MS COCO) dataset

Python
78
7 years ago

A TensorFlow implementation of MobileNetV3 CenterNet that can be easily deployed on Android (MNN) and iOS (Core ML).

Python
70
4 years ago

A tool for converting computer vision label formats.

Python
62
3 days ago

A repository and interchange format for weed identification annotation

Python
62
1 year ago

Adds the SPICE metric to the coco-caption evaluation server code

Jupyter Notebook
49
2 years ago

Implementation of models in our EMNLP 2019 paper: A Logic-Driven Framework for Consistency of Neural Models

Python
30
4 years ago