
imagecaptioning


Multi-modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval, and image captioning.

multimodal multimodal-learning Python paddlepaddle PyTorch crossmodal-retrieval imagecaptioning classification

Python

476

2 years ago

guillaumegenthial / im2latex

Image to LaTeX (Seq2seq + Attention with Beam Search) - Tensorflow

seq2seq imagecaptioning beam-search Tensorflow im2latex

Python

462

128

5 years ago

luopeixiang / im2latex

PyTorch implementation of Deep CNN Encoder + LSTM Decoder with Attention for Image to LaTeX

im2latex seq2seq imagecaptioning PyTorch encoder-decoder-model

Python

197

2 years ago

HughKu / Im2txt

Ready-to-go image captioning inference: Show and Tell model compatible with TensorFlow r1.9

Tensorflow imagecaptioning Python

Python

3 years ago

Dong-JinKim / DenseRelationalCaptioning

Code of Dense Relational Captioning

Computer vision imagecaptioning torch dataset cvpr2019

Lua

2 years ago

qingzwang / GHA-ImageCaptioning

Code for GHA (ACCV2018)

imagecaptioning attention-mechanism

Python

7 years ago

MrAnayDongre / Machine-Learning-Collection

Repo for Implementing Research Papers & Projects related to Machine Learning

Deep neural networks Generative Adversarial Network object-detection yolo pytorch-implementation transformer imagecaptioning cnn-classification diffusion-models Machine learning time-series-analysis ChatGPT pytorch-lightning ollama rag

Python

6 months ago

Aryavir07 / Image-Captioning-Using-CNN-and-LSTM

Generating captions for images using CNN & LSTM on the Flickr8K dataset. The generation of captions from images has various practical benefits, such as aiding the visually impaired.

imagecaptioning API cnn lstm Machine learning Deep learning Artificial intelligence

Jupyter Notebook

4 years ago

Mountchicken / ImageCaptioning-Attention-PyQt5

ImageCaptioning improved with an attention mechanism. Also a PyQt5 application

attention pyqt5 imagecaptioning PyTorch

Python

4 years ago

MingtaoGuo / RNN-TensorFlow

Some interesting applications of RNNs, e.g. char-rnn (poem generation), seq2seq (machine translation), image captioning (NIC)

seq2seq imagecaptioning Tensorflow

Python

7 years ago

surajsoni5 / Context-Aware-Image-Captioning

An implementation of the paper "Context-aware Captions from Context-agnostic Supervision"

Computer vision Natural language processing imagecaptioning Project PyTorch encoder-decoder-model attention-mechanism

Python

6 years ago

AmritK10 / Image-Captioner-Web-App

A dockerised web-app to generate captions for uploaded images.

imagecaptioning lstm Web app image-captioning HTML Node.js Bootstrap Docker Docker Compose Flask MongoDB Machine learning captions

HTML

2 years ago

harishB97 / Im2Latex-TensorFlow-2

TensorFlow-2 implementation of Im2Latex deep learning model described in HarvardNLP paper "What You Get Is What You See: A Visual Markup Decompiler"

convolutional-neural-networks Deep learning encoder-decoder-model imagecaptioning recurrent-neural-networks Tensorflow tensorflow2

Jupyter Notebook

3 years ago

aartighatkesar / Deep-Learning-Fundamentals

Implementation of forward and backward propagation for various basic layers. CS 231n Stanford Spring 2018: Convolutional Neural Networks for Visual Recognition. Solutions to assignments.

Deep learning convolutional-neural-networks backpropagation recurrent-neural-networks imagecaptioning style-transfer Tensorflow Generative Adversarial Network transferlearning

Jupyter Notebook

3 years ago

J0SAL / Aide

An App with Voice Assisted Image Captioning and VQA For Visually Challenged Individuals

captioning Flutter imagecaptioning

Dart

3 years ago

Prajwal10031999 / Scene-Prediction-using-CNN-with-AlexNet

A CNN model to predict the scene or location from any given image

Deep learning alexnet Keras Tensorflow imagecaptioning Neural networks keras-tensorflow Python

Jupyter Notebook

5 years ago

Islam-hady9 / Generative-AI-Models

Generative AI Models is a comprehensive repository dedicated to the implementation of cutting-edge generative AI models using Python. It features various models, including those for image captioning and text-to-image generation, leveraging advanced architectures like Vision Transformers (ViT), GPT-2, and Stable Diffusion.

Computer vision Deep learning generativeai gpt-2 huggingface-transformers imagecaptioning Natural language processing PyTorch stablediffusion text-to-image-generation

Jupyter Notebook

1 year ago

Sh-31 / ImgCap

ImgCap is an image captioning model designed to automatically generate descriptive captions for images. It comes in two versions: a CNN + LSTM model and a CNN + LSTM model with an attention mechanism.

Deep learning imagecaptioning lstm resnet torch torchvision beam-search

Python

1 year ago