Repository navigation

controllable-image-captioning

Website
Wikipedia

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything

ChatGPT controllable-generation segment-anything controllable-image-captioning image-captioning

Python

1754

104

2 年前

bearcatt / LaBERT

A length-controllable and non-autoregressive image captioning model.

non-autoregressive controllable-image-captioning eccv2020 image-captioning

Python

4 年前

AnnikaLindh / Controllable_Region_Pointer_Advancement

PyTorch implementation of a Controllable Image Captioning model with a language-driven mechanism for advancing the region pointer state that keeps it in sync with the state of the language model. Code for the paper Language-Driven Region Pointer Advancement for Controllable Image Captioning (Lindh et al., 2020).

controllable-image-captioning PyTorch paper-implementations 深度学习 neural-networks natural-language-generation 机器视觉机器学习 multimodal-learning research Python image-captioning

Python

4 年前

AnnikaLindh / show-prefer-tell

Pipeline model for controllable image captioning with user preference settings. Code and model output for the paper Show, Prefer and Tell: Incorporating User Preferences into Image Captioning (Lindh et al., 2023).

assistive-technology controllable-image-captioning 深度学习 image-captioning 机器学习 Python research neural-networks

Python

3 个月前