image2text

pix2tex: Using a ViT to convert images of equations into LaTeX code.

machine-learning transformer im2latex deep-learning image2text LaTeX dataset PyTorch im2markup OCR latex-ocr vit math-ocr vision-transformer image-processing Python im2text

Python

15759

1254

9 months ago

zai-org / GLM-V

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

image2text video-understanding vlm reasoning

Python

1679

13 days ago

OleehyO / TexTeller

TexTeller converts images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.

image2text latex-ocr

Python

600

1 month ago

prabhakar267 / image2text

📋 Python wrapper to grab text from images and save as text files using Tesseract Engine

tesseract optical-character-recognition OCR image2text tesseract-ocr

Python

410

141

2 months ago

wangleihitcs / Papers

Some computer vision papers I have read, covering image-to-text generation, weakly supervised segmentation, and more.

computer-vision natural-language-processing captions vqa image2text cvpr eccv iccv scene-text-detection-recognition

125

5 years ago

Hangover3832 / ComfyUI-Hangover-Nodes

Various nodes for ComfyUI

comfyui image2text stable-diffusion

Python

5 months ago

ekiim / vim-mathpix

Vim commands to use mathpix from your screen

Vim LaTeX image2text

Shell

1 year ago

yuanxiaosc / Image-Captioning

CNN-Encoder and RNN-Decoder (Bahdanau Attention) for image captioning (image to text) on the MS-COCO dataset.

image-captioning image2text tensorflow2 template-project Tensorflow

Jupyter Notebook

6 years ago

etosworld / etos-deepcut

Deep Extreme Cut (http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr): a tool for automatic object segmentation from extreme points.

segmentation object-segmentation deep-learning semantic-segmentation annotation pspnet image2text PyTorch

Python

5 years ago

JulioPeixoto / softrag

Minimal local-first multimodal RAG library powered by SQLite + sqlite-vec.

generative-ai large-language-models Open Source rag retrieval-augmented-generation SQL sqlite3 vector-database agent ChatGPT image2text natural-language-processing openai

Python

2 months ago

TheLime1 / CheatoMate

A collection of scripts to "help" you with your programming exams and assignments.

artificial-intelligence chat cheat cheating exam assignment image2text codebase

Python

2 years ago

MurageKabui / AutoIT-OCRSpace-UDF

An AutoIt 3 wrapper library around the OCRSpace API.

optical-character-recognition recognition image-processing image2text text2image OCR API Library devtools developer-tools

AutoIt

1 year ago

thefcraft / civitai-stable-diffusion-337k

Civitai Stable Diffusion 337k Dataset: a dataset of AI-generated images

civitai dataset image-classification image-generation image2text stable-diffusion

Python

9 months ago

Jerey / image-to-pdf-and-txt

Python tool that takes 1..n images, tries to rotate them based on the text, extracts the text, and stores the 1..n images in a PDF.

Python image2text opencv-python tesseract OCR Hacktoberfest

Python

3 years ago

michelecafagna26 / HL-dataset

[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rationale.

dataset vision-and-language image-captioning image2text

2 years ago