Repository navigation

vlms

Website
Wikipedia

oumi-ai / oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

dpo evaluation fine-tuning inference llama 大语言模型 sft vlms gpt-oss

Python

8418

637

2 小时前

NanoNets / docext

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

Python

1642

121

2 个月前

yueliu1999 / Awesome-Jailbreak-on-LLMs

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

人工智能 jailbreak 大语言模型隐私 safety 安全 vlm vlms

855

7 天前

dvlab-research / VisionZip

Official repository for VisionZip (CVPR 2025)

efficiency multi-modality vision-language-model vlms

Python

337

1 个月前

tianyi-lab / HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

benchmark vlms gpt-4 gpt-4v llava benchmarks hallucination 大语言模型 lmm large-language-models large-vision-language-models

Python

297

9 个月前

cequence-io / openai-scala-client

Scala client for OpenAI API and other major LLM providers

ChatGPT openai Scala gemini-ai groq-api 大语言模型 nlp-library vertex-ai-gemini-api vlms aws-bedrock anthropic gemini

Scala

231

9 天前

Beckschen / ViTamin

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

vlms

Python

207

1 年前

Alpha-Innovator / OmniCaptioner

Official Repository of OmniCaptioner

deepseek-r1 multi-modal vlms

Python

157

4 个月前

MCG-NJU / AWT

[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation

clip 机器视觉 video-understanding vlms zero-shot-learning transfer-learning

Python

104

10 个月前

TUM-AVS / FM-AD-Survey

This repository collects research papers of large Foundation Models for Scenario Generation and Analysis in Autonomous Driving. The repository will be continuously updated to track the latest update.

diffusion-models 大语言模型 vlms world-models autonomous-driving foundation-models scenario-analysis

22 天前