Repository navigation
pixtral
- Website
- Wikipedia
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
DelphiMistralAI wrapper brings Mistral’s text-vision-audio models and agentic Conversations to Delphi, with chat, embeddings, Codestral codegen, fine-tuning, batching, moderation, async/await helpers and live request monitoring.
An open-source implementaion for fine-tuning Pixtral by MistralAI.
In this we finetune Pixtral-12B-2409 model using unsloth for visual Question Answering(NLP Task)
Examples of Mistral API using Python REST / curl
LoRA-based fine-tuning pipeline for Pixtral‑12B on multimodal instruction tasks using Hugging Face and vision-language templates.
LookOutAI is a tool for spotting a target person across images or video clips using facial recognition. It leverages NLP and LLMs to describe appearances and includes features to censor or highlight the target’s face.
RevealVLLMSafetyEval is a comprehensive pipeline for evaluating Vision-Language Models (VLMs) on their compliance with harm-related policies. It automates the creation of adversarial multi-turn datasets and the evaluation of model responses, supporting responsible AI development and red-teaming efforts.
Pixtral Streamlit Demo App