ml-testing
🐢 Open-Source Evaluation & Testing library for LLM Agents
Deliver safe & effective language models
📚 A curated list of papers & technical articles on AI Quality & Safety
An End-to-End Evaluation Framework for Entity Resolution Systems
Streamlit app for "Honey, I broke the PyTorch model" - Talk @ PyCon & PyData 2023
Evaluation & testing framework for computer vision models
ML Testing for Everyone. Find issues before they become problems.
Build confidence in your AI with systematic slice-based testing
✍️ Collaborate on writing technical content for the Giskard Community
Algorithmic inspection for trustworthy ML models
Learning Python, day 4
ML-focused synthetic data platform with realistic traffic patterns, seasonal effects, and temporal drift. Includes a BNPL (buy now, pay later) transaction generator with risk scoring and configurable arrival patterns (Poisson, NHPP, Burst). Live API: simtom-production.up.railway.app | Day-per-second historical replay.