ChatGH

Ask me anything about your GitHub repositories

Or start instantly with one of the popular repositories below

Trending Repositories

Trending repositories on GitHub today

A local-first context compression layer for AI agents. Compresses tool outputs, logs, RAG chunks, files, and conversation history by 60–95% before sending to the LLM — with reversible retrieval via CCR. Works as a Python/TS library, zero-code proxy, MCP server, or one-command agent wrapper for Claude, Codex, Cursor, and more.

Python
8.7k

Microsoft's lightweight Python tool that converts PDF, Word, Excel, PowerPoint, images, audio, HTML, and more into Markdown — preserving headings, tables, and links. Designed for LLM and RAG pipelines, with CLI/Python API, MCP server for AI agents, plugin extensibility, and optional Azure cloud and LLM-powered OCR.

Python
142.4k

A cross-harness AI coding agent optimization system with 60 specialized agents, 232 skills, automated hooks, security scanning, continuous learning, and memory persistence. Works across Claude Code, Codex, Cursor, OpenCode, Copilot and more, covering 12+ language ecosystems for production-ready agentic development.

JavaScript
205.2k

D4Vinci/Scrapling

Scrapling is an adaptive Python web scraping framework that handles everything from single requests to full-scale crawls. Its smart parser automatically relocates elements after website changes, built-in fetchers bypass anti-bot systems like Cloudflare, and the spider framework supports concurrent crawling with pause/resume, proxy rotation, and AI integration via MCP server.

Python
59.9k

Self-hosted web interface for Hermes Agent with full CLI parity. Three-panel layout with session management, streaming chat, and workspace file browser. Features persistent memory, self-improving skills, cron scheduling, multi-provider support (OpenAI/Anthropic/Google/DeepSeek/local models), 10+ messaging platforms, password/passkey auth, 11 skins, and Docker deployment. Pure Python + vanilla JS, no build step.

Python
12.9k

An open-source OSINT graph investigation platform with visual node exploration and 30+ automated enrichers across domains, IPs, crypto wallets, social profiles, and more. Self-hosted, privacy-first, with Neo4j graph engine and n8n workflow integration for cybersecurity analysts and investigators.

TypeScript
4.8k

A tokenizer-free text-to-speech system using a diffusion autoregressive architecture. The 2B-parameter VoxCPM2 supports 30 languages, voice design from text descriptions, controllable voice cloning with style guidance, and 48kHz studio-quality audio output. Fully open-source under Apache-2.0, with real-time streaming and production deployment via vLLM-Omni.

Python
25.5k

stefan-jansen/machine-learning-for-trading

A comprehensive codebase for ML in algorithmic trading, with 150+ notebooks. Covers the end-to-end workflow, from sourcing market, fundamental, and alternative data through alpha factor engineering to strategy backtesting. Models range from linear regression, random forests, and gradient boosting to CNNs, RNNs, autoencoders, GANs, and deep reinforcement learning. Includes NLP pipelines for SEC filings, earnings calls, and news sentiment analysis.

Jupyter Notebook
18.9k

A 7-week hands-on course to build production-grade Agentic RAG systems from scratch. Covers infrastructure setup, arXiv data ingestion, BM25 keyword search, hybrid retrieval, LLM integration with streaming, Langfuse monitoring, Redis caching, LangGraph agent orchestration, and a Telegram bot, using FastAPI, OpenSearch, Airflow, and Docker.

Python
6.6k

Top Repositories

The most popular repositories on GitHub.

1

A comprehensive collection of step-by-step tutorials for rebuilding popular technologies from scratch. Covers databases, operating systems, programming languages, web servers, AI models, and more across multiple languages. Perfect for developers seeking to deeply understand core tech principles through hands-on implementation.

2

A curated list of awesome lists covering all kinds of interesting topics. This meta-list serves as the central directory for discovering high-quality resources on programming languages, frameworks, tools, and various technology domains.

3

freeCodeCamp.org's open-source codebase and curriculum. A friendly community where you can learn to code for free through thousands of interactive coding challenges, helping over 100,000 people land their first developer job.

4

A collectively curated list of free public APIs for software and web development. This extensive repository contains thousands of APIs across 100+ categories including animals, finance, health, music, news, science, sports, weather, and more. Maintained by the community and APILayer team.

5

A collaborative, multilingual collection of freely available programming books, courses, podcasts, and interactive tutorials. Maintained by the Free Ebook Foundation, it indexes thousands of learning resources across numerous programming languages and computer science topics.

6

OpenClaw is a self-hosted personal AI assistant that connects to WhatsApp, Telegram, Discord, iMessage and 20+ channels. Features voice wake, live canvas, browser automation, and runs on any OS with local-first privacy.

7

Community-driven interactive roadmaps, guides, and resources for developers. Covers frontend, backend, DevOps, AI, and more with clickable nodes for detailed learning paths and best practices.

8

Learn how to design large-scale systems. Prep for the system design interview. An organized collection of resources covering scalability, performance, availability, and common system design patterns with solutions and Anki flashcards.

9

A comprehensive computer science study plan for software engineering interviews. Covers data structures, algorithms, system design, and coding problems to prepare for tech giants like Google, Amazon, and Facebook.

10

A curated list of awesome Python frameworks, libraries, software and resources. Covers web frameworks, data analysis, machine learning, testing, and more.