Repository navigation

#

html-to-markdown

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript
36340
17 小时前

🛏 An HTML to Markdown converter written in JavaScript

HTML
9570
9 个月前

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python
4150
1 个月前
JohannesKaufmann/html-to-markdown

⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.

Go
2778
6 天前

CommonMark/Markdown Java parser with source level AST. CommonMark 0.28, emulation of: pegdown, kramdown, markdown.pl, MultiMarkdown. With HTML to MD, MD to PDF, MD to DOCX conversion modules.

Java
2394
4 天前

helloworld 开发者社区开源的一个轻量级,强大的 html 一键转 md 工具,支持多平台文章一键转换,并保存下载到本地。

JavaScript
731
1 年前

It's time for your markup to get down! HTML to markdown converter. Breakdance is a highly pluggable, flexible and easy to use.

JavaScript
531
3 年前

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG

Python
376
8 个月前

🖱 Browser extension to copy hyperlinks, images, and selected text as Markdown with GFM support

JavaScript
340
6 个月前

reader is for your command line what the “readability” view is for modern browsers: A lightweight tool offering better readability of web pages on the CLI.

Go
340
3 天前

🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.

TypeScript
309
10 天前

Slurps webpages and saves them as clean, uncluttered Markdown. Think Pocket, but better.

TypeScript
212
4 个月前

Firefox add-on to copy selection as Markdown

HTML
198
8 个月前

A CLI tool that converts exported Medium posts (html) to Jekyll/Hugo compatible markdown with front matter.

JavaScript
147
1 年前

😼 Dependency-free and lean DOM parser that outputs Markdown

JavaScript
87
3 年前

The best HTML to Markdown library, A esm-native & Useful Utilities with simple, lightweight and epic quality.

TypeScript
64
16 天前

HTML-to-Markdown converter that adaptively preserves HTML when needed (eg. when center-aligning, or resizing images)

TypeScript
64
2 年前

📝 XK-Editor | 一个支持富文本和Markdown的编辑器

CSS
57
4 年前

Transform your HTML into clean, easy-to-read markdown with html2md.

C++
55
6 天前