html-to-markdown

🛏 An HTML to Markdown converter written in JavaScript

HTML
10323
2 months ago

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python
4763
22 days ago
JohannesKaufmann/html-to-markdown

⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.

Go
3113
20 days ago

CommonMark/Markdown Java parser with source level AST. CommonMark 0.28, emulation of: pegdown, kramdown, markdown.pl, MultiMarkdown. With HTML to MD, MD to PDF, MD to DOCX conversion modules.

Java
2513
6 months ago

AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/Baidu/etc. Native multi-threading for bulk processing.

TypeScript
2327
5 days ago

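A lightweight yet powerful one-click HTML-to-Markdown tool, open-sourced by the helloworld developer community. It converts articles from multiple platforms in one click and saves the downloads locally.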

JavaScript
772
1 year ago

🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.

Jupyter Notebook
561
4 months ago

It's time for your markup to get down! HTML to markdown converter. Breakdance is highly pluggable, flexible, and easy to use.

JavaScript
532
3 years ago

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG

Python
403
1 year ago

reader is for your command line what the “readability” view is for modern browsers: A lightweight tool offering better readability of web pages (and EML files!) on the CLI.

Go
377
3 months ago

📋 Browser extension to copy text as Markdown (with GFM and MathML support)

JavaScript
366
4 months ago

Slurps webpages and saves them as clean, uncluttered Markdown. Think Pocket, but better.

TypeScript
244
10 months ago

Firefox add-on to copy selection as Markdown

JavaScript
208
4 months ago

A CLI tool that converts exported Medium posts (html) to Jekyll/Hugo compatible markdown with front matter.

JavaScript
148
1 year ago

Export Atlassian Confluence pages as markdown files.

Python
142
1 month ago

😼 Dependency-free and lean DOM parser that outputs Markdown

JavaScript
86
3 years ago

Claude Chat Exporter is a JavaScript tool that allows you to export your conversations with Claude AI into a well-formatted Markdown file.

JavaScript
82
1 month ago