Repository navigation

#

article-parser

extractus/article-extractor

To extract main article from given URL with Node.js

JavaScript
1748
1 个月前

Extract article or news by url or html, parse the title and content, output in markdown format.

Python
50
1 年前

A web page article parser which returns an object containing the article's formatted text and other attributes including sentiment, keyphrases, people, places, organisations, spelling suggestions, in-article links, meta data & lighthouse audit results.

JavaScript
18
5 天前

This Python-based repository hosts a sophisticated service designed for scraping web articles and converting them into Markdown format. The core functionality of this service includes extracting the main content of articles, such as headlines, key paragraphs, and associated images, and then seamlessly transforming this content into well-structured…

Python
5
2 年前

Library in .NET with blazor components to support rendering article from database with interactive server side component within the article. In summary, it is a library that supports the creation and rendering of pages with articles, such as blogs or educational materials.

HTML
0
3 个月前

Article parser for Habr, Proglib, and vc.ru that extracts main content, removes ads and unnecessary elements, preserving proper formatting

Python
0
2 天前