webscraper
Web Scraper in Go, similar to BeautifulSoup
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Scalable Python web scraping scripts for 40+ popular domains
A class that uses scraped proxies to make HTTP GET/POST requests (Python requests)
An R web crawler and scraper
An AI assistant tool that integrates coding, writing, and reading functions. For better alternatives see https://monica.im/desktop
Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.
📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simple prefix http://127.0.0.1:3000/https://website-to-scrape.com/
DotnetCrawler is a straightforward, lightweight web crawling/scraping library based on .NET Core, with Entity Framework Core output. It is designed like other strong crawler libraries such as WebMagic and Scrapy, but can be extended to fit your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
RSS feed builder created with Bun🥖 and Hono🔥; builds feeds from webpages, email folders, and REST API calls.
Financial Web Scraper & Sentiment Classifier
Web scraper for Shutterstock
Scrapes g4g and creates a PDF
A Python command-line tool for scraping and downloading subtitles from AppleTV and iTunes movie pages.
Spotify scraper that extracts information from Spotify and downloads MP3s with the song's cover art
Cryptocurrency Historical Market Data R Package
A PHP crawler that finds email addresses on the internet