webpagedatascraper
Web scraping extension for Puppeteer. Quickly scrape HTML page data into a JSON object.
crawldown
Crawl websites and convert their content into clean, readable Markdown using Turndown and Mozilla's Readability.
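To make the HTML-to-Markdown step concrete, here is a minimal, dependency-free sketch in TypeScript. It is not crawldown's code and it does not use Readability or Turndown. It swaps in a few regular expressions that handle only a small, well-formed HTML subset. The function name `htmlToMarkdown` and the sample input are hypothetical, chosen for illustration.

```typescript
// Hypothetical helper (an assumption, not part of any package listed here):
// converts a small, well-formed HTML subset to Markdown with regular expressions.
// A real converter such as Turndown walks the DOM and handles far more cases.
function htmlToMarkdown(html: string): string {
  return html
    // Drop script and style blocks together with their contents.
    .replace(/<(script|style)\b[^>]*>[\s\S]*?<\/\1>/gi, "")
    // h1..h6 become ATX headings ("#", "##", ...).
    .replace(
      /<h([1-6])\b[^>]*>([\s\S]*?)<\/h\1>/gi,
      (_match: string, level: string, text: string) =>
        `\n\n${"#".repeat(Number(level))} ${text.trim()}\n\n`
    )
    // Links with a double-quoted href become [text](url).
    .replace(/<a\b[^>]*\bhref="([^"]*)"[^>]*>([\s\S]*?)<\/a>/gi, "[$2]($1)")
    // Bold and italic.
    .replace(/<(strong|b)\b[^>]*>([\s\S]*?)<\/\1>/gi, "**$2**")
    .replace(/<(em|i)\b[^>]*>([\s\S]*?)<\/\1>/gi, "*$2*")
    // List items become "- item" lines (ordered lists are not numbered here).
    .replace(/<li\b[^>]*>([\s\S]*?)<\/li>/gi, "\n- $1")
    // Paragraphs are separated by blank lines.
    .replace(/<p\b[^>]*>([\s\S]*?)<\/p>/gi, "\n\n$1\n\n")
    // Strip any tags that are left; HTML entities are NOT decoded.
    .replace(/<[^>]+>/g, "")
    // Collapse runs of three or more newlines into one blank line.
    .replace(/\n{3,}/g, "\n\n")
    .trim();
}

// Sample input (made up for this sketch).
const sampleHtml =
  '<h1>Title</h1><p>Hello <strong>world</strong>, see ' +
  '<a href="https://example.com">the docs</a>.</p><script>x()</script>';

console.log(htmlToMarkdown(sampleHtml));
```

A regex pass like this breaks on nested or malformed markup, which is one reason packages in this space typically rely on a DOM-based extractor and converter instead.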
crawl-to-markdown
Crawl-to-markdown is a TypeScript package that queries search engines for a given keyword, crawls the resulting websites, and delivers the content in clean, readable Markdown. Additionally, it can directly crawl specified websites for