unfluff
A web page content extractor
metascraper
A library to easily scrape metadata from an article on the web using Open Graph, JSON-LD, regular HTML metadata, and a series of fallbacks.
website-scraper
Downloads a website to a local directory (including all CSS, images, JS, etc.)
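
The "series of fallbacks" idea in the metascraper entry can be illustrated with a small sketch. This is not metascraper's implementation or API. The names `firstMatch`, `titleExtractors`, and `extractTitle` are hypothetical, and the regular expressions assume double-quoted attributes in a fixed order, which a real HTML parser would not require. Sources are tried in priority order (Open Graph, then JSON-LD, then the plain `<title>` tag) and the first non-empty value wins.

```typescript
// Hypothetical illustration of a metadata fallback chain (not metascraper's API).
type Extractor = (html: string) => string | undefined;

// Return the first capture group of a match, trimmed, or undefined if empty or absent.
function firstMatch(html: string, pattern: RegExp): string | undefined {
  const match = pattern.exec(html);
  const value = match?.[1]?.trim();
  return value ? value : undefined;
}

// Extractors in priority order. Assumption: attributes are double-quoted and
// appear in exactly this order; real pages need a proper HTML parser.
const titleExtractors: Extractor[] = [
  // 1. Open Graph: <meta property="og:title" content="...">
  (html) => firstMatch(html, /<meta\s+property="og:title"\s+content="([^"]*)"/i),
  // 2. JSON-LD: the "headline" field of the first ld+json script block
  (html) => {
    const raw = firstMatch(
      html,
      /<script\s+type="application\/ld\+json"\s*>([\s\S]*?)<\/script>/i
    );
    if (!raw) return undefined;
    try {
      const data = JSON.parse(raw);
      return typeof data.headline === "string" ? data.headline : undefined;
    } catch {
      return undefined; // malformed JSON: fall through to the next source
    }
  },
  // 3. Regular HTML: the <title> element
  (html) => firstMatch(html, /<title[^>]*>([^<]*)<\/title>/i),
];

// Try each extractor in turn and return the first value found.
function extractTitle(html: string): string | undefined {
  for (const extract of titleExtractors) {
    const value = extract(html);
    if (value) return value;
  }
  return undefined;
}
```

Keeping the extractors in an ordered array makes the priority explicit, so adding another fallback is a one-line change.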