oust
Extract URLs to stylesheets, scripts, links, images or HTML imports from HTML
metascraper
A library to easily scrape metadata from an article on the web using Open Graph, JSON+LD, regular HTML metadata, and series of fallbacks.
innertext
Extract the innerText from a snippet of HTML