decomment
Removes comments from JSON/JavaScript, CSS/HTML, CPP/H, etc.
ts-webcrawler
A typescript webcrawler library for downloading and parsing webpages
htmljs-parser
An HTML parser recognizes content and string placeholders and allows JavaScript expressions as attribute values