manual-html

HTML 手册
v1.0.10 URL: https://unpkg.com/manual-html@1.0.10/index.js
OpenBrowse Files
htmlhtml5html-tagshtml-taghtml-css-javascripthtml-parsing

forgiving-xml-parser

An forgiving XML/HTML parser and serializer for JavaScript.
v1.4.2 URL: https://unpkg.com/forgiving-xml-parser@1.4.2/dist/index.cjs.js
OpenBrowse Files
forgiving-xml-parserxml-parsingxml-parserhtml-parsinghtml-parserxml2jsonxml2jsjson2xmljs2xmlhtml2jsonhtml2jsjson2htmljs2htmlparserserializerhtmlxml

@lydio/semantics

HTML standard nodes extension for Lydio
v1.0.0 URL: https://unpkg.com/@lydio/semantics@1.0.0/src/index.js
OpenBrowse Files
lydiosemanticshtmlstructured-htmlsemantic-htmlhtml-generatormarkupdocument-structureaccessibilityariaseoauditvalidationweb-developmentweb-standardsweb-accessibilityhtml5metadatadata-extractionjson-representationsemantic-datahtml-parsingdynamic-htmlautomated-testingfrontendtemplating

js-harvester

Harvester is a lightweight and highly optimized javascript library for extracting data from the DOM tree. It supports extraction of tag texts with specified types and attributes. it's tiny and has no dependencies and also works with Puppeteer
v0.3.14 URL: https://unpkg.com/js-harvester@0.3.14/index.js
OpenBrowse Files
puppeteerplaywrightlightweightoptimizedweb-scrapingwebscrapingdata-extractiondataextractionhtml-parsinghtmlparsingdom-parsingdomparsingscrapingextractionharvestingdata-harvestingtemplate-based-scrapingtemplate-basedscrapingtemplate-extractiontemplateextractionpattern-based-scrapingpattern-basedscrapingvisual-scraping-templatedeclarative-scrapingfuzzy-scrapingfuzzyscrapingapproximate-scrapingapproximatescrapingresilient-scrapingresilientscrapingflexible-scrapingflexiblescrapingstructure-agnostic-scrapingsemantic-scrapingtree-template-scrapingtree-templatescrapingpseudo-tree-templatestring-template-scrapingstring-templatescrapingindentation-based-templatevisual-templatejavascript-scrapingjavascriptscrapingnpm-packagebrowser-scrapingnodejs-scrapingnode-jsnodejsdom-traversaldom-manipulationfrontend-scrapinghierarchical-data-extractionnested-data-extractionattribute-extractiontext-extractionweb-automationcontent-extractionweb-data-extraction

xscrape

A flexible and powerful library designed to extract and transform data from HTML documents using user-defined schemas
v3.0.4 URL: https://unpkg.com/xscrape@3.0.4/dist/index.js
OpenBrowse Files
web-scrapingdata-extractionautomationhtml-parsingdata-transformationuser-defined-schemascrawlerscraperzodvalibotarktypeeffect-schemastandard-schema