js-harvester

Harvester is a lightweight and highly optimized javascript library for extracting data from the DOM tree. It supports extraction of tag texts with specified types and attributes. it's tiny and has no dependencies and also works with Puppeteer
v0.3.14 URL: https://unpkg.com/js-harvester@0.3.14/index.js
OpenBrowse Files
puppeteerplaywrightlightweightoptimizedweb-scrapingwebscrapingdata-extractiondataextractionhtml-parsinghtmlparsingdom-parsingdomparsingscrapingextractionharvestingdata-harvestingtemplate-based-scrapingtemplate-basedscrapingtemplate-extractiontemplateextractionpattern-based-scrapingpattern-basedscrapingvisual-scraping-templatedeclarative-scrapingfuzzy-scrapingfuzzyscrapingapproximate-scrapingapproximatescrapingresilient-scrapingresilientscrapingflexible-scrapingflexiblescrapingstructure-agnostic-scrapingsemantic-scrapingtree-template-scrapingtree-templatescrapingpseudo-tree-templatestring-template-scrapingstring-templatescrapingindentation-based-templatevisual-templatejavascript-scrapingjavascriptscrapingnpm-packagebrowser-scrapingnodejs-scrapingnode-jsnodejsdom-traversaldom-manipulationfrontend-scrapinghierarchical-data-extractionnested-data-extractionattribute-extractiontext-extractionweb-automationcontent-extractionweb-data-extraction