office-text-extractor

Yet another library to extract text from MS Office and PDF files

v3.0.3 URL: https://unpkg.com/office-text-extractor@3.0.3/build/index.js

Open Browse Files

text-extraction get-text parser ms-office ms-excel ms-word ms-powerpoint xlsx docx pptx pdf

n8n-nodes-local-ai-stack

n8n custom nodes for AI services including image captionning, OCR, face detection, and more AI-powered features

v1.1.7 URL: https://unpkg.com/n8n-nodes-local-ai-stack@1.1.7/dist/index.js

Open Browse Files

n8n node custom ai ocr image-captioning text-extraction face-detection image-processing machine-learning artificial-intelligence llm local-ai-stack

@danilidonbeltran/webscrapper

A web scraper using Playwright to extract all text content from websites

v1.7.0 URL: https://unpkg.com/@danilidonbeltran/webscrapper@1.7.0/src/index.js

Open Browse Files

webscraping playwright text-extraction

node-easyocr

A Node.js wrapper for the Python EasyOCR library

v1.0.9 URL: https://unpkg.com/node-easyocr@1.0.9/dist/easyocr.js

Open Browse Files

ocr easyocr optical-character-recognition image-processing text-extraction document-analysis python-wrapper image-to-text

ppu-pdf

Easily extract text from digital PDF files with coordinate and font size included, and optionally group text by lines or render scanned pdf to canvas/png.

v5.6.0 URL: https://unpkg.com/ppu-pdf@5.6.0/index.js

Open Browse Files

pdf-reader text-extraction pdf-rag bbox pdf pdf-typescript bun pdf-digital pdf-scan pdf-canvas pdfjs mupdf mupdfjs ocr