@wdelhagen/textprep

Document text extraction with pluggable extractors. Supports PDF, DOCX, DOC, RTF, TXT, and image files with OCR capabilities.
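
The "pluggable extractors" idea can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Extractor` interface, the `ExtractorRegistry` class, and the `supports` and `extract` methods are hypothetical names, not the documented API of `@wdelhagen/textprep`. The sketch only shows the general pattern of registering one extractor per file extension and dispatching on the file name.

```typescript
// Hypothetical sketch of a pluggable-extractor pattern.
// None of these names are taken from @wdelhagen/textprep.

interface Extractor {
  // File extensions (without the dot) that this extractor handles.
  extensions: string[];
  // Turns raw file bytes into plain text.
  extract(data: Uint8Array): Promise<string>;
}

class ExtractorRegistry {
  private byExtension = new Map<string, Extractor>();

  // Register an extractor under each of its extensions (case-insensitive).
  register(extractor: Extractor): void {
    for (const ext of extractor.extensions) {
      this.byExtension.set(ext.toLowerCase(), extractor);
    }
  }

  // Report whether some registered extractor handles this file name.
  supports(filename: string): boolean {
    return this.byExtension.has(extensionOf(filename));
  }

  // Dispatch to the extractor registered for the file's extension.
  async extract(filename: string, data: Uint8Array): Promise<string> {
    const ext = extensionOf(filename);
    const extractor = this.byExtension.get(ext);
    if (!extractor) {
      throw new Error(`No extractor registered for ".${ext}"`);
    }
    return extractor.extract(data);
  }
}

// Lowercased extension of a file name, or "" when there is none.
function extensionOf(filename: string): string {
  const dot = filename.lastIndexOf(".");
  return dot >= 0 ? filename.slice(dot + 1).toLowerCase() : "";
}

// A trivial extractor for .txt files. It maps each byte to one character,
// so it is only correct for ASCII/Latin-1 input; a real extractor would
// decode UTF-8 properly.
const plainText: Extractor = {
  extensions: ["txt"],
  extract: async (data) =>
    Array.from(data, (byte) => String.fromCharCode(byte)).join(""),
};

const registry = new ExtractorRegistry();
registry.register(plainText);
```

Under this pattern, support for another format (for example an OCR-backed image extractor) would be added by registering one more object that implements the same interface, without touching the dispatch code.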
