@docling/docling-core

TypeScript definitions and functions for using Docling output.
v0.0.7 URL: https://unpkg.com/@docling/docling-core@0.0.7/dist/index.js
html, markdown, pdf, ai, convert, xlsx, pdf-converter, docx, documents, pptx, pdf-to-text, tables, document-parser, pdf-to-json, document-parsing

@docling/docling-components

Web components for displaying Docling output.
v0.0.7 URL: https://unpkg.com/@docling/docling-components@0.0.7/dist/index.es.js
html, markdown, pdf, ai, convert, xlsx, pdf-converter, docx, documents, pptx, pdf-to-text, tables, document-parser, pdf-to-json, document-parsing

doc-to-readable

Universal document-to-markdown and section splitter for HTML, URLs, and PDFs.
v1.5.3 URL: https://unpkg.com/doc-to-readable@1.5.3/src/index.js
markdown, markdown-converter, html-to-markdown, pdf-to-markdown, url-to-markdown, document-parser, content-extraction, readability, article-extractor, text-processing, document-processing, content-splitter, section-splitter, rag, retrieval-augmented-generation, ai-preprocessing, web-scraping, content-cleanup, universal-parser, cross-platform, browser-compatible, nodejs, typescript, javascript, turndown, readability-parser, pdfjs, dompurify

covers-extractor

This library extracts covers from many document formats in a given folder.
v0.19.6 URL: https://unpkg.com/covers-extractor@0.19.6/index.js
book-covers, extractor, document-parser, document to image

@xoxoharsh/multiparser

A text-extraction package for docx, pdf, and pptx files.
v1.0.0 URL: https://unpkg.com/@xoxoharsh/multiparser@1.0.0/index.js
parser, docx, pdf, pptx, text-extraction, document-parser