pdf-ts
PDF text extraction in TypeScript
crack-json
Extracts all JSON objects from an arbitrary text document.
textract
Extracts text from files of various types, including html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf, text/*, and various OpenOffice formats.
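
As an illustration of what "extracting all JSON objects from an arbitrary text document" (the crack-json description above) can involve, here is a minimal, self-contained sketch. It is not the crack-json API or its implementation; the function names extractJsonObjects and findMatchingBrace are hypothetical. The sketch assumes that only top-level objects delimited by { ... } are of interest and relies on JSON.parse to validate each candidate.

```typescript
// Hypothetical helper, NOT part of crack-json: returns the index of the "}"
// that closes the "{" at `start`, or -1 if the braces never balance.
function findMatchingBrace(text: string, start: number): number {
  let depth = 0;
  let inString = false;
  for (let j = start; j < text.length; j++) {
    const ch = text[j];
    if (inString) {
      if (ch === "\\") {
        j++; // skip the escaped character inside a string literal
      } else if (ch === '"') {
        inString = false;
      }
    } else if (ch === '"') {
      inString = true;
    } else if (ch === "{") {
      depth++;
    } else if (ch === "}") {
      depth--;
      if (depth === 0) return j;
    }
  }
  return -1;
}

// Hypothetical helper, NOT part of crack-json: scans the text for "{",
// finds the balanced candidate span, and keeps it only if JSON.parse accepts it.
function extractJsonObjects(text: string): unknown[] {
  const found: unknown[] = [];
  let i = 0;
  while (i < text.length) {
    if (text[i] !== "{") {
      i++;
      continue;
    }
    const end = findMatchingBrace(text, i);
    if (end === -1) {
      i++;
      continue;
    }
    try {
      found.push(JSON.parse(text.slice(i, end + 1)));
      i = end + 1; // resume after the object that was just parsed
    } catch {
      i++; // not valid JSON; try the next character
    }
  }
  return found;
}

const sample = 'log: {"a": 1} then {"b": {"c": [1, 2]}} and {not json}';
console.log(extractJsonObjects(sample));
```

Candidates that balance but fail to parse, such as {not json} in the sample, are skipped rather than reported.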