pdf2json
PDF file parser that converts PDF binaries to JSON and text, built on a fork of PDF.js ported to Node.js
officeparser
A Node.js library that extracts text from office files. Currently supports docx, pptx, xlsx, odt, odp, ods, and pdf files.
node-poppler
Asynchronous Node.js wrapper for the Poppler PDF rendering library