Unified OCR library with multi-driver support for Tesseract.js and AI models, providing structured text extraction in a hast-based (HTML abstract syntax tree) output format.
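
To give a feel for what "hast-based output" can mean in practice, here is a minimal sketch rather than this library's actual API. The node shapes follow the hast specification (`root`, `element`, `text`), but `OcrLine`, `OcrDriver`, `linesToHast`, `hastToText`, and the `dataConfidence` property are hypothetical names introduced only for illustration.

```typescript
// Minimal hast node shapes (per the hast spec), declared inline so the
// sketch is self-contained and needs no external type packages.
interface HastText {
  type: "text";
  value: string;
}
interface HastElement {
  type: "element";
  tagName: string;
  properties: Record<string, string | number>;
  children: Array<HastElement | HastText>;
}
interface HastRoot {
  type: "root";
  children: Array<HastElement | HastText>;
}

// ASSUMPTION: a driver-neutral shape for one recognized line of text.
// The real library's result type may differ.
interface OcrLine {
  text: string;
  confidence: number; // 0..1
}

// ASSUMPTION: a hypothetical contract that each driver (Tesseract.js,
// an AI model, ...) could implement so callers stay driver-agnostic.
interface OcrDriver {
  name: string;
  recognize(image: Uint8Array): Promise<OcrLine[]>;
}

// Convert recognized lines into a hast tree: one <p> element per line,
// with the confidence stored as a data-* property.
function linesToHast(lines: OcrLine[]): HastRoot {
  return {
    type: "root",
    children: lines.map(
      (line): HastElement => ({
        type: "element",
        tagName: "p",
        properties: { dataConfidence: line.confidence },
        children: [{ type: "text", value: line.text }],
      })
    ),
  };
}

// Flatten a hast tree back to plain text, one line per top-level node.
function hastToText(node: HastRoot | HastElement | HastText): string {
  if (node.type === "text") return node.value;
  const separator = node.type === "root" ? "\n" : "";
  return node.children.map((child) => hastToText(child)).join(separator);
}

// Sample data standing in for what a driver might return.
const sampleLines: OcrLine[] = [
  { text: "Hello world", confidence: 0.98 },
  { text: "Second line", confidence: 0.91 },
];

const tree = linesToHast(sampleLines);
const plainText = hastToText(tree);
```

Because the result is an ordinary hast tree, it could in principle be passed to existing hast tooling (for example, serializers that turn hast into HTML) regardless of which driver produced the text.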