Mistral AI has released OCR 4, a specialized model designed to extract structured data from diverse file formats including PDFs, Word documents, and PowerPoint presentations. Unlike traditional optical character recognition tools that produce raw text, this model identifies specific document elements such as tables, equations, and signatures. It provides precise bounding boxes for each element and per-token confidence scores to facilitate automated verification and human-in-the-loop reviews.
Source