-

Mindee Introduces Advanced Open-Source Optical Character Recognition with docTR

Leveraging deep learning and open-source, Mindee opens up state-of-the-art OCR capabilities to benefit the entire developer community

SAN FRANCISCO--(BUSINESS WIRE)--Mindee, the API-first platform designed for developers to eliminate manual data entry, announced the introduction of docTR, a seamless, high-performing, and accessible open-source library for OCR-related tasks powered by deep learning.

Mindee’s docTR provides optical character recognition with accessibility for the entire developer community. Combining textual parsing through text and object detection and recognition, this open-source repository offers a wider range and complex use cases. Going beyond the textual elements, it provides a holistic view of information encoded in visual forms, including QR codes, barcodes, information in ID pictures, and even logos.

Powered by the machine learning tool of your choice, TensorFlow 2 or PyTorch, DocTR features training capabilities for text detection in documents and images as well as recognition with pretrained parameters. It incorporates a five-line code to load documents, extract text with a predictor, and optimize for very high end-to-end performances, including inference speed on both CPU and GPU.

“At Monk, we have integrated state-of-the-art OCR models using docTR into our production pipeline to tackle our clients’ needs,” said Nicolas Schuhl, Head of Delivery at Monk. “DocTR offers amazing open-source tools to develop and deploy python OCR at scale with PyTorch or TensorFlow.”

With this offering, Mindee provides a wide audience, from entry-level developers to domain experts who want to train their model (researchers), the tools to support efforts in their transformation from intensive manual data entry (e.g., from physical documents, PDFs or images) to a full digital process. docTR was developed to provide organizations with tangible results ranging from time savings through the development process; easy integration with existing systems and architectures; minimized deployment costs; to increased productivity across departments with faster retrieval of information from documents.

“Releasing docTR as an open-source library opened a world of possibilities for innovation,” said Frédéric Harper, Director of Developer Relations at Mindee. “At Mindee, we take pride in adding value to the developer community. We made this code available with that in mind, to ensure developers can read it, understand it and be sure it’s safe. We are providing everyone with the possibility of making this OCR tool their own by allowing them to modify the code to fit their applications and infrastructure needs.”

docTR is fully available now with multiple ways to access:

To learn more about Mindee, please visit Mindee.com

About Mindee

Mindee is a pioneer of document parsing API leveraging machine learning to reduce manual data entry in software products. Headquartered in San Francisco, CA, U.S. and Paris, France, the company serves the finance, research, insurance, government, healthcare and logistics industries with state-of-the-art software for the developer community. Backed by venture capitalists including GGV Capital, Alven, Serena Capital, Venture Capital BPI France, as well as executive standouts from the application development industry, Mindee is well positioned to take advantage of the robotics process automation trend. For more information visit us at: mindee.com and follow us on LinkedIn, Twitter

Contacts

Mindee


Release Versions

Contacts

Social Media Profiles
More News From Mindee

Mindee Continues Momentum with Strong Global Growth and Significant Product Advances for 2022

SAN FRANCISCO--(BUSINESS WIRE)--Mindee, the developer platform for document understanding, announced today significant growth in its business since exiting stealth mode in October. The company is taking its API-first approach to new geographies, scaling its team to introduce new specialized products and features for different verticals while acquiring a growing list of marquee customers. “Our mission is to continue to be a developer’s best friend. This year, our focus is building SDKs in all la...

HRIS Leader, Lucca, Partners with Mindee to Launch a Procurement Offering to Enhance Business Productivity

SAN FRANCISCO--(BUSINESS WIRE)--Mindee, the API-first platform designed for developers to automate document-understanding and OCR tasks within their software, and Lucca, a European leader in HRIS, announced today their continued partnership as they embark on launching Lucca’s new solution, Cleemy Procurement. In partnership with Mindee, Lucca now provides a new procurement solution that focuses on automating the tedious Purchase to Pay process. The hallmark of this product is the hassle-free ha...

Mindee Comes Out of Stealth Mode Raising $14M with its Developer Tool for Document Parsing that Eliminates Manual Data Entry in Software Applications and Automates Document Processing

SAN FRANCISCO--(BUSINESS WIRE)--Mindee, the API-first platform designed for developers to eliminate manual data entry, has emerged from stealth mode and announces today that it has raised a $14 million Series A round led by GGV Capital with participation from Alven, and existing investors Serena Capital and Bpifrance through its Digital Venture fund. Notable executives in the application development space who also participated in the round include the co-founder and former CEO of Algolia, Nicol...
Back to Newsroom
  1. There was an issue with the authorization server. Please contact support if the issue persists.