Building an OCR API for Invoice Processing with Rusty Tesseract OCR and Actix Web

In This Blog Post we will explore another wrapper of Tesseract OCR for rust programmers who want to parse receipts and invoices or any other document images

This blog post guides Rust developers through creating an Optical Character Recognition (OCR) API using the rusty_tesseract wrapper and actix-web framework, tailored for processing invoices with Tesseract OCR Page Segmentation Mode (PSM) 12. PSM 12 is ideal for sparse text layouts, such as invoices, where text is scattered in blocks rather than in continuous lines. We'll break down the provided code step by step, explain its components, and show how to set up and run the API.

Prerequisites

Before diving into the code, ensure you have:

Project Setup

Create a new Rust project:

Add the following dependencies to your Cargo.toml:

Understanding the Code

The provided code implements an OCR API that accepts an image file upload, processes it with Tesseract OCR using PSM 12, and returns the extracted text in JSON format. Let's break it down step by step.

Step 1: Import Necessary Crates

These imports bring in the required modules for:

Step 2: Define Data Structures

Step 3: Implement the OCR Endpoint

The /ocr endpoint handles POST requests with an image file, processes it with Tesseract OCR, and returns the extracted text.

Breakdown of the Endpoint

Step 4: Set Up the Actix Web Server

Complete Code

Here’s the full main.rs file for reference:

Running the API

The response will be a JSON object like:

Why PSM 12 for Invoices?

Tesseract’s OCR Page Segmentation Mode (PSM) 12 is specifically designed for sparse text, making it ideal for invoices. Invoices often have text in isolated blocks (e.g., vendor details, line items, totals) rather than continuous paragraphs. PSM 12 treats the image as a single text block with no assumptions about layout, which helps accurately extract text from such documents.

Tips for Better OCR Results

FInal Words:

This tutorial demonstrated how to build an OCR API for invoice processing using rusty_tesseract and actix-web. By leveraging PSM 12, the API effectively handles the sparse text layout of invoices. You can extend this project by adding image preprocessing, supporting multiple languages, or extracting specific invoice fields (e.g., totals, dates) using regex or NLP.

Tesseract OCR Rusty Tesseract Actix Web

React to this Post

Comments

No comments yet. Be the first to comment!

Explore More AI Viewz Blog Posts

Ditch Pytesseract and Switch to Better Alternative

Optical Character Recognition (OCR) is a crucial tool for extracting text from images, PDFs, and scanned documents. While Pytesseract is the most popular Python wrapper for Tesseract OCR, it suffers from performance bottlenecks due to its Python-based implementation.

OCR Showdown: Tesseract vs Other Open Source Alternatives

Optical Character Recognition (OCR) has revolutionized how machines interpret text from images. With several powerful OCR engines available, choosing the right one depends on factors like accuracy, speed, language support, and hardware requirements. In this blog post, we’ll dive deep into Tesseract OCR, Pytesseract, EasyOCR, PaddleOCR, and TesserOCR, comparing their performance, limitations, and best use cases.

How to Create an OCR API in Rust Using Tesseract OCR and Actix-Web

Optical Character Recognition (OCR) is a powerful tool for extracting text from images, and Rust provides excellent libraries to build high-performance OCR applications. In this tutorial, we'll create a Rust-based OCR API using Tesseract OCR (Leptess) and Actix-Web to process uploaded images and return extracted text.

How To Install Latest Version 5.4 of Tesseract OCR ?

Optical Character Recognition (OCR) is a powerful tool that converts images of text into machine-readable text. Among the most popular OCR engines is Tesseract OCR, an open-source solution developed by Google. Whether you're working on document processing, data extraction, or automation, installing the right version of Tesseract is crucial. In this guide, we’ll walk you through the installation process for different versions of Tesseract OCR, including stable and developer releases, as well as language packs.

Tesseract OCR: Setup a High Performance Document Processing On Premise Server

In The Modern Era Of Generative AI, we have many APIs and solutions to process our documents and images. But Some Clients and Companies enforce restrictions to only develop and use on premise solutions which can become very complex from cost and performance perspective. Let's explore one approach in this guide.

How TesserOCR Outperforms Pytesseract in 2025?

Why TesserOCR Outshines Pytesseract for OCR in Python? Optical Character Recognition (OCR) is a critical tool for extracting text from images, and Python developers often rely on libraries like Pytesseract and TesserOCR to harness Google’s Tesseract OCR engine. While Pytesseract is popular for its ease of use, it has significant limitations that make it less suitable for performance-critical or complex applications. TesserOCR, a direct binding to Tesseract’s C++ API, offers superior efficiency, flexibility, and control. This blog post explores why Pytesseract may not be the best choice for OCR in Python and highlights the benefits of switching to TesserOCR, with a practical example of optimizing performance.

What is OCR?

Optical Character Recognition (OCR) converts images or scanned documents into editable text, making it easy to digitize receipts, notes, or books.

OCR for Everyday Use

Use OCR to extract text from photos of signs, menus, or handwritten notes, saving time and effort in daily tasks.

OCR for Students

Students can digitize lecture notes or book pages with OCR, creating searchable study materials for better organization and revision.

OCR for Businesses

Businesses use OCR to automate data entry from invoices, contracts, or forms, streamlining workflows and reducing errors.

How to Use Image to Text on AI Viewz

Upload an image (e.g., PNG, JPG) to AI Viewz’s OCR tool, click “Process,” and get editable text instantly. Perfect for receipts or notes.

How to Use PDF to Text on AI Viewz

Upload a PDF to AI Viewz’s OCR tool, select “Extract Text,” and receive a text file or editable document in seconds.

Discover more about our Image (PDF) to Text OCR Service or explore advanced Image and PDF Analysis tools on AI Viewz.

Convert PDF to Excel, Word & More