How To Install tesseract-ocr on Ubuntu
Posted on April 1, 2023 (Last modified on May 20, 2023)
Introduction
In this tutorial we learn how to install tesseract-ocr on Ubuntu.
What is tesseract-ocr
Tesseract is an open source Optical Character Recognition (OCR) engine. It can be used directly or, for programmers, through an API to extract printed text from images. It supports a wide variety of languages. The tesseract-ocr package includes the command line tool.
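As a rough illustration of using the command line tool directly, the sketch below wraps a basic invocation. The helper name ocr_to_stdout is hypothetical and not part of tesseract. It assumes the English language data (tesseract-ocr-eng) is installed.

```shell
# Sketch: print the text recognized in an image to standard output.
# ocr_to_stdout is a hypothetical helper name, not part of tesseract.
ocr_to_stdout() {
    image="$1"
    if [ ! -f "$image" ]; then
        echo "no such file: $image" >&2
        return 1
    fi
    if ! command -v tesseract >/dev/null 2>&1; then
        echo "tesseract is not installed" >&2
        return 127
    fi
    # Using "stdout" as the output base prints the text instead of
    # writing a file; -l eng selects the English language data.
    tesseract "$image" stdout -l eng
}
```

Once the package is installed, calling ocr_to_stdout with the path to an image (for example a hypothetical scan.png) should print whatever text the engine recognizes.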
There are three ways to install tesseract-ocr on Ubuntu: apt-get, apt, and aptitude. The following sections describe each method. You can choose any one of them.
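Before choosing a method, it may help to see which of the three tools are present on the system. The following is a minimal sketch; the function name list_apt_tools is made up for this example.

```shell
# Sketch: report which of the three package tools are available.
# list_apt_tools is a hypothetical name chosen for illustration.
list_apt_tools() {
    for tool in apt-get apt aptitude; do
        if command -v "$tool" >/dev/null 2>&1; then
            echo "$tool: available"
        else
            echo "$tool: not installed"
        fi
    done
}
```

On a default Ubuntu installation we would expect apt-get and apt to be available, while aptitude is usually not installed.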
Install tesseract-ocr Using apt-get
Update the apt package index with apt-get using the following command:
sudo apt-get update
After updating the package index, we can install tesseract-ocr with apt-get by running the following command:
sudo apt-get -y install tesseract-ocr
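Whichever method is used, we can check that the installation worked. The sketch below assumes the package places a tesseract binary on the PATH; verify_tesseract is a hypothetical helper name.

```shell
# Sketch: confirm the tesseract binary is on PATH and show what it supports.
# verify_tesseract is a hypothetical helper name.
verify_tesseract() {
    if ! command -v tesseract >/dev/null 2>&1; then
        echo "tesseract not found on PATH" >&2
        return 1
    fi
    tesseract --version      # engine and library versions
    tesseract --list-langs   # installed language data
}
```

With only the default dependencies installed, the language list would typically contain eng and osd.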
Install tesseract-ocr Using apt
Update the apt package index with apt using the following command:
sudo apt update
After updating the package index, we can install tesseract-ocr with apt by running the following command:
sudo apt -y install tesseract-ocr
Install tesseract-ocr Using aptitude
If you want to follow this method, you might need to install aptitude first, since aptitude is usually not installed by default on Ubuntu.
Update the apt package index with aptitude using the following command:
sudo aptitude update
After updating the package index, we can install tesseract-ocr with aptitude by running the following command:
sudo aptitude -y install tesseract-ocr
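Because aptitude may be missing, a guard like the one below could be run before the aptitude steps above. This is only a sketch: ensure_aptitude is a hypothetical name, and it assumes apt-get and sudo are available.

```shell
# Sketch: install aptitude only when it is missing.
# ensure_aptitude is a hypothetical helper; it assumes apt-get and sudo exist.
ensure_aptitude() {
    if command -v aptitude >/dev/null 2>&1; then
        echo "aptitude is already installed"
        return 0
    fi
    sudo apt-get update && sudo apt-get -y install aptitude
}
```

Calling ensure_aptitude once is enough; later calls only print the message and change nothing.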
How To Uninstall tesseract-ocr on Ubuntu
To uninstall only the tesseract-ocr package, we can use the following command:
sudo apt-get remove tesseract-ocr
Uninstall tesseract-ocr And Its Dependencies
To uninstall tesseract-ocr and any of its dependencies that are no longer needed, we can use the command below:
sudo apt-get -y autoremove tesseract-ocr
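Since autoremove can take other packages with it, it may be worth previewing the result first. The sketch below relies on the -s (simulate) option of apt-get, which prints the planned actions without performing them; preview_autoremove is a hypothetical wrapper name.

```shell
# Sketch: preview a removal without changing the system.
# preview_autoremove is a hypothetical wrapper name.
preview_autoremove() {
    # -s (--simulate) prints the planned actions and performs none of them.
    apt-get -s autoremove "$1"
}
```

Running preview_autoremove tesseract-ocr should list the packages that the real command would remove.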
Remove tesseract-ocr Configurations and Data
To remove tesseract-ocr together with its configuration and data from Ubuntu, we can use the following command:
sudo apt-get -y purge tesseract-ocr
Remove tesseract-ocr Configurations, Data, and All of Its Dependencies
To remove tesseract-ocr along with its configurations, data, and all of its dependencies that are no longer needed, we can use the following command:
sudo apt-get -y autoremove --purge tesseract-ocr
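After any of the removal commands above, we can ask dpkg what state the package is in. The following is a sketch using dpkg-query; the function name package_state and the fallback text "not installed" are choices made for this example.

```shell
# Sketch: report the dpkg state of a package.
# package_state is a hypothetical helper name.
package_state() {
    # dpkg-query prints a status such as "install ok installed" or
    # "deinstall ok config-files"; if the query fails (for example
    # because dpkg has no record of the package), print a fallback.
    dpkg-query -W -f='${Status}\n' "$1" 2>/dev/null || echo "not installed"
}
```

A status of "deinstall ok config-files" would suggest the package was removed but not purged, so its configuration files are still present.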
Dependencies
tesseract-ocr has the following dependencies (versioned library names such as libicu70 vary between Ubuntu releases):
- libarchive13
- libc6
- libcairo2
- libfontconfig1
- libgcc-s1
- libglib2.0-0
- libicu70
- liblept5
- libpango-1.0-0
- libpangocairo-1.0-0
- libpangoft2-1.0-0
- libstdc++6
- libtesseract4
- tesseract-ocr-eng
- tesseract-ocr-osd
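To check the dependency list on your own Ubuntu release, a query along the following lines could be used. It is a sketch that filters the output of apt-cache depends; list_depends is a hypothetical name, and alternative dependencies (which apt-cache marks with a leading |) are skipped by this simple filter.

```shell
# Sketch: print the hard dependencies of a package, one per line.
# list_depends is a hypothetical helper name.
list_depends() {
    # Keep only lines whose first field is exactly "Depends:" and
    # print the package name that follows.
    apt-cache depends "$1" | awk '$1 == "Depends:" { print $2 }'
}
```

Running list_depends tesseract-ocr should produce a list similar to the one above, without the leading dashes.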
Summary
In this tutorial we learned how to install the tesseract-ocr package on Ubuntu using three package management tools: apt, apt-get, and aptitude.