tesseract: Open Source OCR Engine

An OCR engine with unicode (UTF-8) support that can recognize over 100 languages out of the box.

Version: 1.4
Imports: Rcpp (≥ 0.12.10), curl, digest
LinkingTo: Rcpp
Suggests: magick, pdftools, tiff
Published: 2017-03-21
Author: Jeroen Ooms
Maintainer: Jeroen Ooms <jeroen at berkeley.edu>
BugReports: https://github.com/ropensci/tesseract/issues
License: MIT + file LICENSE
URL: https://github.com/ropensci/tesseract
NeedsCompilation: yes
SystemRequirements: Tesseract >= 3.03 (libtesseract-dev / tesseract-devel) and Leptonica (libleptonica-dev / leptonica-devel). On Debian you need to install the English training data separately (tesseract-ocr-eng)
Materials: NEWS
In views: NaturalLanguageProcessing
CRAN checks: tesseract results

Downloads:

Reference manual: tesseract.pdf
Package source: tesseract_1.4.tar.gz
Windows binaries: r-devel: tesseract_1.4.zip, r-release: tesseract_1.4.zip, r-oldrel: tesseract_1.4.zip
OS X El Capitan binaries: r-release: tesseract_1.4.tgz
OS X Mavericks binaries: r-oldrel: tesseract_1.4.tgz
Old sources: tesseract archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=tesseract to link to this page.