Tesseract ocr скачать для windows

Source Code

Source code of Tesseract’s Releases.

Binaries for Linux

Tesseract is included in most Linux distributions.

Binaries for Windows

Old Downloads

Downloads Archive on SourceForge.
There you can find, among other files, Windows installer for the old version 3.02.

Currently, there is no official Windows installer for newer versions.

3rd party Windows exe’s/installer

  • Cygwin includes packages for Tesseract.

  • UB Mannheim has installers available for current (5.3.0) and older versions.

  • Unofficial Binaries

5.3.3

What’s Changed

  • Disable -mfpu=neon for aarch64 by @hesmar in #4098
  • Fix build without git clone in cloned directory by @pkubaj in #4099
  • Fix some issues which were reported by Coverity Scan by @stweil in #4097
  • Update ScrollView.java by @Parryword in #4103
  • Fix some code comments by @stweil in #4113
  • Optimize function ImageFind::FindImages by @stweil in #4114
  • Rename BibTex file to please GitHub by @stweil in #4115
  • Fix Broken URLs in citations.bib by @kevinunger in #4118
  • initDSProfile: correct std::vector usage by @stima in #4124
  • Fix typo in stepblob.h by @eltociear in #4133
  • Fix regression in layout detection since 5.0.0 (fixes issue #4014) by @stweil in #4136
  • Update ScrollView.java by @Parryword in #4104
  • Fix loading of sublangs (regression) by @stweil in #4141

New Contributors

  • @hesmar made their first contribution in #4098
  • @Parryword made their first contribution in #4103
  • @kevinunger made their first contribution in #4118
  • @stima made their first contribution in #4124
  • @eltociear made their first contribution in #4133

Full Changelog: 5.3.2…5.3.3

5.3.2

5.3.1

What’s Changed

  • Update README.md by @seupedro in #3992
  • Fix FP division by zero (issue #3995) by @stweil in #3996
  • Fix linkage of icu and pango by @autoantwort in #4006
  • Fix build with gcc 13 by including by @kraj in #4009
  • msvc debug: fix wrong lib name in generated pkgconfig file by @autoantwort in #4008
  • Fix libdir in tesseract.pc from CMake by @ferdnyc in #4013
  • Replace ‘can not’ by ‘cannot’ by @stweil in #4015
  • Readme: Link to list of supported languages by @tooomm in #4027
  • Improve the DebugDump output by slightly adjusting the format. by @GerHobbelt in #4022
  • Fix issue #4010 by @amitdo in #4041

New Contributors

  • @seupedro made their first contribution in #3992
  • @autoantwort made their first contribution in #4006
  • @kraj made their first contribution in #4009
  • @ferdnyc made their first contribution in #4013
  • @tooomm made their first contribution in #4027

Full Changelog: 5.3.0…5.3.1

5.3.0

This is a new minor version of Tesseract 5.

What’s Changed

  • Fix memory issues in ScrollView::MessageReceiver by @p12tic in #3872
  • autotools: Add rule for svpaint executable by @stweil in #3873
  • Replace call of exit function by return statement in main function by @stweil in #3878
  • Fix the build on CodeQL/Analyze by @arseniy-sonar in #3888
  • CI: Remove Ubuntu 18.04 by @amitdo in #3902
  • configure.ac: fix build on aarch64_be by @ffontaine in #3907
  • SW CI: Add paths filter by @amitdo in #3908
  • Create .mailmap by @amitdo in #3910
  • Fix tesseract.pc from cmake to match autotools by @jeroen in #3930
  • Update README.md by @nicholasz2510 in #3935
  • Fixed 2 errors by @Gitoffthelawn in #3938
  • fix issue #3940 — remove colormap before thresholding by @zdenop in #3942
  • Update upload-artifact action by @rettinghaus in #3949
  • Update checkout action to version 3 by @rettinghaus in #3948
  • Fix Markdownlint by @Saibamen in #3950
  • Fix broken links in CONTRIBUTING.md by @doraeric in #3951
  • pdfrenderer.cpp: Ignore non-text blocks by @amitdo in #3959
  • lstm.train: allow .box from .raw.png too by @bertsky in #3962
  • Fix a number of performance issues (reported by Coverity Scan) by @stweil in #3967
  • Fix training tools for legacy engine (issue #3925) by @stweil in #3970
  • Fix function tesseract::WriteFeature (issue #3925) by @stweil in #3972
  • Modernize function ObjectCache::DeleteUnusedObjects (fix issue with s… by @stweil in #3978
  • More fixes for issue #3925 by @stweil in #3977

New Contributors

  • @p12tic made their first contribution in #3872
  • @arseniy-sonar made their first contribution in #3888
  • @nicholasz2510 made their first contribution in #3935
  • @rettinghaus made their first contribution in #3949
  • @Saibamen made their first contribution in #3950
  • @doraeric made their first contribution in #3951

Full Changelog: 5.2.0…5.3.0

5.3.0-rc1

What’s Changed

  • Fix memory issues in ScrollView::MessageReceiver by @p12tic in #3872
  • autotools: Add rule for svpaint executable by @stweil in #3873
  • Replace call of exit function by return statement in main function by @stweil in #3878
  • Fix the build on CodeQL/Analyze by @arseniy-sonar in #3888
  • CI: Remove Ubuntu 18.04 by @amitdo in #3902
  • configure.ac: fix build on aarch64_be by @ffontaine in #3907
  • SW CI: Add paths filter by @amitdo in #3908
  • Create .mailmap by @amitdo in #3910
  • Fix tesseract.pc from cmake to match autotools by @jeroen in #3930
  • Update README.md by @nicholasz2510 in #3935
  • Fixed 2 errors by @Gitoffthelawn in #3938
  • fix issue #3940 — remove colormap before thresholding by @zdenop in #3942
  • Update upload-artifact action by @rettinghaus in #3949
  • Update checkout action to version 3 by @rettinghaus in #3948
  • Fix Markdownlint by @Saibamen in #3950
  • Fix broken links in CONTRIBUTING.md by @doraeric in #3951
  • pdfrenderer.cpp: Ignore non-text blocks by @amitdo in #3959
  • lstm.train: allow .box from .raw.png too by @bertsky in #3962
  • Fix a number of performance issues (reported by Coverity Scan) by @stweil in #3967
  • Fix training tools for legacy engine (issue #3925) by @stweil in #3970
  • Fix function tesseract::WriteFeature (issue #3925) by @stweil in #3972
  • Modernize function ObjectCache::DeleteUnusedObjects (fix issue with s… by @stweil in #3978
  • More fixes for issue #3925 by @stweil in #3977

New Contributors

  • @p12tic made their first contribution in #3872
  • @arseniy-sonar made their first contribution in #3888
  • @nicholasz2510 made their first contribution in #3935
  • @rettinghaus made their first contribution in #3949
  • @Saibamen made their first contribution in #3950
  • @doraeric made their first contribution in #3951

Full Changelog: 5.2.0…5.3.0-rc1

5.2.0

This is a new minor version of Tesseract 5.

  • Improvements and fixes for continuous integration,
    autoconf and cmake builds.
  • Set /Os for some 32 bit MS compilers (fixes #3769).
  • Improve comments and other documentation.
  • Add initial support for Intel AVX512F.
  • Fix for very large PDF files on 32 bit hosts (fixes #3805).
  • Fix NEON detection on FreeBSD.
  • Fix regression with UZN files (fixes #3837).
  • Fix calling delete[] for memory allocated by malloc in C API.
  • Add an API function to init tesseract with traineddata from memory
    (fixes #3691).
  • Replace direct access to Leptonica internal data structures by
    function calls and support latest releases of Leptonica.
  • Replace std::regex by std::string functions (fixes issue #3830).
  • Use compiled-in TESSDATA_PREFIX also on Windows (fixes #3767).
  • Add new parameter ‘invert_threshold’, change the default threshold
    from 0.5 to 0.7 and mark parameter ‘tessedit_do_invert’ as deprecated.

See also list of all changes.

5.1.0

This is a new minor version of Tesseract 5.

  • Handle image and line regions in output formats ALTO, hOCR and text.
  • New parameter curl_timeout for curl_easy_setop.
  • Build fixes and improvements.
  • Catch nullptr in PageIterator::Orientation to improve robustness.
  • Remove unused code.

See also list of all changes.

5.0.1

This is a bug fix release of Tesseract 5.0.

  • Add SPDX-License-Identifier to public include files.
  • Support redirections when running OCR on a URL.
  • Lots of fixes and improvements for cmake builds.
    Distributions should use the autoconf build.
  • Fix broken msys2 build with gcc 11.
  • Fix parameter certainty_scale (was duplicated).
  • Fix some compiler warnings and clean code.
  • Correctly detect amd64 and i386 on FreeBSD.
  • Add libarchive and libcurl in continuous integration actions.
  • Update submodule googletest to release v1.11.0.

See also list of all changes.

5.0.0

This is the final stable release of Tesseract 5.0.0.

  • Limit BCER to interval [0,1]
  • Improved build process
  • Cleaned code

See also list of all changes.

5.0.0-rc3

This is the third release candidate of Tesseract 5.0.0.

  • Improve training messages
  • Add RowAttributes getter to PageIterator

See also list of all changes.

Tesseract is an open source OCR or optical character recognition engine and command line program. OCR is a technology that allows for the recognition of text characters within a digital image. With the latest version of Tesseract, there is a greater focus on line recognition, however it still supports the legacy Tesseract OCR engine which recognizes character patterns.

Tesseract can recognize over 100 languages out-of-the-box, and can be trained to recognize other languages. It supports various output formats, including plain text, HTML, PDF and more. It also has unicode (UTF-8) support.

Features

  • OCR engine and command line program
  • Line recognition and character pattern recognition
  • Unicode (UTF-8) support
  • Recognizes more than 100 languages, and can be trained to recognize others
  • Supports various output formats

Project Samples

Tesseract 3.02 running on Gnome Terminal 3.8.0 (screenshot by Naga2raja)

License

Apache License V2.0

Simplify applicant selection process with eSkill, a web-based pre-employement screening and skills assessment platform. Icon

eSkill features an extensive modular subject library that enables users to create single or multi-subject based exams for applicants. eSkill allows users to edit existing questions, upload or generate their own test content with the application’s editor. eSkill also provides users with a number of job-based assessments that test employee skills for different positions in different industries.

User Reviews

1. Установка Tesseract-OCR

Сначала загрузите установочный файл Тессеракт-OCR.

Адрес загрузки:(1) https://github.com/tesseract-ocr/tesseract/wiki/Downloads

                    (2) https://digi.bib.uni-mannheim.de/tesseract

Я использовал второй адрес и загрузил установочный файл Tesseract-OCR-Setup-3.05.01.exe.

Вы можете начать устанавливать этот файл.

Есть два очка, чтобы обратить внимание:

(1)При загрузке данных языка, по умолчанию для установки на английском языке, если вы хотите использовать Тессеракт текст ручки текст, вам необходимо проверить дополнительный язык данных. Но рекомендуется не проверять все, потому что мы не используем большинство языков, а установка за проверкой будет потреблять долгое время.

(2)Будьте осторожны, чтобы запомнить ваш путь установки, потому что он должен использоваться, когда переменная среды установлена.

Например, я устанавливаю здесь в папке D: / Tesseract.

2. Изменить переменные среды

2.1 После установки Тессеракта-OCR закончена, необходимо добавить его путь для установки переменной PATH среды системы.

Введите следующий интерфейс с помощью панели управления Панель системной системы Расширенные настройки:

Нажмите на переменную среды:

Выберите путь в системной переменной, нажмите кнопку Изменить, а затем добавить в папку D: \ Тессеракта \ Тессеракта-OCR на пути Путь к пути PATH по newting.

2.2 Добавить переменную Tessdata_prefix

После установки пути мы также будем создавать переменную TESSDATA_PREFIX в системной переменной, а значение переменной является дорожной мощностью D: \ Tesseract \ Tesseract-OCR. Если это не установлено, введите tesseract -list-langs, отобразит любые языковые пакеты, которые не могут быть загружены.

Нажмите кнопку Создать, задайте имя переменной и значение переменной следующим образом:

На данный момент Tesseract-OCR завершен.

3. Проверьте, если Tesseract-OCR успешно установлен

Откройте командную строку, введите tesseract -v, вернется версию Tesseract, которая в настоящее время устанавливается.

Введите Tesseract —list-langs Проверьте языковой пакет

Если все идет хорошо, Tesseract-OCR был успешно установлен и может быть использован.

Скачать Tesseract

Для Windows

Скриншот приложения Tesseract - №1

Проверено на вирусы

?

Проверено на virustotal

Характеристики

Описание

Версии

Обновлено

Совместимость

Windows XP, Windows Vista, Windows 7

Описание

Tesseract — это программа для распознавания текстов. В настоящий момент программа умеет работать с UTF-8, а поддержка различных языков осуществляется с помощью дополнительных модулей.                

Версии

Нет официального представителя разработчика на сайте

Рейтинг

Нажмите, для быстрой оценки

Оставить отзыв

Отзывы

Похожие приложения

Актуальные статьи

  • Tftp клиент для windows 10
  • Test host windows что это такое
  • Tftp windows 10 не работает
  • Test host windows при выключении что это
  • Tftp server скачать для windows 10