tesseract build script checks /proc/cpuinfo file, but this file is
not exist or darwin. This check is used for getting optimized build
flags depends on CPU. This feature does not work in cross-compile
case and should be explicitly disabled.
This patch disables CPU detection using cmake option
BUILD_TRAINING_TOOLS=OFF.
Generated binaries (aarch64 target) are exactly the same on
Ubuntu 20.04 x86_64 and MacOS 11.6 arm64. Binaries generated on
Linux build host are not changed (the same checksums before and
after applying this patch).
Signed-off-by: Sergey V. Lobanov <sergey@lobanov.in>
Switch to AUTORELEASE for simplicity.
Add libarchive dependency since it's now used.
Build with ninja for faster compilation.
Refresh SIMD CMake patch and add musl patch.
Signed-off-by: Rosen Penev <rosenp@gmail.com>
Tesseract is an open source text recognizer (OCR) Engine, available under the Apache 2.0 license. It can be used directly, or (for programmers) using an API to extract printed text from images. It supports a wide variety of languages.
Signed-off-by: Valentín Kivachuk <vk18496@gmail.com>