MERGE_UNICHARSETS(1)

NAME

merge_unicharsets - Simple tool to merge two or more unicharsets.

SYNOPSIS

merge_unicharsets unicharset-in-1 ... unicharset-in-n unicharset-out

DESCRIPTION

merge_unicharsets(1) merges two or more unicharsets into a single unicharset. It can be used to create a combined unicharset for a script-level engine, such as the new Latin or Devanagari engines.
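As a rough illustration of the script-level use case, the sketch below only prints the command line that would combine several per-language unicharsets; it does not run the tool. The helper name print_merge_command, the directory layout <dir>/<lang>/<lang>.unicharset, and the language codes are assumptions made for this example and are not defined by this manual page.

```shell
# Dry-run sketch: build the merge_unicharsets command for one script.
# Assumption: inputs live at <dir>/<lang>/<lang>.unicharset (hypothetical layout).
# print_merge_command is a hypothetical helper, not part of tesseract.
print_merge_command() {
    dir=$1; shift      # directory holding the per-language folders
    out=$1; shift      # name of the merged unicharset to write
    cmd="merge_unicharsets"
    # Every remaining argument is a language code; inputs are listed first.
    for lang in "$@"; do
        cmd="$cmd $dir/$lang/$lang.unicharset"
    done
    # The output file is always the last argument on the command line.
    printf '%s %s\n' "$cmd" "$out"
}
```

Calling print_merge_command langdata Latin.unicharset eng deu fra prints a single merge_unicharsets command with the three input files followed by Latin.unicharset, which can be reviewed before it is run.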

IN/OUT ARGUMENTS

unicharset-in-1

(Input) The name of the first unicharset file to be merged.

unicharset-in-n

(Input) The name of the nth unicharset file to be merged.

unicharset-out

(Output) The name of the merged unicharset file.
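The argument order above (all inputs first, the output last) can be guarded with a small wrapper. This is a minimal sketch, assuming a POSIX shell and that merge_unicharsets is on the PATH; merge_checked is a hypothetical helper name, and the check for at least three arguments is inferred from the "two or more unicharsets" wording rather than from documented behavior.

```shell
# Sketch of a guarded call to merge_unicharsets.
# merge_checked is a hypothetical helper name, not part of tesseract.
merge_checked() {
    # Two or more inputs plus one output means at least three arguments.
    if [ "$#" -lt 3 ]; then
        echo "usage: merge_checked unicharset-in-1 ... unicharset-in-n unicharset-out" >&2
        return 2
    fi
    # Inputs come first; the final argument names the merged output file.
    merge_unicharsets "$@"
}
```

With placeholder file names, merge_checked eng.unicharset deu.unicharset Latin.unicharset passes its arguments through unchanged, while a call with fewer than three arguments returns status 2 without invoking the tool.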

HISTORY

merge_unicharsets(1) was first made available for tesseract 4.00.00alpha.

RESOURCES

Main web site: https://github.com/tesseract-ocr

Information on training tesseract LSTM: https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00

COPYING

AUTHOR

The Tesseract OCR engine was written by Ray Smith and his research groups at Hewlett Packard (1985-1995) and Google (2006-present).

11/17/2021

Source file:	merge_unicharsets.1.en.gz (from tesseract 4.1.1-2.el8)
Source last updated:	2021-11-15 15:42:29