MFTRAINING(1) | MFTRAINING(1) |
NAME¶
mftraining - feature training for Tesseract
SYNOPSIS¶
mftraining -U unicharset -O lang.unicharset FILE...
DESCRIPTION¶
mftraining takes a list of .tr files, from which it generates the files inttemp (the shape prototypes), shapetable, and pffmtable (the number of expected features for each character). (A fourth file called Microfeat is also written by this program, but it is not used.)
OPTIONS¶
-U FILE
-F font_properties_file
*font_name* *italic* *bold* *fixed_pitch* *serif* *fraktur*
-X xheights_file
*font_name* *xheight*
-D dir
-O FILE
SEE ALSO¶
tesseract(1), cntraining(1), unicharset_extractor(1), combine_tessdata(1), shapeclustering(1), unicharset(5)
https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract
COPYING¶
Copyright (C) Hewlett-Packard Company, 1988 Licensed under the Apache License, Version 2.0
AUTHOR¶
The Tesseract OCR engine was written by Ray Smith and his research groups at Hewlett Packard (1985-1995) and Google (2006-present).
11/17/2021 |