.ds Aq ’
set_unicharset_properties --U input_unicharsetfile --script_dir /path/to/langdata --O output_unicharsetfile
set_unicharset_properties(1) reads a unicharset file, puts the result in a UNICHARSET object, fills it with properties about the unichars it contains and writes the result back to another unicharset file.
--script_dir /path/to/langdata
(Input) Specify the location of directory for universal script unicharsets and font xheights (type:string default:)
--U unicharsetfile
(Input) Specify the location of the unicharset to load as input.
--O unicharsetfile
(Output) Specify the location of the unicharset to be written with updated properties.
set_unicharset_properties(1) was first made available for tesseract version 3.03.
Main web site: m[blue]https://github.com/tesseract-ocrm[] Information on training: m[blue]https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.htmlm[]
Copyright (C) 2012 Google, Inc. Licensed under the Apache License, Version 2.0
The Tesseract OCR engine was written by Ray Smith and his research groups at Hewlett Packard (1985-1995) and Google (2006-present).