First look down all the words with capital S and decide if they should be the letter f instead and search and replace them.To deal with these I recommend the following:. So you will find perSon and likewiSe there are also instances where try as a might (see below) Abbyy finereader puts a capital S where I have trained it to recognise the letter f so you may find Srom instead of from, and aSter instead of after. But it is a simple task to search and replace my and with æ and œ afterwards.įor some unknown reason my Abbyy Frinereader 12 puts a capital S in place of a small s in the OCR output for some of the recognised long s characters. (I figured if the default character set needs changing somehow, then others, certainly those with FR12 would have the same issues). I could not get my Abbyy Finereader to accept æ and œ characters into its trained character set, so instead you will see and in the OCR output. I still recommend running the resultant text through Guiprep I use Infraview batch convert to upscale images to 600dpi if required, note there is a box within the Infraview options to set the DPI to a particular value. If you don't Abbyy Finereader will give the following warning "Selected user pattern has been training at a different image resolution and may not work for this image" Here are some typical examples of the font style, and the OCR output produced by my trained version of Finereader, note there are still problems which I discuss below, and how to possibly deal with them. Printed Salem (1799) and The British Libriran Printed London (1738) They being A Treatise On The Plague And Yellow Fever, By James Tytler. I trained Finereaderr on two texts, one from the USA the other from England. The under the "Training" heading select "Use built-in and users patterns", note keep the "Read with training" choice unticked, unless you want to further add to the training pattern.Įxamples of characters trained in Abbyy Finereader from the Caslon FontĮxamples from two texts showing the OCR output To check it is active click on the "Pattern Editor." button and it should show "long_s(1730-1820)_v1(active)" v1 for version 1, as I may update some time in the future. Under the "User patterns and languages" heading click the "Load from File." button, and select my training file (.FBT) extension. (Dropbox link), in Abbyy Finereader on the menu bar choose TOOLS then Options. Once you have downloaded my Training File. How to configure Abbyy Finereader for my user pattern I use Abbyy Finereader 12 pro, I am assuming users with later versions will still be able to use my training file otherwise the whole exersise is a bit limited, as only useful for those with FR12 (this needs to be checked out with other users). I have been through two old texts one (1738) printed in London and the other (1799) printed in the U.S.A printed text, content providers are more than welcome to try my Training File. FBT file extension), this can then be loaded and used again. Once the 'training' has been done, the resultant file can be saved (it has a. While the program Guiprep has specific option to convert texts with the long s character following OCR, it is also possible to run Abbyy Finereader in 'training mode' to attempt to recognise the various long s characters and replace them instead with the letter 's' in the OCR output. 7 Some comments on strange indeterminate problems I had with training Abbyy Finereader.6 Things to do after you have run the OCR.5 Make sure your scans for OCR are 600 dpi.4 Examples from two texts showing the OCR output.3 Examples of characters trained in Abbyy Finereader from the Caslon Font.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |