Tesseract gibt keine konsistenten Ergebnisse zurück

Problem.Tesseract gibt keine konsistenten Ergebnisse zurück

Ich möchte ein beliebtes Smartphone Spiel screenscrap, um Gold, Elixir und Dark Elixir Werte von einer Instanz des Spiels auf einem Android VM-Image zu erhalten.

testeract markiert jedoch erfolgreich einige Beispiele, weigert sich aber, andere erfolgreich zu markieren. Das Testen derselben Probe mit einer Online-OCR gibt ein positives Ergebnis zurück.

Ich benutze die Standard Englisch ausgebildete Daten und trainiert auch Tesseract für die Supercell-Magic Schriftart Erhöhung der Genauigkeit um rund 30%.

Proben

gold_sample_1

gold_sample_1_processed

magick gold_sample_1.png -fill Black +opaque "#fffbcc" -fill White -opaque "#fffbcc" gold_sample_1_processed.png

Output

40 494

gold_sample_3

gold_sample_3_processed

magick gold_sample_3.png -fill Black +opaque "#ffffff" -fill White -opaque "#ffffff" gold_sample_3_processed.png

Ausgabe

There is nothing in the output file

jedoch das gleiche zu einem online OCR Hochladen gibt mir diese:

Specs.

OS.

Windows 7 x64 SP1

Mein Win7 noch Stealth sich wie die vielen weltweit Ninja-Stil aufgerüstet Eiskreme;)

Tesseract OCR.

tesseract 3.05.00dev 
leptonica-1.73 (Feb 5 2016, 01:13:58) [MSC v.1900 LIB Release x86] 
libgif 5.1.2 : libjpeg 9 : libpng 1.6.19 : libtiff 4.0.2 : zlib 1.2.8 : libwebp 0.3.1.

Bild Magick.

Version: ImageMagick 7.0.2-1 Q8 x86 2016-06-23 http://www.imagemagick.org 
Copyright: Copyright (C) 1999-2015 ImageMagick Studio LLC 
License: http://www.imagemagick.org/script/license.php 
Visual C++: 180040629 
Features: Cipher DPC Modules OpenMP 
Delegates (built-in): bzlib cairo flif freetype jng jp2 jpeg lcms lqr openexr pangocairo png ps rsvg tiff webp xml zlib

Quelle

2016-06-29 silberbaum

Gelöst!

Durch explizite Angabe eines psm-Modus.

tesseract --help-psm 
Page segmentation modes: 
    0 Orientation and script detection (OSD) only. 
    1 Automatic page segmentation with OSD. 
    2 Automatic page segmentation, but no OSD, or OCR. 
    3 Fully automatic page segmentation, but no OSD. (Default) 
    4 Assume a single column of text of variable sizes. 
    5 Assume a single uniform block of vertically aligned text. 
    6 Assume a single uniform block of text. 
    7 Treat the image as a single text line. 
    8 Treat the image as a single word. 
    9 Treat the image as a single word in a circle. 
10 Treat the image as a single character.

Das Bild:

und Befehl:

tesseract gold_sample_3_processed.png sample3 -l eng2 -psm 8

gibt Ausgang:

Danke sowieso Internet Fremden.

Quelle

2016-06-29 17:54:10 silberbaum

Was ist der Unterschied für PSM 7 und 8? – Wesley

Tesseract gibt keine konsistenten Ergebnisse zurück

Antwort

Verwandte Themen