Search forum and issue tracker for opencl topic.

Zdenko


st 19. 2. 2020 o 8:27 Tim Finnegan <[email protected]> napísal(a):

> I'm attempting to run GPU Acceleration during training using the OpenCL
> libraries.
>
> I have built tesseract to use openCL, and installed the NVidia Compute
> driver 440 on my Ubuntu 19.10 installation
>
> Whenever I run tesstrain.sh, however, I run into the issue that the
> program refuses to select the proper GPU. Rather than use my
> NVidia GeForce GTX 1060 6GB device, it will select the CPU as my default
> OpenCL device even though it detects my GPU,
> and scores it better in the built-in benchmark.
>
> Setting TESSERACT_OPENCL_DEVICE=1 seems to do nothing as nvidia-smi shows
> that the process is not utilizing my GPU.
>
> Here is my tesstrain.sh output:
>
> === Starting training for language 'eng'
>> [Tue 18 Feb 2020 04:55:13 PM PST] /usr/local/bin/text2image
>> --fonts_dir=/usr/share/fonts --ptsize 12 --font=Chit
>> --outputbase=/tmp/font_tmp.Hk8xAdjwI8/sample_text.txt
>> --text=/tmp/font_tmp.Hk8xAdjwI8/sample_text.txt
>> --fontconfig_tmpdir=/tmp/font_tmp.Hk8xAdjwI8
>> Rendered page 0 to file /tmp/font_tmp.Hk8xAdjwI8/sample_text.txt.tif
>> === Phase I: Generating training images ===
>> Rendering using Chit
>> [Tue 18 Feb 2020 04:55:15 PM PST] /usr/local/bin/text2image
>> --fontconfig_tmpdir=/tmp/font_tmp.Hk8xAdjwI8 --fonts_dir=/usr/share/fonts
>> --strip_unrenderable_words --leading=32 --xsize=3600 --char_spacing=0.0
>> --exposure=1 --outputbase=/tmp/eng-2020-02-18.Gfj/eng.Chit.exp1
>> --max_pages=0 --font=Chit --ptsize 12
>> --text=/home/tim/PycharmProjects/RnD/OCR_Dataset/langdata/eng/eng.training_text
>> Stripped 35 unrenderable words
>> Rendered page 0 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp1.tif
>> Stripped 6 unrenderable words
>> Rendered page 1 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp1.tif
>> Rendering using Chit
>> [Tue 18 Feb 2020 04:55:17 PM PST] /usr/local/bin/text2image
>> --fontconfig_tmpdir=/tmp/font_tmp.Hk8xAdjwI8 --fonts_dir=/usr/share/fonts
>> --strip_unrenderable_words --leading=32 --xsize=3600 --char_spacing=0.0
>> --exposure=2 --outputbase=/tmp/eng-2020-02-18.Gfj/eng.Chit.exp2
>> --max_pages=0 --font=Chit --ptsize 12
>> --text=/home/tim/PycharmProjects/RnD/OCR_Dataset/langdata/eng/eng.training_text
>> Stripped 35 unrenderable words
>> Rendered page 0 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp2.tif
>> Stripped 6 unrenderable words
>> Rendered page 1 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp2.tif
>> Rendering using Chit
>> [Tue 18 Feb 2020 04:55:19 PM PST] /usr/local/bin/text2image
>> --fontconfig_tmpdir=/tmp/font_tmp.Hk8xAdjwI8 --fonts_dir=/usr/share/fonts
>> --strip_unrenderable_words --leading=32 --xsize=3600 --char_spacing=0.0
>> --exposure=3 --outputbase=/tmp/eng-2020-02-18.Gfj/eng.Chit.exp3
>> --max_pages=0 --font=Chit --ptsize 12
>> --text=/home/tim/PycharmProjects/RnD/OCR_Dataset/langdata/eng/eng.training_text
>> Stripped 35 unrenderable words
>> Rendered page 0 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp3.tif
>> Stripped 6 unrenderable words
>> Rendered page 1 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp3.tif
>> Rendering using Chit
>> [Tue 18 Feb 2020 04:55:22 PM PST] /usr/local/bin/text2image
>> --fontconfig_tmpdir=/tmp/font_tmp.Hk8xAdjwI8 --fonts_dir=/usr/share/fonts
>> --strip_unrenderable_words --leading=32 --xsize=3600 --char_spacing=0.0
>> --exposure=4 --outputbase=/tmp/eng-2020-02-18.Gfj/eng.Chit.exp4
>> --max_pages=0 --font=Chit --ptsize 12
>> --text=/home/tim/PycharmProjects/RnD/OCR_Dataset/langdata/eng/eng.training_text
>> Stripped 35 unrenderable words
>> Rendered page 0 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp4.tif
>> Stripped 6 unrenderable words
>> Rendered page 1 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp4.tif
>> Rendering using Chit
>> [Tue 18 Feb 2020 04:55:24 PM PST] /usr/local/bin/text2image
>> --fontconfig_tmpdir=/tmp/font_tmp.Hk8xAdjwI8 --fonts_dir=/usr/share/fonts
>> --strip_unrenderable_words --leading=32 --xsize=3600 --char_spacing=0.0
>> --exposure=5 --outputbase=/tmp/eng-2020-02-18.Gfj/eng.Chit.exp5
>> --max_pages=0 --font=Chit --ptsize 12
>> --text=/home/tim/PycharmProjects/RnD/OCR_Dataset/langdata/eng/eng.training_text
>> Stripped 35 unrenderable words
>> Rendered page 0 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp5.tif
>> Stripped 6 unrenderable words
>> Rendered page 1 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp5.tif
>> Rendering using Chit
>> [Tue 18 Feb 2020 04:55:27 PM PST] /usr/local/bin/text2image
>> --fontconfig_tmpdir=/tmp/font_tmp.Hk8xAdjwI8 --fonts_dir=/usr/share/fonts
>> --strip_unrenderable_words --leading=32 --xsize=3600 --char_spacing=0.0
>> --exposure=6 --outputbase=/tmp/eng-2020-02-18.Gfj/eng.Chit.exp6
>> --max_pages=0 --font=Chit --ptsize 12
>> --text=/home/tim/PycharmProjects/RnD/OCR_Dataset/langdata/eng/eng.training_text
>> Stripped 35 unrenderable words
>> Rendered page 0 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp6.tif
>> Stripped 6 unrenderable words
>> Rendered page 1 to file /tmp/eng-2020-02-18.Gfj/eng.Chit.exp6.tif
>> === Phase UP: Generating unicharset and unichar properties files ===
>> [Tue 18 Feb 2020 04:55:28 PM PST] /usr/local/bin/unicharset_extractor
>> --output_unicharset /tmp/eng-2020-02-18.Gfj/eng.unicharset --norm_mode 1
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp1.box
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp2.box
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp3.box
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp4.box
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp5.box
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp6.box
>> Extracting unicharset from box file
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp1.box
>> Extracting unicharset from box file
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp2.box
>> Extracting unicharset from box file
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp3.box
>> Extracting unicharset from box file
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp4.box
>> Extracting unicharset from box file
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp5.box
>> Extracting unicharset from box file
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp6.box
>> Other case É of é is not in unicharset
>> Wrote unicharset file /tmp/eng-2020-02-18.Gfj/eng.unicharset
>> [Tue 18 Feb 2020 04:55:29 PM PST]
>> /usr/local/bin/set_unicharset_properties -U
>> /tmp/eng-2020-02-18.Gfj/eng.unicharset -O
>> /tmp/eng-2020-02-18.Gfj/eng.unicharset -X
>> /tmp/eng-2020-02-18.Gfj/eng.xheights
>> --script_dir=/home/tim/PycharmProjects/RnD/OCR_Dataset/langdata
>> Loaded unicharset of size 102 from file
>> /tmp/eng-2020-02-18.Gfj/eng.unicharset
>> Setting unichar properties
>> Other case É of é is not in unicharset
>> Setting script properties
>> Warning: properties incomplete for index 25 = ~
>> Writing unicharset to file /tmp/eng-2020-02-18.Gfj/eng.unicharset
>> === Phase E: Generating lstmf files ===
>> Using
>> TESSDATA_PREFIX=/home/tim/PycharmProjects/RnD/OCR_Dataset/tessdata_best
>> [Tue 18 Feb 2020 04:55:29 PM PST] /usr/local/bin/tesseract
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp1.tif
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp1 --psm 6 lstm.train
>> [Tue 18 Feb 2020 04:55:29 PM PST] /usr/local/bin/tesseract
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp3.tif
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp3 --psm 6 lstm.train
>> [Tue 18 Feb 2020 04:55:29 PM PST] /usr/local/bin/tesseract
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp4.tif
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp4 --psm 6 lstm.train
>> [Tue 18 Feb 2020 04:55:29 PM PST] /usr/local/bin/tesseract
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp2.tif
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp2 --psm 6 lstm.train
>> [Tue 18 Feb 2020 04:55:29 PM PST] /usr/local/bin/tesseract
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp5.tif
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp5 --psm 6 lstm.train
>> [Tue 18 Feb 2020 04:55:29 PM PST] /usr/local/bin/tesseract
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp6.tif
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp6 --psm 6 lstm.train
>> [DS] Profile read from file (tesseract_opencl_profile_devices.dat).
>> [DS] Device[1] 1:GeForce GTX 1060 6GB score is 1.846448
>> [DS] Profile read from file (tesseract_opencl_profile_devices.dat).
>> [DS] Device[2] 0:(null) score is 0.503215
>> [DS] Selected Device[2]: "(null)" (Native)
>> [DS] Overriding Device Selection (TESSERACT_OPENCL_DEVICE=1, 1)
>> [DS] Overridden Device[1]: "GeForce GTX 1060 6GB" (OpenCL)
>> [DS] Device[1] 1:GeForce GTX 1060 6GB score is 1.846448
>> [DS] Device[2] 0:(null) score is 0.503215
>> [DS] Selected Device[2]: "(null)" (Native)
>> [DS] Overriding Device Selection (TESSERACT_OPENCL_DEVICE=1, 1)
>> [DS] Overridden Device[1]: "GeForce GTX 1060 6GB" (OpenCL)
>> [DS] Profile read from file (tesseract_opencl_profile_devices.dat).
>> [DS] Device[1] 1:GeForce GTX 1060 6GB score is 1.846448
>> [DS] Device[2] 0:(null) score is 0.503215
>> [DS] Selected Device[2]: "(null)" (Native)
>> [DS] Overriding Device Selection (TESSERACT_OPENCL_DEVICE=1, 1)
>> [DS] Overridden Device[1]: "GeForce GTX 1060 6GB" (OpenCL)
>> [DS] Profile read from file (tesseract_opencl_profile_devices.dat).
>> [DS] Device[1] 1:GeForce GTX 1060 6GB score is 1.846448
>> [DS] Device[2] 0:(null) score is 0.503215
>> [DS] Selected Device[2]: "(null)" (Native)
>> [DS] Overriding Device Selection (TESSERACT_OPENCL_DEVICE=1, 1)
>> [DS] Overridden Device[1]: "GeForce GTX 1060 6GB" (OpenCL)
>> [DS] Profile read from file (tesseract_opencl_profile_devices.dat).
>> [DS] Device[1] 1:GeForce GTX 1060 6GB score is 1.846448
>> [DS] Device[2] 0:(null) score is 0.503215
>> [DS] Selected Device[2]: "(null)" (Native)
>> [DS] Overriding Device Selection (TESSERACT_OPENCL_DEVICE=1, 1)
>> [DS] Overridden Device[1]: "GeForce GTX 1060 6GB" (OpenCL)
>> [DS] Profile read from file (tesseract_opencl_profile_devices.dat).
>> [DS] Device[1] 1:GeForce GTX 1060 6GB score is 1.846448
>> [DS] Device[2] 0:(null) score is 0.503215
>> [DS] Selected Device[2]: "(null)" (Native)
>> [DS] Overriding Device Selection (TESSERACT_OPENCL_DEVICE=1, 1)
>> [DS] Overridden Device[1]: "GeForce GTX 1060 6GB" (OpenCL)
>> Tesseract Open Source OCR Engine v5.0.0-alpha with Leptonica
>> Tesseract Open Source OCR Engine v5.0.0-alpha with Leptonica
>> Tesseract Open Source OCR Engine v5.0.0-alpha with Leptonica
>> Page 1
>> Page 1
>> Page 1
>> Tesseract Open Source OCR Engine v5.0.0-alpha with Leptonica
>> Page 1
>> Tesseract Open Source OCR Engine v5.0.0-alpha with Leptonica
>> Tesseract Open Source OCR Engine v5.0.0-alpha with Leptonica
>> Page 1
>> Page 1
>> Page 2
>> Loaded 56/56 lines (1-56) of document
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp4.lstmf
>> Page 2
>> Page 2
>> Page 2
>> Loaded 56/56 lines (1-56) of document
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp1.lstmf
>> Loaded 56/56 lines (1-56) of document
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp3.lstmf
>> Loaded 56/56 lines (1-56) of document
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp2.lstmf
>> Page 2
>> Loaded 56/56 lines (1-56) of document
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp5.lstmf
>> Page 2
>> Loaded 56/56 lines (1-56) of document
>> /tmp/eng-2020-02-18.Gfj/eng.Chit.exp6.lstmf
>> === Constructing LSTM training data ===
>> [Tue 18 Feb 2020 04:55:33 PM PST] /usr/local/bin/combine_lang_model
>> --input_unicharset /tmp/eng-2020-02-18.Gfj/eng.unicharset --script_dir
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/langdata --words
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/langdata/eng/eng.wordlist
>> --numbers
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/langdata/eng/eng.numbers --puncs
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/langdata/eng/eng.punc
>> --output_dir /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA --lang eng
>> Loaded unicharset of size 102 from file
>> /tmp/eng-2020-02-18.Gfj/eng.unicharset
>> Setting unichar properties
>> Other case É of é is not in unicharset
>> Setting script properties
>> Config file is optional, continuing...
>> Failed to read data from:
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/langdata/eng/eng.config
>> Null char=2
>> Reducing Trie to SquishedDawg
>> Reducing Trie to SquishedDawg
>> Reducing Trie to SquishedDawg
>> === Saving box/tiff pairs for training data ===
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp1.box to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp2.box to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp3.box to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp4.box to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp5.box to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp6.box to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp1.tif to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp2.tif to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp3.tif to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp4.tif to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp5.tif to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp6.tif to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> === Moving lstmf files for training data ===
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp1.lstmf to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp2.lstmf to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp3.lstmf to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp4.lstmf to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp5.lstmf to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>> Moving /tmp/eng-2020-02-18.Gfj/eng.Chit.exp6.lstmf to
>> /home/tim/PycharmProjects/RnD/OCR_Dataset/DATA
>
>
> Created starter traineddata for LSTM training of language 'eng'
>
>
> Run 'lstmtraining' command to continue LSTM training for language 'eng'
>
>
> And here is my nvidia-smi output during the training process:
>
> Tue Feb 18 17:01:50 2020
>>
>> +-----------------------------------------------------------------------------+
>> | NVIDIA-SMI 440.33.01    Driver Version: 440.33.01    CUDA Version:
>> 10.2     |
>>
>> |-------------------------------+----------------------+----------------------+
>> | GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr.
>> ECC |
>> | Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util
>> Compute M. |
>>
>> |===============================+======================+======================|
>> |   0  GeForce GTX 106...  On   | 00000000:01:00.0  On |
>> N/A |
>> | 22%   56C    P0    23W / 120W |    402MiB /  6072MiB |      0%
>> Default |
>>
>> +-------------------------------+----------------------+----------------------+
>>
>>
>>
>> +-----------------------------------------------------------------------------+
>> | Processes:                                                       GPU
>> Memory |
>> |  GPU       PID   Type   Process name                             Usage
>>     |
>>
>> |=============================================================================|
>> |    0      1254      G   /usr/lib/xorg/Xorg
>> 32MiB |
>> |    0      2003      G   /usr/lib/xorg/Xorg
>>  155MiB |
>> |    0      2223      G   /usr/bin/gnome-shell
>> 97MiB |
>> |    0      2724      G   ...p/pycharm-professional/183/jbr/bin/java
>>  2MiB |
>> |    0      4932      G   /usr/bin/nvidia-settings
>>  0MiB |
>> |    0      5071      G   ...AAAAAAAAAAAAAAgAAAAAAAAA --shared-files
>> 62MiB |
>>
>> +-----------------------------------------------------------------------------+
>>
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/984f04a4-c5da-495c-8231-83c15e51fd77%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/984f04a4-c5da-495c-8231-83c15e51fd77%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8zAnvW83XrLLijsTMCGz1nEZEaCZXxhGzs7E5eajc0h2g%40mail.gmail.com.

Reply via email to