Hi Martyn,

thanks for your quick response. With your help and a little time, I have
written my first extract module of my own.

I scan documents and archive them as DjVu "sandwich" files, which contain the
text recognised by OCR in a hidden layer.

In my module I read out that text (using the DjVuLibre library) and
store it in the Tracker database.
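
For anyone who wants to try a module like this without installing it, the
environment variables Martyn mentions in the quoted mail below could be used
roughly as follows. This is only a sketch under assumptions: it assumes the
module was built in the current directory (so libtool has put the .so into
.libs/ after the switch to modules_LTLIBRARIES), that the matching rule file
(for example 10-djvu.rule) is also in the current directory, and that
test.djvu is the file to extract. Check man tracker-extract for the exact
meaning of both variables.

```shell
# Sketch: point tracker-extract at a locally built module (paths are assumptions).
# Directory holding the freshly built libextract-*.so
export TRACKER_EXTRACTORS_DIR="$PWD/.libs"
# Directory holding the *.rule file that maps the mime type to the module
export TRACKER_EXTRACTOR_RULES_DIR="$PWD"

# Run the extractor verbosely on one file, only if it is on the PATH
if command -v tracker-extract >/dev/null 2>&1; then
    tracker-extract -v 3 -f test.djvu
fi
```

If the module is picked up, the "Using ..." line in the output should name the
.so from the local directory instead of the installed one.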

Regards,
Matthias



Here is an example output from tracker-extract:

matze@matze-Satellite-C660:~/Dokumente/SCAN$ /usr/local/bin/tracker-extract -v 3 -f test.djvu
Initializing tracker-extract...
Tracker-Message: Setting up monitor for changes to config file:'/home/matze/.config/tracker/tracker-extract.cfg'
Locale 'TRACKER_LOCALE_LANGUAGE' was set to 'de_DE.UTF-8'
Locale 'TRACKER_LOCALE_TIME' was set to 'de_DE.UTF-8'
Locale 'TRACKER_LOCALE_COLLATE' was set to 'de_DE.UTF-8'
Locale 'TRACKER_LOCALE_NUMERIC' was set to 'de_DE.UTF-8'
Locale 'TRACKER_LOCALE_MONETARY' was set to 'de_DE.UTF-8'
Initializing Storage...
Mount monitors set up for to watch for added, removed and pre-unmounts...
No mounts found to iterate
Setting priority nice level to 19
Loading extractor rules... (/usr/local/share/tracker/extract-rules)
  Loaded rule '10-abw.rule'
  Loaded rule '10-djvu.rule'
  Loaded rule '10-html.rule'
  Loaded rule '10-ico.rule'
  Loaded rule '10-jpeg.rule'
  Loaded rule '10-mp3.rule'
  Loaded rule '10-pdf.rule'
  Loaded rule '10-png.rule'
  Loaded rule '10-ps.rule'
  Loaded rule '10-tiff.rule'
  Loaded rule '90-text-generic.rule'
  Loaded rule '93-mplayer-generic.rule'
  Loaded rule '93-totem-generic.rule'
Extractor rules loaded
Setting memory limitations: total is 18,4 EB, minimum is 256 MB, recommended is ~1 GB
  Virtual/Heap set to 268,4 MB (50% of total or MAXLONG)
Guessing mime type as '(null)'
Extracting...
  Using /usr/local/lib/tracker-0.14/extract-modules/libextract-djvu.so...
extract_content pagemax:
return dopage
document_release
context_release
minilisp_finish
return from extract_content
Done (3 items)

SPARQL pre-update:
--
--

SPARQL item:
--
 a nfo:PaginatedTextDocument ;
     nie:title "Scan from xsane to /home/matze/Dokumente/SCAN/test.djvu" ;
     nie:plainTextContent "The quick brown fox jumps over maddows by
matze:abc‚L537. Bestellauftraq für Schulklassen „f? Gewünschter Film:
......................................... ..( ........... ..Min.*)
Vorstellung am .................. den ................ ..‚ um ........ ..
Uhr (Di.—Fr.‚ montags sind (Datum) keine Gruppenbuchungen möglich) '
Voraussichtliche Personenanzahl: ........................................
.. Name der Lehrerin/ des Lehrers: .
........................................ .. Name der Schule:
.................................................................. ..
Straße:
..................................................................................
.. PLZ/ Ort:
...............................................................................
.. Tel.: .................................. ..Fax:..... 1:] Über’
kostenlosesfüntetrrichtsmateriai, besondere Angebotevnd Neuigkeiten zum
Thema „Kinofilme für Schulen“ können Sie uns gerne informieren. Unser
Ansprechpartner für Sie
............................................................ ..
Kontaktadresse (Post oder Email)
............................................................... .. Datum:
............... „Unterschrift/ Stempel: ............................... ..
Rückfax an: CinemaxX Solingen 0212 — 38326-21 Wir holen Sie 15 Minuten vor
Vorstellungsbeginn an unserem Haupteingang ab. Bitte zahlen Sie mit
abgezähltem Geld zu Beginn der Veranstaltung. Unsere Preise: Weiterführende
Schulen: 5,90 € pro Person 4 V Grundschulen & KiTas: 4,90 € pro Person 0
Diese Preise gelten für Parkett. Logenaufschlag = 1,50 € p.P. 0g ’ a, Pi?
Üwir möchten in der Loge sitzen ’5\\ FC‘; Pro Buchung nur eine
Platzkategorie möglich Falls erforderlich: Saalreinigung pauschal € 30,00 *
Preise jeweils ggf. zuzüglich 0,60 € Überlängenzuschlag ab 120 Min. Ein
Lehrer bzw. Begleitperson/ Klasse frei Sie erreichen uns täglich ab 9:30
Uhr unter 0212 38326-15. Äx SÜLINGEQ; cf ist?» AI: gjyfiinemaxxvseitrzgenfite
w " .
--

SPARQL where clause:
--
--

SPARQL post-update:
--
--





2013/2/13 Martyn Russell <mar...@lanedo.com>

> On 13/02/13 17:23, Matthias Thon wrote:
>
>> Hello,
>>
>
> Hello Matthias,
>
>
>> I am new to this mailing list and use Tracker on my Ubuntu 12.10 machine.
>> I would like to create my own extract module and have been working in the
>> example directory.
>>
>
> Cool, something you might want to include in Tracker itself perhaps? :)
>
>
>> After a few modifications to the libextract-mockup example I tried
>> "make" and "make install", but in the .libs directory I only find a *.a file.
>>
>
> Make install should be a NOP, because the target library is in the target:
> noinst_LTLIBRARIES.
>
> If you change the target to modules_LTLIBRARIES a .so is created in .libs/
> and make install should work properly.
>
>
>> What do I have to do to create a *.so file? And which steps are missing to
>> test the extract module in my environment?
>>
>
> For real examples, see src/tracker-extract/
>
> You can also set the tracker-extract modules directory to the current
> directory to test your work without needing to install it.
>
> See man tracker-extract for TRACKER_EXTRACTORS_DIR and
> TRACKER_EXTRACTOR_RULES_DIR environment variables ;)
>
> --
> Regards,
> Martyn
>
> Founder and CEO of Lanedo GmbH.
>
_______________________________________________
tracker-list mailing list
tracker-list@gnome.org
https://mail.gnome.org/mailman/listinfo/tracker-list
