Hi Johannes

This should already work as suggested. EntityLinking does already uses
upper case tokens for lookups (see [1] and also the upper case
configurations of [2] for more details).

In your specific case:

* your processed language configuration should not include ';'. Just a
single line with '*' should be sufficient.
* Do you have the German models for OpenNLP installed? If not it is still
expected to work (by only using upper case tokens), but having German
models available would be good.
* Do you use the Sesame Yard as backend for the 'node' Site, or does you
use a SolrYard?

best
Rupert



[1]
http://stanbol.staging.apache.org/docs/trunk/components/enhancer/engines/entitylinking#token-types
[2]
http://stanbol.staging.apache.org/docs/trunk/components/enhancer/engines/entitylinking#text-processing-configuration



On Mon, Jan 27, 2014 at 3:23 PM, Johannes Goslar <johannes.gos...@dkd.de>wrote:

> Hi everyone,
> is there a way to configure subtext matching, improve word recognition
> where words a not the same language as the main text?
> Concrete example:
> The following entity is in a OpenRDF Sesame database:
> Linking config:
> Chain:
> Linking works great if I input an english text like:
> The Global Toy Conference is a really good thing.
>
> But if I send
> Die Global Toy Conference ist eine gute Sache.
>
> It will report the language as German and will not recognize the entity.
> Is there any configuration way to enable detecting this?
> Maybe one could add a chain component extracting all chained uppercase
> words as label?
>
> Cheers
> Johannes
> --
> Johannes Goslar
>
> dkd Internet Service GmbH
> development // kommunikation // design
> Kaiserstraße 73
> 60329 Frankfurt am Main
>
> Kontakt:
> - email: johannes.gos...@dkd.de
> - fon: +49 69 2475218-0
> - fax: +49 69 2475218-99
> - web: http://www.dkd.de
> - social media: http://social.dkd.de
>
> Aktuelle Projekte:
> - http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator
> - http://www.ellen-wille.de - Launch Website (TYPO3)
> - http://www.vgf-ffm.de - Relaunch Website (TYPO3)
>
> Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski
> Registergericht: Amtsgericht Frankfurt am Main
> Registernummer: HRB 45590
>
>
>
>


-- 
| Rupert Westenthaler             rupert.westentha...@gmail.com
| Bodenlehenstraße 11                             ++43-699-11108907
| A-5500 Bischofshofen

Reply via email to