Hi Johannes This should already work as suggested. EntityLinking does already uses upper case tokens for lookups (see [1] and also the upper case configurations of [2] for more details).
In your specific case: * your processed language configuration should not include ';'. Just a single line with '*' should be sufficient. * Do you have the German models for OpenNLP installed? If not it is still expected to work (by only using upper case tokens), but having German models available would be good. * Do you use the Sesame Yard as backend for the 'node' Site, or does you use a SolrYard? best Rupert [1] http://stanbol.staging.apache.org/docs/trunk/components/enhancer/engines/entitylinking#token-types [2] http://stanbol.staging.apache.org/docs/trunk/components/enhancer/engines/entitylinking#text-processing-configuration On Mon, Jan 27, 2014 at 3:23 PM, Johannes Goslar <johannes.gos...@dkd.de>wrote: > Hi everyone, > is there a way to configure subtext matching, improve word recognition > where words a not the same language as the main text? > Concrete example: > The following entity is in a OpenRDF Sesame database: > Linking config: > Chain: > Linking works great if I input an english text like: > The Global Toy Conference is a really good thing. > > But if I send > Die Global Toy Conference ist eine gute Sache. > > It will report the language as German and will not recognize the entity. > Is there any configuration way to enable detecting this? > Maybe one could add a chain component extracting all chained uppercase > words as label? > > Cheers > Johannes > -- > Johannes Goslar > > dkd Internet Service GmbH > development // kommunikation // design > Kaiserstraße 73 > 60329 Frankfurt am Main > > Kontakt: > - email: johannes.gos...@dkd.de > - fon: +49 69 2475218-0 > - fax: +49 69 2475218-99 > - web: http://www.dkd.de > - social media: http://social.dkd.de > > Aktuelle Projekte: > - http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator > - http://www.ellen-wille.de - Launch Website (TYPO3) > - http://www.vgf-ffm.de - Relaunch Website (TYPO3) > > Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski > Registergericht: Amtsgericht Frankfurt am Main > Registernummer: HRB 45590 > > > > -- | Rupert Westenthaler rupert.westentha...@gmail.com | Bodenlehenstraße 11 ++43-699-11108907 | A-5500 Bischofshofen