Hi, I am using stanbol to extract entitiies by plugging custom vocabulary as per http://stanbol.apache.org/docs/trunk/customvocabulary.html
Following are the steps followed - Configured Clerezza Yard. Configured Managed Yard site. Updated the site by plugging ontology(containing custom entities) . Configured Entity hub linking Engine(*customLinkingEngine*) with managed site. Configured a customChain which uses following engine - *langdetect* - *opennlp-sentence* - *opennlp-token* - *opennlp-pos* - *opennlp-chunker* - *customLinkingEngine* Now, i am able to extract entities like Adidas using *customChain*. However i am facing an issue in extracting entities which has space in between. For example "Tommy Hilfiger". Chain like *dbpedia-disambiguation *(which comes bundeled with stanbol instance) is rightly extracting entities like "Tommy Hilfiger". I had tried configuring *customLinkingEngine* same as * dbpedia-disamb-linking *(configured in *dbpedia-disambiguation* ) but it didn't work to extract above entity. I have invested more than a week now and running out of options now i request you to please provide help in resolving this issue -- Regards, Keval Sethi -- "This e-mail and any attachments transmitted with it are for the sole use of the intended recipient(s) and may contain confidential , proprietary or privileged information. If you are not the intended recipient, please contact the sender by reply e-mail and destroy all copies of the original message. Any unauthorized review, use, disclosure, dissemination, forwarding, printing or copying of this e-mail or any action taken in reliance on this e-mail is strictly prohibited and may be unlawful."