Hi Mohammed, As Rafa stated, for the "OpenNLP Custom NER Model" you need to train your own OpenNLP NER model. If you have done that just copy the model to the 'stanbol/datafiles' directory. After that you can configure the "OpenNLP Custom NER Model" engine by providing
* a name * the name of model file (in the 'stanbol/datafiles' directory) * the type mappings ( {ner-type} > {concept-uri}). Where {ner-type} is the name of the entities in the training set - the <START:{ner-type}> <END> annotations. The {concept-uri} is the URI used as value for the dc:type properties added to fise:TextAnnotations best Rupert On Thu, May 30, 2013 at 7:39 PM, Rafa Haro <rh...@zaizi.com> wrote: > Hi Mohammad, > > Maybe your question is more suitable for OpenNLP mail list but I can try to > help you. First you need to clarify if you want to build a document > classifier or an enhancer, because maybe a document classifier doesn't > really fit what an enhancement mean in Stanbol. > > If you want to build your custom "concept" or Named entity recognition > engine, you have some different options. Maybwe the easiest one is to train > your custom OpenNLP NER model and then integrate it in an engine in > Stanbol. You can follow OpenNLP documentation for that [1]. You would need > some custom training data for your problem domain. > > In the other hand, if you have your own dataset or vocabulary and you want > to link surface forms or concept mentions in text with such dataset, you > should create an EntityHub site for your data an configure a new Entity > Linking engine. You can then also follow a quite helpful guide at Stanbol > website [2]. > > I hope these two links are useful for your first steps. > > Cheers > > [1] - > http://opennlp.apache.org/documentation/1.5.3/manual/opennlp.html#tools.namefind > [2] - https://stanbol.apache.org/docs/trunk/customvocabulary.html > > El jueves, 30 de mayo de 2013, Mohammad Benslimne escribió: > >> Hello folks, >> >> I am developping for my undergraduate project a document >> classifier/extractor. >> I would like use your tools, espacially the OpenNLP Custom NER Model >> extraction engine to define what kind of data to extract. >> Can you please fill me examples how to make it woking out? >> How can I make my own name Finder models and type mapping? >> >> Thanks in advance for your precious hints >> >> >> Regards, >> Med >> > > -- > > ------------------------------ > This message should be regarded as confidential. If you have received this > email in error please notify the sender and destroy it immediately. > Statements of intent shall only become binding when confirmed in hard copy > by an authorised signatory. > > Zaizi Ltd is registered in England and Wales with the registration number > 6440931. The Registered Office is 222 Westbourne Studios, 242 Acklam Road, > London W10 5JJ, UK. -- | Rupert Westenthaler rupert.westentha...@gmail.com | Bodenlehenstraße 11 ++43-699-11108907 | A-5500 Bischofshofen