Re: Custom NER

Mohammad Benslimne Mon, 03 Jun 2013 06:33:02 -0700

Thank you Rafa and Rupert for your responses.

I have a few more questions:
- Can span annotations overlap or be nested (interlocked)?
- Can the concept-uri be any custom URI, or must it be defined in a schema?



Thanks in advance,
Mohammad



On 3 June 2013 03:10, Rupert Westenthaler <rupert.westentha...@gmail.com> wrote:

> Hi Mohammad,
>
> As Rafa stated, for the "OpenNLP Custom NER Model" engine you need to
> train your own OpenNLP NER model. Once you have done that, just copy the
> model to the 'stanbol/datafiles' directory. After that you can configure
> the "OpenNLP Custom NER Model" engine by providing
>
> * a name
> * the name of the model file (in the 'stanbol/datafiles' directory)
> * the type mappings ({ner-type} > {concept-uri}), where {ner-type} is
> the name of the entity type used in the training set (the
> <START:{ner-type}> ... <END> annotations) and {concept-uri} is the URI
> used as the value of the dc:type properties added to the
> fise:TextAnnotations
>
> best
> Rupert
>
>
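To make the {ner-type} > {concept-uri} mappings described above more concrete, here is a minimal Python sketch of how such lines pair an entity type with a URI. It is not Stanbol's own parser; the function name `parse_type_mappings` and the `http://example.org/...` URIs are hypothetical and only illustrate the shape of the configuration values.

```python
def parse_type_mappings(lines):
    """Parse '{ner-type} > {concept-uri}' lines into a dict.

    Illustration only: this mimics the shape of the mapping values,
    it is not the code Stanbol uses to read its configuration.
    """
    mappings = {}
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip empty lines
        # Split at the first '>' into the entity type and the URI.
        ner_type, sep, concept_uri = line.partition(">")
        ner_type, concept_uri = ner_type.strip(), concept_uri.strip()
        if not sep or not ner_type or not concept_uri:
            raise ValueError("expected '{ner-type} > {concept-uri}', got: " + raw)
        mappings[ner_type] = concept_uri
    return mappings


# Hypothetical mappings: both the entity types and the URIs are made up.
EXAMPLE = [
    "drug > http://example.org/ontology#Drug",
    "symptom > http://example.org/ontology#Symptom",
]

mappings = parse_type_mappings(EXAMPLE)
print(mappings["drug"])  # prints http://example.org/ontology#Drug
```

Each key on the left has to match a type name used in the <START:...> tags of the training data, otherwise the mapping has nothing to apply to.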
> On Thu, May 30, 2013 at 7:39 PM, Rafa Haro <rh...@zaizi.com> wrote:
> > Hi Mohammad,
> >
> > Maybe your question is better suited to the OpenNLP mailing list, but I
> > can try to help you. First you need to clarify whether you want to build
> > a document classifier or an enhancer, because a document classifier may
> > not really fit what an enhancement means in Stanbol.
> >
> > If you want to build your own custom "concept" or named entity
> > recognition engine, you have several options. Maybe the easiest one is
> > to train a custom OpenNLP NER model and then integrate it into an engine
> > in Stanbol. You can follow the OpenNLP documentation for that [1]. You
> > will need custom training data for your problem domain.
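As a rough illustration of what such custom training data looks like, the OpenNLP name finder expects one whitespace-tokenized sentence per line, with entities wrapped in <START:type> ... <END> tags. The Python sketch below checks that the tags in each line are balanced and lists the entity types it finds. The sample sentences and the `drug` and `symptom` types are invented, the helper `entity_types` is hypothetical, and treating nested tags as an error is an assumption of this sketch rather than a statement about OpenNLP.

```python
import re

# Matches an opening tag such as <START:person> and captures the type name.
START_TAG = re.compile(r"^<START:([^>\s]+)>$")
END_TAG = "<END>"


def entity_types(line):
    """Return the entity types annotated in one training sentence.

    Raises ValueError if a span is left open, opened inside another
    span, or closed without being opened (assumptions of this sketch).
    """
    types = []
    open_type = None
    for token in line.split():
        match = START_TAG.match(token)
        if match:
            if open_type is not None:
                raise ValueError("nested <START:...> in: " + line)
            open_type = match.group(1)
        elif token == END_TAG:
            if open_type is None:
                raise ValueError("<END> without <START:...> in: " + line)
            types.append(open_type)
            open_type = None
    if open_type is not None:
        raise ValueError("unclosed <START:...> in: " + line)
    return types


# Invented sample sentences in the name finder training format.
SAMPLE = [
    "<START:drug> Aspirin <END> is often taken together with <START:drug> ibuprofen <END> .",
    "The patient reported <START:symptom> chest pain <END> after two days .",
]

found = sorted({t for line in SAMPLE for t in entity_types(line)})
print(found)  # prints ['drug', 'symptom']
```

The type names collected this way are the ones that would later appear on the left-hand side of the engine's type mappings.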
> >
> > On the other hand, if you have your own dataset or vocabulary and you
> > want to link surface forms or concept mentions in text to that dataset,
> > you should create an EntityHub site for your data and configure a new
> > Entity Linking engine. For that you can follow a quite helpful guide on
> > the Stanbol website [2].
> >
> > I hope these two links are useful for your first steps.
> >
> > Cheers
> >
> > [1] - http://opennlp.apache.org/documentation/1.5.3/manual/opennlp.html#tools.namefind
> > [2] - https://stanbol.apache.org/docs/trunk/customvocabulary.html
> >
> > On Thursday, 30 May 2013, Mohammad Benslimne wrote:
> >
> >> Hello folks,
> >>
> >> I am developing a document classifier/extractor for my undergraduate
> >> project.
> >> I would like to use your tools, especially the OpenNLP Custom NER Model
> >> extraction engine, to define what kind of data to extract.
> >> Could you please give me examples of how to get it working?
> >> How can I create my own name finder models and type mappings?
> >>
> >> Thanks in advance for your valuable hints.
> >>
> >>
> >> Regards,
> >> Med
> >>
> >
>
>
>
> --
> | Rupert Westenthaler             rupert.westentha...@gmail.com
> | Bodenlehenstraße 11                             ++43-699-11108907
> | A-5500 Bischofshofen
>



-- 


Mohammad
