I second that. Regex will work better w.r.t. the default trained model of
OpenNLP.
Also, take a look at this extractor- https://code.google.com/p/heideltime/ and
Stanford's tagger- http://nlp.stanford.edu/downloads/sutime.shtml#!

It will be useful to have similar temporal expression enhancement engine in
Stanbol.

Regards,
Anuj


On Thu, Nov 28, 2013 at 11:05 AM, Rupert Westenthaler <
rupert.westentha...@gmail.com> wrote:

> Hi Jayani,
>
> I was not even aware that there exists a Time model for OpenNLP.
> Documentation shows that this uses a purely statistical model so I am
> wondering about the quality. Note also that OpenNLP only provides a
> prebuilt model for English [1].
>
> AFAIK OpenNLP will only provide you with the information that some
> tokens do represent a date. It will not provide you the parsed
> xsd:dateTime. So if you use this Engine you will still need to
> implement this part of your own. So most likely you will end up using
> regex patterns to parse the actual time from the Tokens marked by
> OpenNLP as time.
>
> So I am wondering if it is not better to start with Regex from the
> beginning. If you search for "Regey Date Time extraction" you can
> fined a huge set of example you could start from.
>
> best
> Rupert
>
>
> [1] http://opennlp.sourceforge.net/models-1.5/
>
>
>
> On Thu, Nov 28, 2013 at 5:15 AM, Jayani Withanawasam
> <jayaniwithanawa...@gmail.com> wrote:
> > Hi Dileepa,
> >
> > Thank you so much for your valuble feedback. I'm working on this.
> >
> >
> > On Mon, Nov 25, 2013 at 9:00 PM, Dileepa Jayakody <
> dileepajayak...@gmail.com
> >> wrote:
> >
> >> Hi Jayani,
> >>
> >> There are several enhancement engines in Stanbol developed based on
> >> OpenNLP. (opennlp-ner, opennlp-sentence, opennlp-pos...See [1])  Each of
> >> these engines focus on a particular enhancement aspect using OpenNLP.
> >> Therefore I think it's better to write a new engine for temporal
> >> extractions rather than extending the OpenNLP-NER engine.
> >>
> >> Thanks,
> >> Dileepa
> >>
> >> [1]
> >>
> https://svn.apache.org/repos/asf/stanbol/trunk/enhancement-engines/opennlp
> >>
> >>
> >> On Mon, Nov 25, 2013 at 4:30 PM, Jayani Withanawasam <
> >> jayaniwithanawa...@gmail.com> wrote:
> >>
> >> > Hi,
> >> >
> >> > I'm researching on adding new enhancement engine for extracting date
> and
> >> > time (Temporal extraction) to Stanbol as suggested by Rupert.
> >> >
> >> > There, it is being found that OpenNLP has an entity extraction unit
> for
> >> > date and time.
> >> > Also, I noticed that OpenNLP is already integrated to Stanbol in NER
> >> > engine.
> >> >
> >> > So, as per my understanding, there are two options to extract date and
> >> > time.
> >> >
> >> > One is to have a seperate enhancement engine for date and time
> >> information
> >> > extraction. Another one is to add date time extraction as a code
> >> > enhancement to exisitng OpenNLP NER engine.
> >> >
> >> > What is your opinion on this? Is there any other approach which you
> think
> >> > that would be better?
> >> >
> >> > Thank you
> >> > Jayani
> >> >
> >>
>
>
>
> --
> | Rupert Westenthaler             rupert.westentha...@gmail.com
> | Bodenlehenstraße 11                             ++43-699-11108907
> | A-5500 Bischofshofen
>

Reply via email to