Thank you Rupert and Anuj for your suggestions. I'm going through the links
you have provided.

Rupert,

What did you mean by a base-line engine that is directly integrated in
Stanbol, using a Regex-based approach?

I would appreciate it if you could elaborate on this.
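
To make the question concrete, here is a minimal sketch of how I currently
read "Regex based approach": plain Java that finds ISO-style dates
(yyyy-MM-dd) with a regular expression and turns each valid match into an
xsd:dateTime-style string. This is only an assumption on my side, not
Stanbol code: the class RegexDateSketch and its extract method are
hypothetical names, no EnhancementEngine API is used, and a real engine
would need many more patterns (month names, relative dates, times, ranges).

```java
import java.time.LocalDate;
import java.time.format.DateTimeParseException;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Stanbol or OpenNLP.
final class RegexDateSketch {

    // Matches candidates such as 2013-11-29; validity is checked afterwards.
    private static final Pattern ISO_DATE =
            Pattern.compile("\\b\\d{4}-\\d{2}-\\d{2}\\b");

    private RegexDateSketch() {
    }

    // Returns one xsd:dateTime-style string (midnight, no time zone)
    // for every valid calendar date found in the text.
    static List<String> extract(String text) {
        List<String> result = new ArrayList<>();
        Matcher matcher = ISO_DATE.matcher(text);
        while (matcher.find()) {
            try {
                // LocalDate.parse rejects impossible dates such as 2013-02-30.
                LocalDate date = LocalDate.parse(matcher.group());
                result.add(date + "T00:00:00");
            } catch (DateTimeParseException e) {
                // The regex matched the shape but not a real date; skip it.
            }
        }
        return result;
    }
}
```

Calling RegexDateSketch.extract on a sentence containing 2013-11-29 would
give a single-element list with that date at midnight. Is this roughly the
kind of base line you had in mind, wrapped in an engine?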


On Fri, Nov 29, 2013 at 11:35 AM, Rupert Westenthaler <
rupert.westentha...@gmail.com> wrote:

> Hi Anuj
>
> On Thu, Nov 28, 2013 at 1:51 PM, Anuj Kumar <anujs...@gmail.com> wrote:
> > I second that. Regex will work better w.r.t. the default trained model of
> > OpenNLP.
>
> Both of these projects look interesting:
>
> > Also, take a look at this extractor-
> https://code.google.com/p/heideltime/ and
>
> As this is GPLv3 you cannot directly use it to implement an
> EnhancementEngine that is part of the Stanbol Codebase. Integrating it
> via a RESTful service would be an option.
>
> > Stanford's tagger- http://nlp.stanford.edu/downloads/sutime.shtml#!
>
> The same is true for SuTime as all Stanford NLP components are under GPL.
>
> If we want to integrate those projects I suggest extending the Stanbol
> RESTful NLP protocol [1] and service [2] so that it can represent
> date/time points and ranges. SuTime support could be added to the
> already existing Stanbol-Stanford integration [3]. For HeidelTime one
> would need to implement a similar component.
>
>
> But before integrating those I would prefer to have a base-line engine
> that is directly integrated in Stanbol. Looks like a Regex based
> approach could be sufficient for that. WDYT Jayani?
>
> best
> Rupert
>
> [1] https://issues.apache.org/jira/browse/STANBOL-878
> [2] https://issues.apache.org/jira/browse/STANBOL-892
> [3] https://github.com/westei/stanbol-stanfordnlp
>
> >
> > It will be useful to have a similar temporal expression enhancement engine
> > in Stanbol.
> >
> > Regards,
> > Anuj
> >
> >
> > On Thu, Nov 28, 2013 at 11:05 AM, Rupert Westenthaler <
> > rupert.westentha...@gmail.com> wrote:
> >
> >> Hi Jayani,
> >>
> >> I was not even aware that there exists a Time model for OpenNLP.
> >> Documentation shows that this uses a purely statistical model so I am
> >> wondering about the quality. Note also that OpenNLP only provides a
> >> prebuilt model for English [1].
> >>
> >> AFAIK OpenNLP will only provide you with the information that some
> >> tokens do represent a date. It will not provide you the parsed
> >> xsd:dateTime. So if you use this Engine you will still need to
> >> implement this part on your own. So most likely you will end up using
> >> regex patterns to parse the actual time from the Tokens marked by
> >> OpenNLP as time.
> >>
> >> So I am wondering if it is not better to start with Regex from the
> >> beginning. If you search for "Regex Date Time extraction" you can
> >> find a huge set of examples you could start from.
> >>
> >> best
> >> Rupert
> >>
> >>
> >> [1] http://opennlp.sourceforge.net/models-1.5/
> >>
> >>
> >>
> >> On Thu, Nov 28, 2013 at 5:15 AM, Jayani Withanawasam
> >> <jayaniwithanawa...@gmail.com> wrote:
> >> > Hi Dileepa,
> >> >
> >> > Thank you so much for your valuable feedback. I'm working on this.
> >> >
> >> >
> >> > On Mon, Nov 25, 2013 at 9:00 PM, Dileepa Jayakody
> >> > <dileepajayak...@gmail.com> wrote:
> >> >
> >> >> Hi Jayani,
> >> >>
> >> >> There are several enhancement engines in Stanbol developed based on
> >> >> OpenNLP (opennlp-ner, opennlp-sentence, opennlp-pos...; see [1]). Each
> >> >> of these engines focuses on a particular enhancement aspect using
> >> >> OpenNLP. Therefore I think it's better to write a new engine for
> >> >> temporal extraction rather than extending the OpenNLP-NER engine.
> >> >>
> >> >> Thanks,
> >> >> Dileepa
> >> >>
> >> >> [1]
> >> >> https://svn.apache.org/repos/asf/stanbol/trunk/enhancement-engines/opennlp
> >> >>
> >> >>
> >> >> On Mon, Nov 25, 2013 at 4:30 PM, Jayani Withanawasam <
> >> >> jayaniwithanawa...@gmail.com> wrote:
> >> >>
> >> >> > Hi,
> >> >> >
> >> >> > I'm researching adding a new enhancement engine for extracting date
> >> >> > and time (temporal extraction) to Stanbol, as suggested by Rupert.
> >> >> >
> >> >> > I found that OpenNLP has an entity extraction unit for date and
> >> >> > time. I also noticed that OpenNLP is already integrated into Stanbol
> >> >> > in the NER engine.
> >> >> >
> >> >> > So, as per my understanding, there are two options to extract date
> >> >> > and time.
> >> >> >
> >> >> > One is to have a separate enhancement engine for date and time
> >> >> > information extraction. The other is to add date and time extraction
> >> >> > as a code enhancement to the existing OpenNLP NER engine.
> >> >> >
> >> >> > What is your opinion on this? Is there any other approach that you
> >> >> > think would be better?
> >> >> >
> >> >> > Thank you
> >> >> > Jayani
> >> >> >
> >> >>
> >>
> >>
> >>
> >> --
> >> | Rupert Westenthaler             rupert.westentha...@gmail.com
> >> | Bodenlehenstraße 11                             ++43-699-11108907
> >> | A-5500 Bischofshofen
> >>
>
>
>
> --
> | Rupert Westenthaler             rupert.westentha...@gmail.com
> | Bodenlehenstraße 11                             ++43-699-11108907
> | A-5500 Bischofshofen
>
