I second that. Regex will work better w.r.t. the default trained model of OpenNLP. Also, take a look at this extractor- https://code.google.com/p/heideltime/ and Stanford's tagger- http://nlp.stanford.edu/downloads/sutime.shtml#!
It will be useful to have similar temporal expression enhancement engine in Stanbol. Regards, Anuj On Thu, Nov 28, 2013 at 11:05 AM, Rupert Westenthaler < rupert.westentha...@gmail.com> wrote: > Hi Jayani, > > I was not even aware that there exists a Time model for OpenNLP. > Documentation shows that this uses a purely statistical model so I am > wondering about the quality. Note also that OpenNLP only provides a > prebuilt model for English [1]. > > AFAIK OpenNLP will only provide you with the information that some > tokens do represent a date. It will not provide you the parsed > xsd:dateTime. So if you use this Engine you will still need to > implement this part of your own. So most likely you will end up using > regex patterns to parse the actual time from the Tokens marked by > OpenNLP as time. > > So I am wondering if it is not better to start with Regex from the > beginning. If you search for "Regey Date Time extraction" you can > fined a huge set of example you could start from. > > best > Rupert > > > [1] http://opennlp.sourceforge.net/models-1.5/ > > > > On Thu, Nov 28, 2013 at 5:15 AM, Jayani Withanawasam > <jayaniwithanawa...@gmail.com> wrote: > > Hi Dileepa, > > > > Thank you so much for your valuble feedback. I'm working on this. > > > > > > On Mon, Nov 25, 2013 at 9:00 PM, Dileepa Jayakody < > dileepajayak...@gmail.com > >> wrote: > > > >> Hi Jayani, > >> > >> There are several enhancement engines in Stanbol developed based on > >> OpenNLP. (opennlp-ner, opennlp-sentence, opennlp-pos...See [1]) Each of > >> these engines focus on a particular enhancement aspect using OpenNLP. > >> Therefore I think it's better to write a new engine for temporal > >> extractions rather than extending the OpenNLP-NER engine. > >> > >> Thanks, > >> Dileepa > >> > >> [1] > >> > https://svn.apache.org/repos/asf/stanbol/trunk/enhancement-engines/opennlp > >> > >> > >> On Mon, Nov 25, 2013 at 4:30 PM, Jayani Withanawasam < > >> jayaniwithanawa...@gmail.com> wrote: > >> > >> > Hi, > >> > > >> > I'm researching on adding new enhancement engine for extracting date > and > >> > time (Temporal extraction) to Stanbol as suggested by Rupert. > >> > > >> > There, it is being found that OpenNLP has an entity extraction unit > for > >> > date and time. > >> > Also, I noticed that OpenNLP is already integrated to Stanbol in NER > >> > engine. > >> > > >> > So, as per my understanding, there are two options to extract date and > >> > time. > >> > > >> > One is to have a seperate enhancement engine for date and time > >> information > >> > extraction. Another one is to add date time extraction as a code > >> > enhancement to exisitng OpenNLP NER engine. > >> > > >> > What is your opinion on this? Is there any other approach which you > think > >> > that would be better? > >> > > >> > Thank you > >> > Jayani > >> > > >> > > > > -- > | Rupert Westenthaler rupert.westentha...@gmail.com > | Bodenlehenstraße 11 ++43-699-11108907 > | A-5500 Bischofshofen >