Re: How to get SOLR document metadata in UIMA using SOLR6.3

2016-12-24 Thread Tommaso Teofili
The UIMAUpdateRequestProcessor passes the text from all the SolrInputDocument fields configured in solrconfig.xml into UIMA's JCas text, so in your use case both the 'text' and 'title' fields will be passed as text, either merged into a single text or as two separate text fragments to be
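
For reference, a minimal sketch of how the analyzed fields might be declared in solrconfig.xml. It follows the layout of the Solr UIMA contrib configuration as I remember it, so please check it against the documentation for your Solr version. The chain name and the analysis engine descriptor path are placeholders, and the fieldMappings section is left out.

```xml
<!-- Sketch only: chain name and descriptor path are placeholders, not taken from the thread. -->
<updateRequestProcessorChain name="uima">
  <processor class="org.apache.solr.uima.processor.UIMAUpdateRequestProcessorFactory">
    <lst name="uimaConfig">
      <!-- Placeholder path to an analysis engine descriptor on the classpath -->
      <str name="analysisEngine">/path/to/YourAggregateAE.xml</str>
      <lst name="analyzeFields">
        <!-- true: concatenate the fields into one text; false: analyze each field separately -->
        <bool name="merge">false</bool>
        <arr name="fields">
          <str>text</str>
          <str>title</str>
        </arr>
      </lst>
      <!-- fieldMappings (annotation feature to Solr field) omitted in this sketch -->
    </lst>
  </processor>
  <processor class="solr.RunUpdateProcessorFactory" />
</updateRequestProcessorChain>
```

The merge flag is what decides between the "single merged text" and "separate fragments" behaviour described in the message.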

Re: next UIMA workshop?

2013-10-21 Thread Tommaso Teofili
good idea, I think it'd be really nice. Tommaso 2013/10/21 Jens Grivolla j+...@grivolla.net Hi, at GSCL 2013 we talked a bit about options for the next UIMA workshop. How about trying to have it at COLING 2014? WORKSHOP TIMELINE • 19th January 2014: Workshop proposals due

Re: Increase heap size using UIMA with Solr

2013-08-27 Thread Tommaso Teofili
Hi Jun, 2013/8/26 Jun Ohtani joht...@gmail.com Dear all, My co-worker uses UIMA with Solr. We have two problems: increasing heap size and CPU usage. We have two questions. 1. o.a.u.jcas.impl.JCasImpl.loadJCasClasses() is called on every request. Using UIMA with Solr, we send to request using

Re: Clerezza and UIMA

2013-02-11 Thread Tommaso Teofili
Hi Greg, the code you're referring to [1] is used essentially for OSGi deployments, where each bundle (the packaged jar) has its own ClassLoader, and that would create issues when trying to instantiate AEs whose classes are in different bundles than the one which contains the UIMA

Re: using UIMA add-ons.

2013-02-04 Thread Tommaso Teofili
Hi Brian, 2013/2/1 Brian Dolan buddha_...@yahoo.com Hello Group, I am looking to fold several of the UIMA-Addons into an application I'm building. Since not all of them are set up for PEAR packaging, is there a robust alternative method? what do you mean here exactly? In particular,

Re: Solr 4 and UIMA

2012-11-30 Thread Tommaso Teofili
Which integration are you talking about? The one in Solr (the UpdateRequestProcessor that extracts metadata with UIMA to populate some fields and then indexes the received document in Solr) [1] or the one in UIMA Solrcas (the CASConsumer which takes some of the existing annotations in the received

Re: Building Source Code

2012-08-03 Thread Tommaso Teofili
sure, the strange failure I get with Java7 is related to the compile phase, I'll try to figure out if we can safely fix that without breaking backward compatibility. Cheers, Tommaso 2012/8/2 Burn Lewis burnle...@gmail.com The first failure (file size changed) is not important ... it's testing

Re: Building Source Code

2012-08-02 Thread Tommaso Teofili
Hi Alan, this is the correct way of building. Can you further describe your environment (OS, Maven version, etc.) and which tests are failing? Regards, Tommaso 2012/8/2 Sergeant, Alan alan.serge...@sap.com Hi, I am trying to build the source code. I have checked out the trunk and issued

Re: Building Source Code

2012-08-02 Thread Tommaso Teofili
(org.apache.uima.analysis_engine.impl.AnalysisEngine_implTest): File size changed by 737 should be no more than 2 Tests in error: testSofaDataStream(org.apache.uima.cas.test.JcasSofaTest): Error trying to open a stream to Sofa data. Tests run: 434, Failures: 1, Errors: 1, Skipped: 0 Alan -Original Message- From: Tommaso Teofili

Re: Building Source Code

2012-08-02 Thread Tommaso Teofili
Uhm, I get some errors as well with Java7, let me see if I can create a Jenkins job for that so that we can track it. Tommaso 2012/8/2 Tommaso Teofili tommaso.teof...@gmail.com Thanks for reporting Alan, I'll have a look and let you know. Regards, Tommaso 2012/8/2 Sergeant, Alan alan.serge

Re: [ANN] UIMA/Drools bridge: use Drools to write NLP grammars

2012-06-11 Thread Tommaso Teofili
Nice thing Roberto, thanks for letting us know! Have a nice day, Tommaso 2012/6/8 Roberto Franchini franch...@celi.it Hi, we have released on GitHub a little bridge that allows writing NLP grammars using the Drools rule engine (http://www.jboss.org/drools)

Re: Clustering, Collapsing

2012-06-11 Thread Tommaso Teofili
Hi Deejay, 2012/6/8 Deejay dee...@binarytweed.com Hi all, I recently discovered Apache UIMA, and it looks like a very large project! I was hoping that someone more experienced with it than I could comment on whether there are parts of the project that could help with my problem. I need

Re: Maven UIMA and import by name

2012-05-11 Thread Tommaso Teofili
Hi Erik, I mostly edit XMLs by hand (also because I use IntelliJ IDEA) but I think you may tell Maven that desc is a resource folder: <build> ... <resources> <resource> <directory>src/main/resources</directory> <directory>desc</directory> </resource> ... </build> HTH, Tommaso
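
A possible way to lay this out in the POM, as a sketch rather than a tested build file: in Maven's POM model each resource element takes a single directory, so declaring both folders would normally mean two resource entries. The folder names are the ones mentioned in the message; everything else in the project's pom.xml is assumed to stay as it is.

```xml
<!-- Sketch: declares both src/main/resources and desc as resource folders. -->
<build>
  <resources>
    <resource>
      <directory>src/main/resources</directory>
    </resource>
    <resource>
      <!-- the descriptor folder from the message -->
      <directory>desc</directory>
    </resource>
  </resources>
</build>
```

Note that once a resources section is declared, the default src/main/resources folder is no longer added implicitly, which is why it is listed explicitly here.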

Hackathon mentioning UIMA at BerlinBuzzwords

2011-05-13 Thread Tommaso Teofili
Hi all, I've just seen this : http://berlinbuzzwords.de/wiki/semantic-hackathon May be interesting to attend (I'll try to make it remotely). Tommaso

Re: Sandbox help - every descriptor throws ResourceInitializationException from the CAS Visual Debugger

2011-04-29 Thread Tommaso Teofili
Hi Britt, what I think you are missing is adding the sandbox jars to the CAS Visual Debugger launcher classpath. From Eclipse open Run -> Run Configurations -> go to the CAS Visual Debugger launcher you're using. You'll probably find uimaj-examples under the project selector, you may just change this to

Possible bug with DictionaryAnnotator and escaped characters

2011-04-19 Thread Tommaso Teofili
Hi all, I've just noticed an unexpected behavior in DictionaryAnnotator: if you create a dictionary with the DictionaryCreator and your input file (text file with one entry per line) contains characters like & or ' then they get converted to their escaped version &amp; or &apos; as it's right in XML
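
To make the escaping concrete, here is a small illustrative fragment. The entry/key element names are my assumption about what a generated dictionary file looks like and the two entries are made-up examples, so treat this only as a picture of the XML escaping itself.

```xml
<!-- Illustrative only: element names are assumed, entries are invented. -->
<!-- Input lines "AT&T" and "O'Reilly" would appear in the XML as: -->
<entries>
  <entry>
    <key>AT&amp;T</key>
  </entry>
  <entry>
    <key>O&apos;Reilly</key>
  </entry>
</entries>
```

An XML parser reading this back yields the original characters, so the escaped forms are correct inside the file; the question raised in the message is what happens beyond that point.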

Re: Solr now supports UIMA

2011-04-05 Thread Tommaso Teofili
2011/4/5 Jörn Kottmann kottm...@gmail.com If the analysis pipeline should be scaled with UIMA AS the Solrcas AE can be placed at the end of the pipeline to send the results to a Solr instance. Right Jörn, Solrcas [1] is a UIMA CAS Consumer and it can be used just like any other UIMA

Re: Solr now supports UIMA

2011-04-04 Thread Tommaso Teofili
Hi all, as you may see reading the wiki page linked by Jörn, that integration enables calling a UIMA pipeline from inside a Solr instance. This is done via a dedicated component one can add to the chain of UpdateRequestProcessors that are responsible for processing documents when they come to Solr

Re: JCasGen and Maven

2011-03-02 Thread Tommaso Teofili
Hi Greg, I remember that being done with the maven-exec-plugin, you may have a look at [1] Hope this helps, Tommaso [1] : http://markmail.org/thread/wc6riilnheh7v2cy 2011/3/2 Greg Holmberg holmberg2...@comcast.net Hi UIMA users-- Does anyone know how to get maven to run jcasgen as part of a
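
A rough sketch of what the exec-plugin approach could look like. The Jcasgen main class and its -jcasgeninput / -jcasgenoutput options are written from memory of the UIMA tools, and the type system path and output folder are hypothetical, so verify all of them against your UIMA version before relying on this.

```xml
<!-- Sketch only: paths are hypothetical; assumes uimaj-tools is on the project classpath. -->
<plugin>
  <groupId>org.codehaus.mojo</groupId>
  <artifactId>exec-maven-plugin</artifactId>
  <executions>
    <execution>
      <!-- run before compilation so the generated classes can be compiled -->
      <phase>generate-sources</phase>
      <goals>
        <goal>java</goal>
      </goals>
      <configuration>
        <mainClass>org.apache.uima.tools.jcasgen.Jcasgen</mainClass>
        <arguments>
          <argument>-jcasgeninput</argument>
          <argument>src/main/resources/desc/TypeSystem.xml</argument>
          <argument>-jcasgenoutput</argument>
          <argument>target/generated-sources/jcasgen</argument>
        </arguments>
      </configuration>
    </execution>
  </executions>
</plugin>
```

The output folder would also need to be registered as a source root (for example with the build-helper plugin) for the generated classes to be compiled.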

Re: Watson powered by UIMA

2011-02-24 Thread Tommaso Teofili
Thanks Philip 2011/2/24 Philip Ogren phi...@ogren.info This may be of interest to folks on this list even if somewhat off-topic and probably well-known: Watson is powered by 10 racks of IBM Power 750 servers running Linux, and uses 15 terabytes of RAM, 2,880 processor cores and is capable

Re: Analysis Engines for mbox like data

2011-02-14 Thread Tommaso Teofili
I agree with Jörn, I think that's the faster way. Tommaso 2011/2/14 Jörn Kottmann kottm...@gmail.com On 2/14/11 4:49 AM, Radhouane Aniba wrote: Hello everyone, Quite unusual request to this list, I am wondering if there is any analysis engine that allows mining MBOX-like formats such as

Re: UIMA for extracting book entities from tables of contents, etc. as RDF?

2010-12-27 Thread Tommaso Teofili
On Mon, Dec 27, 2010 at 2:45 AM, Tommaso Teofili tommaso.teof...@gmail.com wrote: Hi Darren, 2010/12/23 Darren Cruse darren.cr...@gmail.com Hi guys I apologize for a newbie question but I'm quite new to UIMA and the whole area of information extraction/entity extraction. And I'm

Re: Compile Alchemy Annotator

2010-06-15 Thread Tommaso Teofili
nice to hear :-) Let us know how it goes. Cheers, Tommaso 2010/6/15 florent andré florent.andre-...@4sengines.com with the workaround to compile uimaj (see last mail from me), Alchemy compiles like a charm. Thanks. Let's play with it ! :) ++ On 15/06/2010 06:34, Tommaso Teofili wrote: Hi

Re: What about a search engine like

2010-05-27 Thread Tommaso Teofili
2010/5/27 Anuj Saini anuj.sa...@orkash.com You are trying to generate clusters of similar artifacts. Though this can be done at processing time, a better approach is to keep the annotated results in a database. My suggestion is to use an index for fast retrieval. UIMA doesn't provide anything