Re: Running UIMA Apps on Hadoop

Yi-Wen Liu Mon, 12 Oct 2015 12:34:21 -0700

Thanks for your reply, I'll take a look at the sample.
It seems like a mixture of both. cTAKES using UIMA as the framework and
cTAKES is inside a MR job.
I found something that might help:
https://github.com/DigitalPebble/behemoth, but
not sure if it is what I need.
Have you used  Behemoth before?



Thanks,
Yi-Wen

On Mon, Oct 12, 2015 at 12:25 AM, Tommaso Teofili <[email protected]
> wrote:

> In the past I had developed a small sample of running UIMA on Apache Hama
> (which in turns can run on HDFS) [1].
> Other than that I don't know the mentioned doc and never tried reproducing
> the steps, however it sounds like "shaving the classes" is a typo :-)
>
> First and foremost I think you should decide which tool does what: do you
> use UIMA as a library inside a MR job (like in the mentioned doc) ? Or you
> use UIMA as the framework and intend to read / write job data from HDFS ?
> Or a mixture of both ?
>
> My 2 cents,
> Tommaso
>
> [1] :
>
> https://github.com/apache/uima-sandbox/blob/trunk/uima-bsp/src/main/java/org/apache/uima/bsp/BasicAEProcessingBSPJob.java
> [2] : http://hama.apache.org
>
>
> 2015-10-12 7:52 GMT+02:00 Yi-Wen Liu <[email protected]>:
>
> > Thanks, but if we still have to use Hadoop, does anybody know who I can
> ask
> > questions about the document?
> >
> >
> https://cwiki.apache.org/confluence/display/UIMA/Running+UIMA+Apps+on+Hadoop
> >
> > Thanks,
> > Yi-Wen
> >
> > On Sat, Oct 10, 2015 at 4:05 AM, Lou DeGenaro <[email protected]>
> > wrote:
> >
> > > Perhaps you have no choice, but if you do then consider DUCC
> > https://uima.
> > > apache.org/doc-uimaducc-whatitam.html for UIMA scale-out.
> > >
> > > Lou.
> > >
> > > On Fri, Oct 9, 2015 at 5:54 PM, Yi-Wen Liu <[email protected]> wrote:
> > >
> > > > Hi all,
> > > >
> > > > I am a USC student working on professor Mattmann's project
> "Integration
> > > of
> > > > cTAKES/UIMA and Apache Hadoop for the Shangridocs system", the
> proposal
> > > is
> > > > attached.
> > > > I searched many relevant resources of running UIMA on Hadoop, this is
> > one
> > > > of them:
> > > >
> > > >
> > >
> >
> https://cwiki.apache.org/confluence/display/UIMA/Running+UIMA+Apps+on+Hadoop
> > > >
> > > > but it only provides very general explanations.
> > > >
> > > > Has anybody gone through all the steps and succeeded? Hope you could
> > > > provide me some examples. Thanks!
> > > >
> > > > And in the *Important Considerations*,
> > > > "1. The jar file created should *shave* all the classes, descriptors
> of
> > > > the UIMA app along with the map/reduce and job main class"
> > > >
> > > > I think you mean "The jar file created should *have* all the
> > classes...".
> > > > Is that correct?
> > > >
> > > >
> > > > Thanks,
> > > > Yi-Wen
> > > >
> > >
> >
>

Re: Running UIMA Apps on Hadoop

Reply via email to