It sounds more like an alternative to DIH rather than an incremental add-on to DIH. I mean, isn't Morphline really just "a DIH for Hadoop"?

So, back to Shalin's question, which specific (please detail!) use cases of DIH are enhanced by Morphline?

Maybe it would help if you simply elaborated on what benefits would accrue from adding Morphline to DIH - as opposed to creating a separate module for Solr. I suppose it depends on whether you consider DIH a solid foundation or a weak link in Solr that desperately needs firming up.

-- Jack Krupansky

-----Original Message----- From: Alexandre Rafalovitch
Sent: Sunday, June 8, 2014 1:40 AM
To: [email protected]
Subject: Re: Adding Morphline support to DIH - worth the effort?

Well, it's the same core scenario that DIH supports (apart from the actual
data sources), but actively supported and developed by a company with
a lot more investment in it. For the primitives supported, see
http://cloudera.github.io/cdk/docs/current/cdk-morphlines/morphlinesReferenceGuide.html

We don't bundle ALL of these with Solr, but I think we do bundle core,
solr-core and solr-cell packages, which is a good number and range of
functionality (e.g. readMultiLine).

Regards,
  Alex.
Personal website: http://www.outerthoughts.com/
Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency


On Sun, Jun 8, 2014 at 12:23 PM, Shalin Shekhar Mangar
<[email protected]> wrote:
I do not know much about morphlines but I'd like to know what use-cases
would be possible/easier/faster with such an integration?


On Sun, Jun 8, 2014 at 10:32 AM, Alexandre Rafalovitch <[email protected]>
wrote:

Hello,

I had a preliminary look around and it might be possible to plug
Morphline (already shipped with Solr) into DIH by creating a bridging
EntityProcessor.

Two questions:
1) Do people see value in it?
2) DIH is not very well supported, so any addition seems to get stuck
in a "rickety bridge, don't rock" discussion (e.g. SOLR-4799). I don't
want to suddenly be responsible for fixing the bridge before adding a
standalone piece of code. So, if I write the code, how many general
DIH externalities would I also have to address (e.g. lack of tests,
etc.)?

Regards,
   Alex.
P.S. Morphline could also be integrated into the update request
processor chain, so that could be an alternative project.

Personal website: http://www.outerthoughts.com/
Current project: http://www.solr-start.com/ - Accelerating your Solr
proficiency

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]




--
Regards,
Shalin Shekhar Mangar.

