Re: Consider cleaning up backend code

Jianyong Dai Thu, 22 Apr 2010 17:52:18 -0700

+1 for removing. This interface does not bring us any value when wedecide to move closer to hadoop. Writing a backend is almost writinghalf of Pig. I don't think this interface is attractive to mostdevelopers. Instead, I +1 for Milind's idea to make intermediateartifacts available, or provide some hook for user to peek/morph theplan at different stages. This opens the door for developers tovisualize/debug/improve Pig without knowing every details of Pig.


Daniel


Alan Gates wrote:

A couple of years ago we had this concept that Pig as is should beable to run on other backends (like say Dryad if it were opensource). So we built this whole backend interface and (mostly) keptHadoop specific objects out of the front end.
Recently we have modified that stand and said that this implementationof Pig is Hadoop specific. Pig Latin itself will still stay Hadoopindependent. So the ability to have multiple backends is fine. Butthe ability to have non-Hadoop backends is not really interesting now.
So I at least see the proposal here as getting rid of generic codethat tries to hide the fact that we are working on top of Hadoop(things like DataStorage and ExecutionEngine).
Alan.

On Apr 22, 2010, at 4:14 PM, Arun C Murthy wrote:
I read it as getting rid of concepts parallel to hadoop in src/org/apache/pig/backend/hadoop/datastorage.
Is that true?

thanks,
Arun

On Apr 22, 2010, at 1:34 PM, Dmitriy Ryaboy wrote:
I kind of dig the concept of being able to plug in a differentbackend,though I definitely thing we should get rid of the dead localmodecode. Canyou give an example of how this will simplify the codebase? Is itmore thanjust GenericClass foo = new SpecificClass(), and the associatedextra files?
-D
On Thu, Apr 22, 2010 at 1:25 PM, Arun C Murthy <[email protected]>wrote:
+1

Arun


On Apr 22, 2010, at 11:35 AM, Richard Ding wrote:

Pig has an abstraction layer (interfaces and abstract classes) to
support multiple execution engines. After PIG-1053, Hadoop is theonlyexecution engine supported by Pig. I wonder if we should removethislayer of code, and make Hadoop THE execution engine for Pig. Thiswill
simplify a lot the backend code.



Thanks,

-Richard

Re: Consider cleaning up backend code

Reply via email to