Re: queries on Spork (Pig on Spark)

2015-11-24 Thread Divya Gehlot
;>> Details at logfile: /home/pig/pig_1448425672112.log > > You need to check the log file for details > > > > > On Wed, Nov 25, 2015 at 1:57 PM, Divya Gehlot > wrote: > >> Hi, >> >> >> As a beginner ,I have below queries on Spork(Pig on

Re: queries on Spork (Pig on Spark)

2015-11-24 Thread Jeff Zhang
>>> Details at logfile: /home/pig/pig_1448425672112.log You need to check the log file for details On Wed, Nov 25, 2015 at 1:57 PM, Divya Gehlot wrote: > Hi, > > > As a beginner ,I have below queries on Spork(Pig on Spark). > I have cloned git clone https://github

queries on Spork (Pig on Spark)

2015-11-24 Thread Divya Gehlot
> > Hi, As a beginner ,I have below queries on Spork(Pig on Spark). I have cloned git clone https://github.com/apache/pig -b spark . 1.On which version of Pig and Spark , Spork is being built ? 2. I followed the steps mentioned in https://issues.apache.org/ji ra/browse/PIG-4059 and

Re: Update on Pig on Spark initiative

2014-08-27 Thread Russell Jurney
This is really exciting! Thanks so much for this work, I think you've guaranteed Pig's continued vitality. On Wednesday, August 27, 2014, Matei Zaharia wrote: > Awesome to hear this, Mayur! Thanks for putting this together. > > Matei > > On August 27, 2014 at 10:04:12 PM, Mayur Rustagi (mayur.ru

Re: Update on Pig on Spark initiative

2014-08-27 Thread Matei Zaharia
Awesome to hear this, Mayur! Thanks for putting this together. Matei On August 27, 2014 at 10:04:12 PM, Mayur Rustagi (mayur.rust...@gmail.com) wrote: Hi, We have migrated Pig functionality on top of Spark passing 100% e2e for success cases in pig test suite. That means UDF, Joins & other func

Update on Pig on Spark initiative

2014-08-27 Thread Mayur Rustagi
Hi, We have migrated Pig functionality on top of Spark passing 100% e2e for success cases in pig test suite. That means UDF, Joins & other functionality is working quite nicely. We are in the process of merging with Apache Pig trunk(something that should happen over the next 2 weeks). Meanwhile if

Re: Pig on Spark

2014-04-25 Thread Mayur Rustagi
One core segment that frequently asks for systems like Pig & Hive are analyst who want to deal with data. The key place I see pig fitting in is getting non-developers deal with data at scale & free up developers to deal with code, udf rather than manage day to day dataflow changes & updates. A bypr

Re: Pig on Spark

2014-04-25 Thread Bharath Mundlapudi
>> I've only had a quick look at Pig, but it seems that a declarative >> layer on top of Spark couldn't be anything other than a big win, as it >> allows developers to declare *what* they want, permitting the compiler >> to determine how best poke at the RDD API to implement it. The devil is in th

Re: Pig on Spark

2014-04-25 Thread Michael Armbrust
On Fri, Apr 25, 2014 at 6:30 AM, Mark Baker wrote: > I've only had a quick look at Pig, but it seems that a declarative > layer on top of Spark couldn't be anything other than a big win, as it > allows developers to declare *what* they want, permitting the compiler > to determine how best poke at

Re: Pig on Spark

2014-04-25 Thread Eugen Cepoi
It depends, personally I have the opposite opinion. IMO expressing pipelines in a functional language feels natural, you just have to get used with the language (scala). Testing spark jobs is easy where testing a Pig script is much harder and not natural. If you want a more high level language t

Re: Pig on Spark

2014-04-25 Thread Mark Baker
I've only had a quick look at Pig, but it seems that a declarative layer on top of Spark couldn't be anything other than a big win, as it allows developers to declare *what* they want, permitting the compiler to determine how best poke at the RDD API to implement it. In my brief time with Spark, I

Re: Pig on Spark

2014-04-24 Thread suman bharadwaj
t; Thanks. >>>> >>>> >>>> On Thu, Apr 24, 2014 at 1:54 AM, Mayur Rustagi >>> > wrote: >>>> >>>>> Thr are two benefits I get as of now >>>>> 1. Most of the time a lot of customers dont want the full power but &g

Re: Pig on Spark

2014-04-23 Thread Mayur Rustagi
as of now >>>> 1. Most of the time a lot of customers dont want the full power but >>>> they want something dead simple with which they can do dsl. They end up >>>> using Hive for a lot of ETL just cause its SQL & they understand it. Pig is >>>> clos

Re: Pig on Spark

2014-04-23 Thread suman bharadwaj
s close >>> & wraps up a lot of framework level semantics away from the user & lets him >>> focus on data flow >>> 2. Some have codebases in Pig already & are just looking to do it >>> faster. I am yet to benchmark that on Pig on spark. >&

Re: Pig on Spark

2014-04-23 Thread Mayur Rustagi
>> & wraps up a lot of framework level semantics away from the user & lets him >> focus on data flow >> 2. Some have codebases in Pig already & are just looking to do it faster. >> I am yet to benchmark that on Pig on spark. >> >> I agree that pig on s

Re: Pig on Spark

2014-04-23 Thread suman bharadwaj
t looking to do it faster. > I am yet to benchmark that on Pig on spark. > > I agree that pig on spark cannot solve a lot problems but it can solve > some without forcing the end customer to do anything even close to coding, > I believe thr is quite some value in making Spark accessibl

Re: Pig on Spark

2014-04-23 Thread Mayur Rustagi
rk level semantics away from the user & lets him focus on data flow 2. Some have codebases in Pig already & are just looking to do it faster. I am yet to benchmark that on Pig on spark. I agree that pig on spark cannot solve a lot problems but it can solve some without forcing the end custome

Re: Pig on Spark

2014-04-23 Thread Bharath Mundlapudi
, functional and complete language power! If we can improve first 3 lines, here you go, you have most powerful DSL to solve data problems. -Bharath On Mon, Mar 10, 2014 at 11:00 PM, Xiangrui Meng wrote: > Hi Sameer, > > Lin (cc'ed) could also give you some updates about

Re: Pig on Spark

2014-04-23 Thread lalit1303
/Pig-on-Spark-tp2367p4668.html Sent from the Apache Spark User List mailing list archive at Nabble.com.

Re: Pig on Spark

2014-04-10 Thread Mayur Rustagi
t; I am really interested in making it stable & production quality. >>>> >>>> Regards >>>> Mayur Rustagi >>>> Ph: +1 (760) 203 3257 >>>> http://www.sigmoidanalytics.com >>>> @mayur_rustagi <https://twitter.com/mayur_rustagi>

Re: Pig on Spark

2014-04-10 Thread Konstantin Kudryavtsev
<https://twitter.com/mayur_rustagi> >>> >>> >>> >>> On Fri, Mar 14, 2014 at 11:53 AM, Julien Le Dem wrote: >>> >>>> Hi Mayur, >>>> Are you going to the Pig meetup this afternoon? >>>> http://www.meetup.com/PigUser/even

Re: Pig on Spark

2014-04-08 Thread Mayur Rustagi
> Are you going to the Pig meetup this afternoon? >>> http://www.meetup.com/PigUser/events/160604192/ >>> Aniket and I will be there. >>> We would be happy to chat about Pig-on-Spark >>> >>> >>> >>> On Tue, Mar 11, 2014 at 8:56 AM, Ma

Re: Pig on Spark

2014-03-25 Thread lalit1303
oke(Method.java:622) at org.apache.hadoop.util.RunJar.main(RunJar.java:212) -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Pig-on-Spark-tp2367p3187.html Sent from the Ap

Re: Pig on Spark

2014-03-14 Thread Aniket Mokashi
tup.com/PigUser/events/160604192/ >> Aniket and I will be there. >> We would be happy to chat about Pig-on-Spark >> >> >> >> On Tue, Mar 11, 2014 at 8:56 AM, Mayur Rustagi >> wrote: >> >>> Hi Lin, >>> We are working on getting Pig on

Re: Pig on Spark

2014-03-14 Thread Mayur Rustagi
On Fri, Mar 14, 2014 at 11:53 AM, Julien Le Dem wrote: > Hi Mayur, > Are you going to the Pig meetup this afternoon? > http://www.meetup.com/PigUser/events/160604192/ > Aniket and I will be there. > We would be happy to chat about Pig-on-Spark > > > > On Tue, Mar 11,

Re: Pig on Spark

2014-03-14 Thread Julien Le Dem
Hi Mayur, Are you going to the Pig meetup this afternoon? http://www.meetup.com/PigUser/events/160604192/ Aniket and I will be there. We would be happy to chat about Pig-on-Spark On Tue, Mar 11, 2014 at 8:56 AM, Mayur Rustagi wrote: > Hi Lin, > We are working on getting Pig on spark func

Re: Pig on Spark

2014-03-11 Thread Mayur Rustagi
Hi Lin, We are working on getting Pig on spark functional with 0.8.0, have you got it working on any spark version ? Also what all functionality works on it? Regards Mayur Mayur Rustagi Ph: +1 (760) 203 3257 http://www.sigmoidanalytics.com @mayur_rustagi <https://twitter.com/mayur_rustagi>

Re: Pig on Spark

2014-03-10 Thread Xiangrui Meng
Hi Sameer, Lin (cc'ed) could also give you some updates about Pig on Spark development on her side. Best, Xiangrui On Mon, Mar 10, 2014 at 12:52 PM, Sameer Tilak wrote: > Hi Mayur, > We are planning to upgrade our distribution MR1> MR2 (YARN) and the goal is > to get SPROK set

RE: Pig on Spark

2014-03-10 Thread Sameer Tilak
Hi Mayur,We are planning to upgrade our distribution MR1> MR2 (YARN) and the goal is to get SPROK set up next month. I will keep you posted. Can you please keep me informed about your progress as well. From: mayur.rust...@gmail.com Date: Mon, 10 Mar 2014 11:47:56 -0700 Subject: Re: Pig on Sp

Re: Pig on Spark

2014-03-10 Thread Mayur Rustagi
wrote: > Hi Aniket, > Many thanks! I will check this out. > > -- > Date: Thu, 6 Mar 2014 13:46:50 -0800 > Subject: Re: Pig on Spark > From: aniket...@gmail.com > To: user@spark.apache.org; tgraves...@yahoo.com > > > There is some wo

RE: Pig on Spark

2014-03-06 Thread Sameer Tilak
Hi Aniket,Many thanks! I will check this out. Date: Thu, 6 Mar 2014 13:46:50 -0800 Subject: Re: Pig on Spark From: aniket...@gmail.com To: user@spark.apache.org; tgraves...@yahoo.com There is some work to make this work on yarn at https://github.com/aniket486/pig. (So, compile pig with ant

Re: Pig on Spark

2014-03-06 Thread Aniket Mokashi
, Sameer Tilak > wrote: > Hi everyone, > > We are using to Pig to build our data pipeline. I came across Spork -- Pig > on Spark at: https://github.com/dvryaboy/pig and not sure if it is still > active. > > Can someone please let me know the status of Spork or any other eff

Re: Pig on Spark

2014-03-06 Thread Tom Graves
to build our data pipeline. I came across Spork -- Pig on Spark at: https://github.com/dvryaboy/pig and not sure if it is still active.    Can someone please let me know the status of Spork or any other effort that will let us run Pig on Spark? We can significantly benefit by using Spark, but we wo

Pig on Spark

2014-03-06 Thread Sameer Tilak
Hi everyone, We are using to Pig to build our data pipeline. I came across Spork -- Pig on Spark at: https://github.com/dvryaboy/pig and not sure if it is still active. Can someone please let me know the status of Spork or any other effort that will let us run Pig on Spark? We can