Wow Jeremy ,

Thanks for detailed coverage. Seems you guys did lots of good work along
with fun.

-----------
Sent from Mobile , short and crisp.
On 12-May-2012 11:53 PM, "Jeremy Hanna" <[email protected]> wrote:

> Thanks again to Twitter for doing their event and inspiring ours.  I just
> wanted to report on some things we did in Austin for any interested.  We
> had a good turnout of about 30 people.
>
> Kevin Safford presented an introduction to Pig, or Pig 101.  The slides
> are available here:
> http://www.slideshare.net/ktsafford/dachis-group-pigout101-12895911
>
> Timothy Potter down from Colorado gave a presentation on intermediate Pig,
> or Pig 202.  His slides are available here:
> http://www.slideshare.net/thelabdude/dachis-group-pig-hackday-pig-202
>
> Clint Miller gave an introduction to unit testing with Pig with these
> slides: http://www.slideshare.net/clintmiller1/unit-testing-pig
>
> After that we had some lunch and linked up remotely for a bit to the
> Twitter hackday in the Bay Area.  Their group is mostly Pig committers and
> contributors so they worked on Pig tickets.  One thing that Twitter
> opensourced as part of the event was a workflow visualization tool called
> Ambrose, https://github.com/twitter/ambrose
>
> Also mentioned was Alan Gates excellent reference Programming Pig, the web
> version found here:
> http://ofps.oreilly.com/titles/9781449302641/index.html
> We started the afternoon with a list of things we could work on:
>
>        • Pig mahout integration (pigout) led by Timothy Potter
>        • Pig Unit improvments led by Clint Miller
>        • David Boney wanted to get his KDD data preparation going with Pig
> for a competition
>        • Kevin wanted to help people get the presentation examples running
>        • Brandon Kearby led a group on helping get the IntelliJ IDEA Pig
> plugin working.
>        • Josh Levy wanted to see about getting grunt to recognize
> parameters passed in.
>        • Josh also wanted to look more at the python udf scripting and see
> if it could be improved.
>        • John Prior wanted see if there could be a grunt pretty print when
> using describe
>        • John also wanted to see if bash command history facilities could
> be added to grunt
>        • John also brought up that knime is a really cool visual workflow
> creator for machine learning that could also could be developed for Pig.
>        • The CassandraStorage loadstorefunc was also brought up as
> something Brandon Williams might work on, specifically the way to have it
> automatically use secondary indexes.
>
> What actually happened?
>
> Tim is going to continue working on the pig-vector integration into Mahout
> pending some feedback from Tim and the mahout folks.
>
> Clint worked on getting Pig 0.10 branch downloaded and built locally in
> order to have something to patch against for the pig unit improvements
> outlined on this ticket:  https://issues.apache.org/jira/browse/PIG-2692
>
> David Boney got his data loaded up in CFS, the Cassandra file system and
> made some progress there.
>
> Several people talked about Pig generally getting things running on their
> own laptops and environments.
>
> Brandon Kearby and others forked
> https://github.com/brandonkearby/three-little-piggies and the jar in that
> project can now be added to your IntelliJ IDEA plugins directory to
> associate .pig files and provide source coloring.  There's still some work
> to do there, but it's nice to have that working and available for IntelliJ
> 11 users.
>
> Josh Levy got some ideas together with a couple of other attendees on how
> to improve the Pig/Python UDF scripting.  Josh and Jeremy contacted Julien
> from Twitter who had written the python udf support and he is reviewing
> Josh's proposed changes with the possibility of creating a ticket for it.
>
> Grunt pretty print?  Coincidentally, someone in the Bay Area had the same
> thought and independent of our efforts created a ticket along with
> submitted a patch to do just that:
> https://issues.apache.org/jira/browse/PIG-2697
>
> Brandon Williams is working on the CassandraStorage ticket -
> https://issues.apache.org/jira/browse/CASSANDRA-4238
>
> Besides that there was great interaction among everyone until people went
> their own ways around 4 PM.  Thanks for Twitter for doing their hackathon.
>  We didn't interact too much with them because their group was more
> advanced and we didn't want to slow them down.  Several of us chatted in
> the #hadoop-pig channel on freenode (IRC) as well as Russell Jurney and
> Jonathan Coveney from the Bay Area.
>
> Cheers,
>
> Jeremy
>

Reply via email to