Re: Hive projects for Google Summer of code 2012 ?

2012-05-02 Thread Carl Steinbach
Ed,

Did you see this?

http://google-melange.appspot.com/gsoc/project/google/gsoc2012/bharathv/29002

On Wed, Mar 7, 2012 at 12:15 PM, Edward Capriolo edlinuxg...@gmail.comwrote:

 For the record we are 0-2 in gsoc.

 On Wed, Mar 7, 2012 at 1:42 AM, Namit Jain nj...@fb.com wrote:
  Done
 
 
  On 3/6/12 10:32 PM, bharath vissapragada
  bharathvissapragada1...@gmail.com wrote:
 
 Hi Namit,
 
 Is it possible to add Hive-1362 [1] to the list? I am interested in
 working
 on that and I've contacted Ashutosh regarding that.
 
 Thanks and Regards,
 
 
 [1]  https://issues.apache.org/jira/browse/HIVE-1362
 
 On Wed, Mar 7, 2012 at 6:46 AM, Namit Jain nj...@fb.com wrote:
 
 
 
 https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=12319
 27
  0
 
 
 
  0 was in the next line by mistake
  Look for HIVE jiras.
 
 
 
  On 3/6/12 7:37 PM, Alexis De La Cruz Toledo alexis...@gmail.com
 wrote:
 
  Sorry Mr. Namit Jain I can't see the jiras.
  
  Thanks
  
  El 6 de marzo de 2012 19:41, Namit Jain nj...@fb.com escribió:
  
   I filed a couple of gsoc 2012 jiras.
  
   Please submit your proposal if you are interested.
  
  
  
 
 
 https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=12319
  27
   0
  
  
  
  
   Thanks,
   -namit
  
  
   On 2/7/12 6:32 PM, Alexis De La Cruz Toledo alexis...@gmail.com
  wrote:
  
   Hi Namit Jain, I'm Alexis, I'm a master student, I'm studying in
   Cinvestav,
   df Mexico.
   I'm interesting in collaborating with Hive. I'd like to do my
 thesis
  work
   in something about Hive.
   I'd like to participate in Google Summer of Code 2012 too.
   
   The issues that you propose I find interesting, particularly in the
  next
   issues:
   * The topic 2. Indexed Joins.
   * [PO] Optimize Joins using Bloom Filters of this page
   https://cwiki.apache.org/confluence/display/Hive/Roadmap.
   
   Can you tell me something more about it?
   What is problem to be solved?
   What benefits we hope to gain?
   
   This because I want to raise my thesis problem.
   By another hand, can you be my mentor in Google Summer of Code, if
 I
  work
   with this topics?
   
   Thanks.
   
   El 5 de febrero de 2012 19:58, Namit Jain nj...@fb.com escribió:
   
Hi Alexis/Bharath,
   
Great to see your interests. If you looking for ideas, some
 things
  that
will be very useful are:
   
1. Removing the map-join hint completely.
  Moving all processing to runtime.
  Currently, bucketed map joins and sort merge joins are
 completely
driven off hints.
  It would be very helpful to the community, and also clean up
 the
  code
   a
lot.
   
2. Indexed Joins.
  Something that would be really useful -
  If the basic infrastructure is ready, it can eventually be used
 to
   join
tables
  stored outside also (say Hbase).
   
3. Metastore understanding hierarchy.
  For eg: if a table is partitioned by ds and hr,
  A valid partition on ds does not exist. This is a very common
  usecase
on many
  applications, and the current work-around is to have signal
 tables
  for
ds
  un-necessarily complicating the process.
   
   
If you are interested, I would be happy to provide more details.
   
   
Thanks,
-namit
   
   
On 2/4/12 11:57 AM, Ashutosh Chauhan hashut...@apache.org
 wrote:
   
Hi Alexis,

Great to see your interest. Feel free to come up with concrete
  proposal
and
submit to GSoC. Its certainly heartening to see folks interested
 in
   making
contributions to the Hive Project.

Ashutosh
On Sat, Feb 4, 2012 at 10:48, Alexis De La Cruz Toledo
alexis...@gmail.comwrote:

 Hi Ashutosh, I'm interesting in hive,
 I'd like to improve the compilation process,
 I have been that the plan query tree generated
 by Hive can be optimized, and I'd like
 to participate in Google Summer of code 2012.
 What do you say?

 Regards.


 El 4 de febrero de 2012 12:29, Ashutosh Chauhan 
   hashut...@apache.org
 escribió:

  Hey Bharath,
 
  Great to see your enthusiasm for Hive! I would be happy to
  mentor
   you
for
  the project.  For the start, you can take a look at
  https://cwiki.apache.org/confluence/display/Hive/Roadmapfor
 a
   list
of
  open
  projects in Hive. The document is bit dated, so some of
 those
   projects
 may
  not be relevant. But, its a good source to start with to see
 if
   any of
  these projects excite you.
 
  Hope it helps,
  Ashutosh
 
  On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
  bharathvissapragada1...@gmail.com wrote:
 
   Hey list, devs,
  
   Google summer of code, 2012 's notification [1] has been
  released
and
   mentoring organizations can submit their proposals to
 Google
  for
  opensource
   projects.
  
   Any of the devs interested in mentoring students 

Re: Hive projects for Google Summer of code 2012 ?

2012-05-02 Thread Edward Capriolo
I meant 0-2 over the past few years. I was wrong in any case. My bad.

On Wed, May 2, 2012 at 2:49 AM, Carl Steinbach c...@cloudera.com wrote:
 Ed,

 Did you see this?

 http://google-melange.appspot.com/gsoc/project/google/gsoc2012/bharathv/29002

 On Wed, Mar 7, 2012 at 12:15 PM, Edward Capriolo edlinuxg...@gmail.comwrote:

 For the record we are 0-2 in gsoc.

 On Wed, Mar 7, 2012 at 1:42 AM, Namit Jain nj...@fb.com wrote:
  Done
 
 
  On 3/6/12 10:32 PM, bharath vissapragada
  bharathvissapragada1...@gmail.com wrote:
 
 Hi Namit,
 
 Is it possible to add Hive-1362 [1] to the list? I am interested in
 working
 on that and I've contacted Ashutosh regarding that.
 
 Thanks and Regards,
 
 
 [1]  https://issues.apache.org/jira/browse/HIVE-1362
 
 On Wed, Mar 7, 2012 at 6:46 AM, Namit Jain nj...@fb.com wrote:
 
 
 
 https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=12319
 27
  0
 
 
 
  0 was in the next line by mistake
  Look for HIVE jiras.
 
 
 
  On 3/6/12 7:37 PM, Alexis De La Cruz Toledo alexis...@gmail.com
 wrote:
 
  Sorry Mr. Namit Jain I can't see the jiras.
  
  Thanks
  
  El 6 de marzo de 2012 19:41, Namit Jain nj...@fb.com escribió:
  
   I filed a couple of gsoc 2012 jiras.
  
   Please submit your proposal if you are interested.
  
  
  
 
 
 https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=12319
  27
   0
  
  
  
  
   Thanks,
   -namit
  
  
   On 2/7/12 6:32 PM, Alexis De La Cruz Toledo alexis...@gmail.com
  wrote:
  
   Hi Namit Jain, I'm Alexis, I'm a master student, I'm studying in
   Cinvestav,
   df Mexico.
   I'm interesting in collaborating with Hive. I'd like to do my
 thesis
  work
   in something about Hive.
   I'd like to participate in Google Summer of Code 2012 too.
   
   The issues that you propose I find interesting, particularly in the
  next
   issues:
   * The topic 2. Indexed Joins.
   * [PO] Optimize Joins using Bloom Filters of this page
   https://cwiki.apache.org/confluence/display/Hive/Roadmap.
   
   Can you tell me something more about it?
   What is problem to be solved?
   What benefits we hope to gain?
   
   This because I want to raise my thesis problem.
   By another hand, can you be my mentor in Google Summer of Code, if
 I
  work
   with this topics?
   
   Thanks.
   
   El 5 de febrero de 2012 19:58, Namit Jain nj...@fb.com escribió:
   
Hi Alexis/Bharath,
   
Great to see your interests. If you looking for ideas, some
 things
  that
will be very useful are:
   
1. Removing the map-join hint completely.
  Moving all processing to runtime.
  Currently, bucketed map joins and sort merge joins are
 completely
driven off hints.
  It would be very helpful to the community, and also clean up
 the
  code
   a
lot.
   
2. Indexed Joins.
  Something that would be really useful -
  If the basic infrastructure is ready, it can eventually be used
 to
   join
tables
  stored outside also (say Hbase).
   
3. Metastore understanding hierarchy.
  For eg: if a table is partitioned by ds and hr,
  A valid partition on ds does not exist. This is a very common
  usecase
on many
  applications, and the current work-around is to have signal
 tables
  for
ds
  un-necessarily complicating the process.
   
   
If you are interested, I would be happy to provide more details.
   
   
Thanks,
-namit
   
   
On 2/4/12 11:57 AM, Ashutosh Chauhan hashut...@apache.org
 wrote:
   
Hi Alexis,

Great to see your interest. Feel free to come up with concrete
  proposal
and
submit to GSoC. Its certainly heartening to see folks interested
 in
   making
contributions to the Hive Project.

Ashutosh
On Sat, Feb 4, 2012 at 10:48, Alexis De La Cruz Toledo
alexis...@gmail.comwrote:

 Hi Ashutosh, I'm interesting in hive,
 I'd like to improve the compilation process,
 I have been that the plan query tree generated
 by Hive can be optimized, and I'd like
 to participate in Google Summer of code 2012.
 What do you say?

 Regards.


 El 4 de febrero de 2012 12:29, Ashutosh Chauhan 
   hashut...@apache.org
 escribió:

  Hey Bharath,
 
  Great to see your enthusiasm for Hive! I would be happy to
  mentor
   you
for
  the project.  For the start, you can take a look at
  https://cwiki.apache.org/confluence/display/Hive/Roadmapfor
 a
   list
of
  open
  projects in Hive. The document is bit dated, so some of
 those
   projects
 may
  not be relevant. But, its a good source to start with to see
 if
   any of
  these projects excite you.
 
  Hope it helps,
  Ashutosh
 
  On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
  bharathvissapragada1...@gmail.com wrote:
 
   Hey list, devs,
  
   Google summer of code, 2012 's notification [1] has been
  released
and
   mentoring 

Re: Hive projects for Google Summer of code 2012 ?

2012-03-07 Thread Edward Capriolo
For the record we are 0-2 in gsoc.

On Wed, Mar 7, 2012 at 1:42 AM, Namit Jain nj...@fb.com wrote:
 Done


 On 3/6/12 10:32 PM, bharath vissapragada
 bharathvissapragada1...@gmail.com wrote:

Hi Namit,

Is it possible to add Hive-1362 [1] to the list? I am interested in
working
on that and I've contacted Ashutosh regarding that.

Thanks and Regards,


[1]  https://issues.apache.org/jira/browse/HIVE-1362

On Wed, Mar 7, 2012 at 6:46 AM, Namit Jain nj...@fb.com wrote:


https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=12319
27
 0



 0 was in the next line by mistake
 Look for HIVE jiras.



 On 3/6/12 7:37 PM, Alexis De La Cruz Toledo alexis...@gmail.com
wrote:

 Sorry Mr. Namit Jain I can't see the jiras.
 
 Thanks
 
 El 6 de marzo de 2012 19:41, Namit Jain nj...@fb.com escribió:
 
  I filed a couple of gsoc 2012 jiras.
 
  Please submit your proposal if you are interested.
 
 
 

https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=12319
 27
  0
 
 
 
 
  Thanks,
  -namit
 
 
  On 2/7/12 6:32 PM, Alexis De La Cruz Toledo alexis...@gmail.com
 wrote:
 
  Hi Namit Jain, I'm Alexis, I'm a master student, I'm studying in
  Cinvestav,
  df Mexico.
  I'm interesting in collaborating with Hive. I'd like to do my thesis
 work
  in something about Hive.
  I'd like to participate in Google Summer of Code 2012 too.
  
  The issues that you propose I find interesting, particularly in the
 next
  issues:
  * The topic 2. Indexed Joins.
  * [PO] Optimize Joins using Bloom Filters of this page
  https://cwiki.apache.org/confluence/display/Hive/Roadmap.
  
  Can you tell me something more about it?
  What is problem to be solved?
  What benefits we hope to gain?
  
  This because I want to raise my thesis problem.
  By another hand, can you be my mentor in Google Summer of Code, if I
 work
  with this topics?
  
  Thanks.
  
  El 5 de febrero de 2012 19:58, Namit Jain nj...@fb.com escribió:
  
   Hi Alexis/Bharath,
  
   Great to see your interests. If you looking for ideas, some things
 that
   will be very useful are:
  
   1. Removing the map-join hint completely.
     Moving all processing to runtime.
     Currently, bucketed map joins and sort merge joins are
completely
   driven off hints.
     It would be very helpful to the community, and also clean up the
 code
  a
   lot.
  
   2. Indexed Joins.
     Something that would be really useful -
     If the basic infrastructure is ready, it can eventually be used
to
  join
   tables
     stored outside also (say Hbase).
  
   3. Metastore understanding hierarchy.
     For eg: if a table is partitioned by ds and hr,
     A valid partition on ds does not exist. This is a very common
 usecase
   on many
     applications, and the current work-around is to have signal
tables
 for
   ds
     un-necessarily complicating the process.
  
  
   If you are interested, I would be happy to provide more details.
  
  
   Thanks,
   -namit
  
  
   On 2/4/12 11:57 AM, Ashutosh Chauhan hashut...@apache.org
wrote:
  
   Hi Alexis,
   
   Great to see your interest. Feel free to come up with concrete
 proposal
   and
   submit to GSoC. Its certainly heartening to see folks interested
in
  making
   contributions to the Hive Project.
   
   Ashutosh
   On Sat, Feb 4, 2012 at 10:48, Alexis De La Cruz Toledo
   alexis...@gmail.comwrote:
   
Hi Ashutosh, I'm interesting in hive,
I'd like to improve the compilation process,
I have been that the plan query tree generated
by Hive can be optimized, and I'd like
to participate in Google Summer of code 2012.
What do you say?
   
Regards.
   
   
El 4 de febrero de 2012 12:29, Ashutosh Chauhan 
  hashut...@apache.org
escribió:
   
 Hey Bharath,

 Great to see your enthusiasm for Hive! I would be happy to
 mentor
  you
   for
 the project.  For the start, you can take a look at
 https://cwiki.apache.org/confluence/display/Hive/Roadmap for
a
  list
   of
 open
 projects in Hive. The document is bit dated, so some of those
  projects
may
 not be relevant. But, its a good source to start with to see
if
  any of
 these projects excite you.

 Hope it helps,
 Ashutosh

 On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
 bharathvissapragada1...@gmail.com wrote:

  Hey list, devs,
 
  Google summer of code, 2012 's notification [1] has been
 released
   and
  mentoring organizations can submit their proposals to
Google
 for
 opensource
  projects.
 
  Any of the devs interested in mentoring students on Hive
  projects (
   any
  critical jiras etc.) ?  It would be great if any of the
devs
 (dev
   list
  cc'ed) can do that on behalf of ASF .
 
  It would be a great opportunity for  many students to
 contribute
patches
  to Hadoop and Hive and make their summer vacation fruitful.
 
  [1]
  http://google-melange.appspot.com/gsoc/events/google/gsoc2012

Re: Hive projects for Google Summer of code 2012 ?

2012-03-06 Thread Namit Jain
I filed a couple of gsoc 2012 jiras.

Please submit your proposal if you are interested.

https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=1231927
0




Thanks,
-namit


On 2/7/12 6:32 PM, Alexis De La Cruz Toledo alexis...@gmail.com wrote:

Hi Namit Jain, I'm Alexis, I'm a master student, I'm studying in
Cinvestav,
df Mexico.
I'm interesting in collaborating with Hive. I'd like to do my thesis work
in something about Hive.
I'd like to participate in Google Summer of Code 2012 too.

The issues that you propose I find interesting, particularly in the next
issues:
* The topic 2. Indexed Joins.
* [PO] Optimize Joins using Bloom Filters of this page
https://cwiki.apache.org/confluence/display/Hive/Roadmap.

Can you tell me something more about it?
What is problem to be solved?
What benefits we hope to gain?

This because I want to raise my thesis problem.
By another hand, can you be my mentor in Google Summer of Code, if I work
with this topics?

Thanks.

El 5 de febrero de 2012 19:58, Namit Jain nj...@fb.com escribió:

 Hi Alexis/Bharath,

 Great to see your interests. If you looking for ideas, some things that
 will be very useful are:

 1. Removing the map-join hint completely.
   Moving all processing to runtime.
   Currently, bucketed map joins and sort merge joins are completely
 driven off hints.
   It would be very helpful to the community, and also clean up the code
a
 lot.

 2. Indexed Joins.
   Something that would be really useful -
   If the basic infrastructure is ready, it can eventually be used to
join
 tables
   stored outside also (say Hbase).

 3. Metastore understanding hierarchy.
   For eg: if a table is partitioned by ds and hr,
   A valid partition on ds does not exist. This is a very common usecase
 on many
   applications, and the current work-around is to have signal tables for
 ds
   un-necessarily complicating the process.


 If you are interested, I would be happy to provide more details.


 Thanks,
 -namit


 On 2/4/12 11:57 AM, Ashutosh Chauhan hashut...@apache.org wrote:

 Hi Alexis,
 
 Great to see your interest. Feel free to come up with concrete proposal
 and
 submit to GSoC. Its certainly heartening to see folks interested in
making
 contributions to the Hive Project.
 
 Ashutosh
 On Sat, Feb 4, 2012 at 10:48, Alexis De La Cruz Toledo
 alexis...@gmail.comwrote:
 
  Hi Ashutosh, I'm interesting in hive,
  I'd like to improve the compilation process,
  I have been that the plan query tree generated
  by Hive can be optimized, and I'd like
  to participate in Google Summer of code 2012.
  What do you say?
 
  Regards.
 
 
  El 4 de febrero de 2012 12:29, Ashutosh Chauhan hashut...@apache.org
  escribió:
 
   Hey Bharath,
  
   Great to see your enthusiasm for Hive! I would be happy to mentor
you
 for
   the project.  For the start, you can take a look at
   https://cwiki.apache.org/confluence/display/Hive/Roadmap for a list
 of
   open
   projects in Hive. The document is bit dated, so some of those
projects
  may
   not be relevant. But, its a good source to start with to see if
any of
   these projects excite you.
  
   Hope it helps,
   Ashutosh
  
   On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
   bharathvissapragada1...@gmail.com wrote:
  
Hey list, devs,
   
Google summer of code, 2012 's notification [1] has been released
 and
mentoring organizations can submit their proposals to Google for
   opensource
projects.
   
Any of the devs interested in mentoring students on Hive
projects (
 any
critical jiras etc.) ?  It would be great if any of the devs (dev
 list
cc'ed) can do that on behalf of ASF .
   
It would be a great opportunity for  many students to contribute
  patches
to Hadoop and Hive and make their summer vacation fruitful.
   
[1] http://google-melange.appspot.com/gsoc/events/google/gsoc2012
   
Thanks and Regards,
Bharath .V
w:http://researchweb.iiit.ac.in/~bharath.v
   
  
 
 
 
  --
  Ing. Alexis de la Cruz Toledo.
  *Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro
Zacatenco.
  México,
  D.F, 07360 *
  *CINVESTAV, DF.*
 




-- 
Ing. Alexis de la Cruz Toledo.
*Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro Zacatenco.
México,
D.F, 07360 *
*CINVESTAV, DF.*



Re: Hive projects for Google Summer of code 2012 ?

2012-03-06 Thread Alexis De La Cruz Toledo
Sorry Mr. Namit Jain I can't see the jiras.

Thanks

El 6 de marzo de 2012 19:41, Namit Jain nj...@fb.com escribió:

 I filed a couple of gsoc 2012 jiras.

 Please submit your proposal if you are interested.

 https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=1231927
 0




 Thanks,
 -namit


 On 2/7/12 6:32 PM, Alexis De La Cruz Toledo alexis...@gmail.com wrote:

 Hi Namit Jain, I'm Alexis, I'm a master student, I'm studying in
 Cinvestav,
 df Mexico.
 I'm interesting in collaborating with Hive. I'd like to do my thesis work
 in something about Hive.
 I'd like to participate in Google Summer of Code 2012 too.
 
 The issues that you propose I find interesting, particularly in the next
 issues:
 * The topic 2. Indexed Joins.
 * [PO] Optimize Joins using Bloom Filters of this page
 https://cwiki.apache.org/confluence/display/Hive/Roadmap.
 
 Can you tell me something more about it?
 What is problem to be solved?
 What benefits we hope to gain?
 
 This because I want to raise my thesis problem.
 By another hand, can you be my mentor in Google Summer of Code, if I work
 with this topics?
 
 Thanks.
 
 El 5 de febrero de 2012 19:58, Namit Jain nj...@fb.com escribió:
 
  Hi Alexis/Bharath,
 
  Great to see your interests. If you looking for ideas, some things that
  will be very useful are:
 
  1. Removing the map-join hint completely.
Moving all processing to runtime.
Currently, bucketed map joins and sort merge joins are completely
  driven off hints.
It would be very helpful to the community, and also clean up the code
 a
  lot.
 
  2. Indexed Joins.
Something that would be really useful -
If the basic infrastructure is ready, it can eventually be used to
 join
  tables
stored outside also (say Hbase).
 
  3. Metastore understanding hierarchy.
For eg: if a table is partitioned by ds and hr,
A valid partition on ds does not exist. This is a very common usecase
  on many
applications, and the current work-around is to have signal tables for
  ds
un-necessarily complicating the process.
 
 
  If you are interested, I would be happy to provide more details.
 
 
  Thanks,
  -namit
 
 
  On 2/4/12 11:57 AM, Ashutosh Chauhan hashut...@apache.org wrote:
 
  Hi Alexis,
  
  Great to see your interest. Feel free to come up with concrete proposal
  and
  submit to GSoC. Its certainly heartening to see folks interested in
 making
  contributions to the Hive Project.
  
  Ashutosh
  On Sat, Feb 4, 2012 at 10:48, Alexis De La Cruz Toledo
  alexis...@gmail.comwrote:
  
   Hi Ashutosh, I'm interesting in hive,
   I'd like to improve the compilation process,
   I have been that the plan query tree generated
   by Hive can be optimized, and I'd like
   to participate in Google Summer of code 2012.
   What do you say?
  
   Regards.
  
  
   El 4 de febrero de 2012 12:29, Ashutosh Chauhan 
 hashut...@apache.org
   escribió:
  
Hey Bharath,
   
Great to see your enthusiasm for Hive! I would be happy to mentor
 you
  for
the project.  For the start, you can take a look at
https://cwiki.apache.org/confluence/display/Hive/Roadmap for a
 list
  of
open
projects in Hive. The document is bit dated, so some of those
 projects
   may
not be relevant. But, its a good source to start with to see if
 any of
these projects excite you.
   
Hope it helps,
Ashutosh
   
On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
bharathvissapragada1...@gmail.com wrote:
   
 Hey list, devs,

 Google summer of code, 2012 's notification [1] has been released
  and
 mentoring organizations can submit their proposals to Google for
opensource
 projects.

 Any of the devs interested in mentoring students on Hive
 projects (
  any
 critical jiras etc.) ?  It would be great if any of the devs (dev
  list
 cc'ed) can do that on behalf of ASF .

 It would be a great opportunity for  many students to contribute
   patches
 to Hadoop and Hive and make their summer vacation fruitful.

 [1]
 http://google-melange.appspot.com/gsoc/events/google/gsoc2012

 Thanks and Regards,
 Bharath .V
 w:http://researchweb.iiit.ac.in/~bharath.v

   
  
  
  
   --
   Ing. Alexis de la Cruz Toledo.
   *Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro
 Zacatenco.
   México,
   D.F, 07360 *
   *CINVESTAV, DF.*
  
 
 
 
 
 --
 Ing. Alexis de la Cruz Toledo.
 *Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro Zacatenco.
 México,
 D.F, 07360 *
 *CINVESTAV, DF.*




-- 
Ing. Alexis de la Cruz Toledo.
*Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro Zacatenco. México,
D.F, 07360 *
*CINVESTAV, DF.*


Re: Hive projects for Google Summer of code 2012 ?

2012-03-06 Thread Namit Jain
https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=1231927
0



0 was in the next line by mistake
Look for HIVE jiras.



On 3/6/12 7:37 PM, Alexis De La Cruz Toledo alexis...@gmail.com wrote:

Sorry Mr. Namit Jain I can't see the jiras.

Thanks

El 6 de marzo de 2012 19:41, Namit Jain nj...@fb.com escribió:

 I filed a couple of gsoc 2012 jiras.

 Please submit your proposal if you are interested.

 
https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=12319
27
 0




 Thanks,
 -namit


 On 2/7/12 6:32 PM, Alexis De La Cruz Toledo alexis...@gmail.com
wrote:

 Hi Namit Jain, I'm Alexis, I'm a master student, I'm studying in
 Cinvestav,
 df Mexico.
 I'm interesting in collaborating with Hive. I'd like to do my thesis
work
 in something about Hive.
 I'd like to participate in Google Summer of Code 2012 too.
 
 The issues that you propose I find interesting, particularly in the
next
 issues:
 * The topic 2. Indexed Joins.
 * [PO] Optimize Joins using Bloom Filters of this page
 https://cwiki.apache.org/confluence/display/Hive/Roadmap.
 
 Can you tell me something more about it?
 What is problem to be solved?
 What benefits we hope to gain?
 
 This because I want to raise my thesis problem.
 By another hand, can you be my mentor in Google Summer of Code, if I
work
 with this topics?
 
 Thanks.
 
 El 5 de febrero de 2012 19:58, Namit Jain nj...@fb.com escribió:
 
  Hi Alexis/Bharath,
 
  Great to see your interests. If you looking for ideas, some things
that
  will be very useful are:
 
  1. Removing the map-join hint completely.
Moving all processing to runtime.
Currently, bucketed map joins and sort merge joins are completely
  driven off hints.
It would be very helpful to the community, and also clean up the
code
 a
  lot.
 
  2. Indexed Joins.
Something that would be really useful -
If the basic infrastructure is ready, it can eventually be used to
 join
  tables
stored outside also (say Hbase).
 
  3. Metastore understanding hierarchy.
For eg: if a table is partitioned by ds and hr,
A valid partition on ds does not exist. This is a very common
usecase
  on many
applications, and the current work-around is to have signal tables
for
  ds
un-necessarily complicating the process.
 
 
  If you are interested, I would be happy to provide more details.
 
 
  Thanks,
  -namit
 
 
  On 2/4/12 11:57 AM, Ashutosh Chauhan hashut...@apache.org wrote:
 
  Hi Alexis,
  
  Great to see your interest. Feel free to come up with concrete
proposal
  and
  submit to GSoC. Its certainly heartening to see folks interested in
 making
  contributions to the Hive Project.
  
  Ashutosh
  On Sat, Feb 4, 2012 at 10:48, Alexis De La Cruz Toledo
  alexis...@gmail.comwrote:
  
   Hi Ashutosh, I'm interesting in hive,
   I'd like to improve the compilation process,
   I have been that the plan query tree generated
   by Hive can be optimized, and I'd like
   to participate in Google Summer of code 2012.
   What do you say?
  
   Regards.
  
  
   El 4 de febrero de 2012 12:29, Ashutosh Chauhan 
 hashut...@apache.org
   escribió:
  
Hey Bharath,
   
Great to see your enthusiasm for Hive! I would be happy to
mentor
 you
  for
the project.  For the start, you can take a look at
https://cwiki.apache.org/confluence/display/Hive/Roadmap for a
 list
  of
open
projects in Hive. The document is bit dated, so some of those
 projects
   may
not be relevant. But, its a good source to start with to see if
 any of
these projects excite you.
   
Hope it helps,
Ashutosh
   
On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
bharathvissapragada1...@gmail.com wrote:
   
 Hey list, devs,

 Google summer of code, 2012 's notification [1] has been
released
  and
 mentoring organizations can submit their proposals to Google
for
opensource
 projects.

 Any of the devs interested in mentoring students on Hive
 projects (
  any
 critical jiras etc.) ?  It would be great if any of the devs
(dev
  list
 cc'ed) can do that on behalf of ASF .

 It would be a great opportunity for  many students to
contribute
   patches
 to Hadoop and Hive and make their summer vacation fruitful.

 [1]
 http://google-melange.appspot.com/gsoc/events/google/gsoc2012

 Thanks and Regards,
 Bharath .V
 w:http://researchweb.iiit.ac.in/~bharath.v

   
  
  
  
   --
   Ing. Alexis de la Cruz Toledo.
   *Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro
 Zacatenco.
   México,
   D.F, 07360 *
   *CINVESTAV, DF.*
  
 
 
 
 
 --
 Ing. Alexis de la Cruz Toledo.
 *Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro Zacatenco.
 México,
 D.F, 07360 *
 *CINVESTAV, DF.*




-- 
Ing. Alexis de la Cruz Toledo.
*Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro Zacatenco.
México,
D.F, 07360 *
*CINVESTAV, DF.*



Re: Hive projects for Google Summer of code 2012 ?

2012-03-06 Thread bharath vissapragada
Hi Namit,

Is it possible to add Hive-1362 [1] to the list? I am interested in working
on that and I've contacted Ashutosh regarding that.

Thanks and Regards,


[1]  https://issues.apache.org/jira/browse/HIVE-1362

On Wed, Mar 7, 2012 at 6:46 AM, Namit Jain nj...@fb.com wrote:

 https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=1231927
 0



 0 was in the next line by mistake
 Look for HIVE jiras.



 On 3/6/12 7:37 PM, Alexis De La Cruz Toledo alexis...@gmail.com wrote:

 Sorry Mr. Namit Jain I can't see the jiras.
 
 Thanks
 
 El 6 de marzo de 2012 19:41, Namit Jain nj...@fb.com escribió:
 
  I filed a couple of gsoc 2012 jiras.
 
  Please submit your proposal if you are interested.
 
 
 
 https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=12319
 27
  0
 
 
 
 
  Thanks,
  -namit
 
 
  On 2/7/12 6:32 PM, Alexis De La Cruz Toledo alexis...@gmail.com
 wrote:
 
  Hi Namit Jain, I'm Alexis, I'm a master student, I'm studying in
  Cinvestav,
  df Mexico.
  I'm interesting in collaborating with Hive. I'd like to do my thesis
 work
  in something about Hive.
  I'd like to participate in Google Summer of Code 2012 too.
  
  The issues that you propose I find interesting, particularly in the
 next
  issues:
  * The topic 2. Indexed Joins.
  * [PO] Optimize Joins using Bloom Filters of this page
  https://cwiki.apache.org/confluence/display/Hive/Roadmap.
  
  Can you tell me something more about it?
  What is problem to be solved?
  What benefits we hope to gain?
  
  This because I want to raise my thesis problem.
  By another hand, can you be my mentor in Google Summer of Code, if I
 work
  with this topics?
  
  Thanks.
  
  El 5 de febrero de 2012 19:58, Namit Jain nj...@fb.com escribió:
  
   Hi Alexis/Bharath,
  
   Great to see your interests. If you looking for ideas, some things
 that
   will be very useful are:
  
   1. Removing the map-join hint completely.
 Moving all processing to runtime.
 Currently, bucketed map joins and sort merge joins are completely
   driven off hints.
 It would be very helpful to the community, and also clean up the
 code
  a
   lot.
  
   2. Indexed Joins.
 Something that would be really useful -
 If the basic infrastructure is ready, it can eventually be used to
  join
   tables
 stored outside also (say Hbase).
  
   3. Metastore understanding hierarchy.
 For eg: if a table is partitioned by ds and hr,
 A valid partition on ds does not exist. This is a very common
 usecase
   on many
 applications, and the current work-around is to have signal tables
 for
   ds
 un-necessarily complicating the process.
  
  
   If you are interested, I would be happy to provide more details.
  
  
   Thanks,
   -namit
  
  
   On 2/4/12 11:57 AM, Ashutosh Chauhan hashut...@apache.org wrote:
  
   Hi Alexis,
   
   Great to see your interest. Feel free to come up with concrete
 proposal
   and
   submit to GSoC. Its certainly heartening to see folks interested in
  making
   contributions to the Hive Project.
   
   Ashutosh
   On Sat, Feb 4, 2012 at 10:48, Alexis De La Cruz Toledo
   alexis...@gmail.comwrote:
   
Hi Ashutosh, I'm interesting in hive,
I'd like to improve the compilation process,
I have been that the plan query tree generated
by Hive can be optimized, and I'd like
to participate in Google Summer of code 2012.
What do you say?
   
Regards.
   
   
El 4 de febrero de 2012 12:29, Ashutosh Chauhan 
  hashut...@apache.org
escribió:
   
 Hey Bharath,

 Great to see your enthusiasm for Hive! I would be happy to
 mentor
  you
   for
 the project.  For the start, you can take a look at
 https://cwiki.apache.org/confluence/display/Hive/Roadmap for a
  list
   of
 open
 projects in Hive. The document is bit dated, so some of those
  projects
may
 not be relevant. But, its a good source to start with to see if
  any of
 these projects excite you.

 Hope it helps,
 Ashutosh

 On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
 bharathvissapragada1...@gmail.com wrote:

  Hey list, devs,
 
  Google summer of code, 2012 's notification [1] has been
 released
   and
  mentoring organizations can submit their proposals to Google
 for
 opensource
  projects.
 
  Any of the devs interested in mentoring students on Hive
  projects (
   any
  critical jiras etc.) ?  It would be great if any of the devs
 (dev
   list
  cc'ed) can do that on behalf of ASF .
 
  It would be a great opportunity for  many students to
 contribute
patches
  to Hadoop and Hive and make their summer vacation fruitful.
 
  [1]
  http://google-melange.appspot.com/gsoc/events/google/gsoc2012
 
  Thanks and Regards,
  Bharath .V
  w:http://researchweb.iiit.ac.in/~bharath.v
 

   
   
   
--
Ing. Alexis de la Cruz Toledo.
*Av. Instituto Politécnico 

Re: Hive projects for Google Summer of code 2012 ?

2012-03-06 Thread Namit Jain
Done


On 3/6/12 10:32 PM, bharath vissapragada
bharathvissapragada1...@gmail.com wrote:

Hi Namit,

Is it possible to add Hive-1362 [1] to the list? I am interested in
working
on that and I've contacted Ashutosh regarding that.

Thanks and Regards,


[1]  https://issues.apache.org/jira/browse/HIVE-1362

On Wed, Mar 7, 2012 at 6:46 AM, Namit Jain nj...@fb.com wrote:

 
https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=12319
27
 0



 0 was in the next line by mistake
 Look for HIVE jiras.



 On 3/6/12 7:37 PM, Alexis De La Cruz Toledo alexis...@gmail.com
wrote:

 Sorry Mr. Namit Jain I can't see the jiras.
 
 Thanks
 
 El 6 de marzo de 2012 19:41, Namit Jain nj...@fb.com escribió:
 
  I filed a couple of gsoc 2012 jiras.
 
  Please submit your proposal if you are interested.
 
 
 
 
https://issues.apache.org/jira/secure/IssueNavigator.jspa?requestId=12319
 27
  0
 
 
 
 
  Thanks,
  -namit
 
 
  On 2/7/12 6:32 PM, Alexis De La Cruz Toledo alexis...@gmail.com
 wrote:
 
  Hi Namit Jain, I'm Alexis, I'm a master student, I'm studying in
  Cinvestav,
  df Mexico.
  I'm interesting in collaborating with Hive. I'd like to do my thesis
 work
  in something about Hive.
  I'd like to participate in Google Summer of Code 2012 too.
  
  The issues that you propose I find interesting, particularly in the
 next
  issues:
  * The topic 2. Indexed Joins.
  * [PO] Optimize Joins using Bloom Filters of this page
  https://cwiki.apache.org/confluence/display/Hive/Roadmap.
  
  Can you tell me something more about it?
  What is problem to be solved?
  What benefits we hope to gain?
  
  This because I want to raise my thesis problem.
  By another hand, can you be my mentor in Google Summer of Code, if I
 work
  with this topics?
  
  Thanks.
  
  El 5 de febrero de 2012 19:58, Namit Jain nj...@fb.com escribió:
  
   Hi Alexis/Bharath,
  
   Great to see your interests. If you looking for ideas, some things
 that
   will be very useful are:
  
   1. Removing the map-join hint completely.
 Moving all processing to runtime.
 Currently, bucketed map joins and sort merge joins are
completely
   driven off hints.
 It would be very helpful to the community, and also clean up the
 code
  a
   lot.
  
   2. Indexed Joins.
 Something that would be really useful -
 If the basic infrastructure is ready, it can eventually be used
to
  join
   tables
 stored outside also (say Hbase).
  
   3. Metastore understanding hierarchy.
 For eg: if a table is partitioned by ds and hr,
 A valid partition on ds does not exist. This is a very common
 usecase
   on many
 applications, and the current work-around is to have signal
tables
 for
   ds
 un-necessarily complicating the process.
  
  
   If you are interested, I would be happy to provide more details.
  
  
   Thanks,
   -namit
  
  
   On 2/4/12 11:57 AM, Ashutosh Chauhan hashut...@apache.org
wrote:
  
   Hi Alexis,
   
   Great to see your interest. Feel free to come up with concrete
 proposal
   and
   submit to GSoC. Its certainly heartening to see folks interested
in
  making
   contributions to the Hive Project.
   
   Ashutosh
   On Sat, Feb 4, 2012 at 10:48, Alexis De La Cruz Toledo
   alexis...@gmail.comwrote:
   
Hi Ashutosh, I'm interesting in hive,
I'd like to improve the compilation process,
I have been that the plan query tree generated
by Hive can be optimized, and I'd like
to participate in Google Summer of code 2012.
What do you say?
   
Regards.
   
   
El 4 de febrero de 2012 12:29, Ashutosh Chauhan 
  hashut...@apache.org
escribió:
   
 Hey Bharath,

 Great to see your enthusiasm for Hive! I would be happy to
 mentor
  you
   for
 the project.  For the start, you can take a look at
 https://cwiki.apache.org/confluence/display/Hive/Roadmap for
a
  list
   of
 open
 projects in Hive. The document is bit dated, so some of those
  projects
may
 not be relevant. But, its a good source to start with to see
if
  any of
 these projects excite you.

 Hope it helps,
 Ashutosh

 On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
 bharathvissapragada1...@gmail.com wrote:

  Hey list, devs,
 
  Google summer of code, 2012 's notification [1] has been
 released
   and
  mentoring organizations can submit their proposals to
Google
 for
 opensource
  projects.
 
  Any of the devs interested in mentoring students on Hive
  projects (
   any
  critical jiras etc.) ?  It would be great if any of the
devs
 (dev
   list
  cc'ed) can do that on behalf of ASF .
 
  It would be a great opportunity for  many students to
 contribute
patches
  to Hadoop and Hive and make their summer vacation fruitful.
 
  [1]
  http://google-melange.appspot.com/gsoc/events/google/gsoc2012
 
  Thanks and Regards,
  Bharath .V
  w:http://researchweb.iiit.ac.in/~bharath.v
 

Re: Hive projects for Google Summer of code 2012 ?

2012-02-05 Thread Namit Jain
Hi Alexis/Bharath,

Great to see your interests. If you looking for ideas, some things that
will be very useful are:

1. Removing the map-join hint completely.
   Moving all processing to runtime.
   Currently, bucketed map joins and sort merge joins are completely
driven off hints.
   It would be very helpful to the community, and also clean up the code a
lot.

2. Indexed Joins. 
   Something that would be really useful -
   If the basic infrastructure is ready, it can eventually be used to join
tables
   stored outside also (say Hbase).
   
3. Metastore understanding hierarchy.
   For eg: if a table is partitioned by ds and hr,
   A valid partition on ds does not exist. This is a very common usecase
on many
   applications, and the current work-around is to have signal tables for
ds
   un-necessarily complicating the process.


If you are interested, I would be happy to provide more details.


Thanks,
-namit


On 2/4/12 11:57 AM, Ashutosh Chauhan hashut...@apache.org wrote:

Hi Alexis,

Great to see your interest. Feel free to come up with concrete proposal
and
submit to GSoC. Its certainly heartening to see folks interested in making
contributions to the Hive Project.

Ashutosh
On Sat, Feb 4, 2012 at 10:48, Alexis De La Cruz Toledo
alexis...@gmail.comwrote:

 Hi Ashutosh, I'm interesting in hive,
 I'd like to improve the compilation process,
 I have been that the plan query tree generated
 by Hive can be optimized, and I'd like
 to participate in Google Summer of code 2012.
 What do you say?

 Regards.


 El 4 de febrero de 2012 12:29, Ashutosh Chauhan hashut...@apache.org
 escribió:

  Hey Bharath,
 
  Great to see your enthusiasm for Hive! I would be happy to mentor you
for
  the project.  For the start, you can take a look at
  https://cwiki.apache.org/confluence/display/Hive/Roadmap for a list of
  open
  projects in Hive. The document is bit dated, so some of those projects
 may
  not be relevant. But, its a good source to start with to see if any of
  these projects excite you.
 
  Hope it helps,
  Ashutosh
 
  On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
  bharathvissapragada1...@gmail.com wrote:
 
   Hey list, devs,
  
   Google summer of code, 2012 's notification [1] has been released
and
   mentoring organizations can submit their proposals to Google for
  opensource
   projects.
  
   Any of the devs interested in mentoring students on Hive projects (
any
   critical jiras etc.) ?  It would be great if any of the devs (dev
list
   cc'ed) can do that on behalf of ASF .
  
   It would be a great opportunity for  many students to contribute
 patches
   to Hadoop and Hive and make their summer vacation fruitful.
  
   [1] http://google-melange.appspot.com/gsoc/events/google/gsoc2012
  
   Thanks and Regards,
   Bharath .V
   w:http://researchweb.iiit.ac.in/~bharath.v
  
 



 --
 Ing. Alexis de la Cruz Toledo.
 *Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro Zacatenco.
 México,
 D.F, 07360 *
 *CINVESTAV, DF.*




Re: Hive projects for Google Summer of code 2012 ?

2012-02-04 Thread Ashutosh Chauhan
Hey Bharath,

Great to see your enthusiasm for Hive! I would be happy to mentor you for
the project.  For the start, you can take a look at
https://cwiki.apache.org/confluence/display/Hive/Roadmap for a list of open
projects in Hive. The document is bit dated, so some of those projects may
not be relevant. But, its a good source to start with to see if any of
these projects excite you.

Hope it helps,
Ashutosh

On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
bharathvissapragada1...@gmail.com wrote:

 Hey list, devs,

 Google summer of code, 2012 's notification [1] has been released and
 mentoring organizations can submit their proposals to Google for opensource
 projects.

 Any of the devs interested in mentoring students on Hive projects ( any
 critical jiras etc.) ?  It would be great if any of the devs (dev list
 cc'ed) can do that on behalf of ASF .

 It would be a great opportunity for  many students to contribute patches
 to Hadoop and Hive and make their summer vacation fruitful.

 [1] http://google-melange.appspot.com/gsoc/events/google/gsoc2012

 Thanks and Regards,
 Bharath .V
 w:http://researchweb.iiit.ac.in/~bharath.v



Re: Hive projects for Google Summer of code 2012 ?

2012-02-04 Thread Alexis De La Cruz Toledo
Hi Ashutosh, I'm interesting in hive,
I'd like to improve the compilation process,
I have been that the plan query tree generated
by Hive can be optimized, and I'd like
to participate in Google Summer of code 2012.
What do you say?

Regards.


El 4 de febrero de 2012 12:29, Ashutosh Chauhan hashut...@apache.orgescribió:

 Hey Bharath,

 Great to see your enthusiasm for Hive! I would be happy to mentor you for
 the project.  For the start, you can take a look at
 https://cwiki.apache.org/confluence/display/Hive/Roadmap for a list of
 open
 projects in Hive. The document is bit dated, so some of those projects may
 not be relevant. But, its a good source to start with to see if any of
 these projects excite you.

 Hope it helps,
 Ashutosh

 On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
 bharathvissapragada1...@gmail.com wrote:

  Hey list, devs,
 
  Google summer of code, 2012 's notification [1] has been released and
  mentoring organizations can submit their proposals to Google for
 opensource
  projects.
 
  Any of the devs interested in mentoring students on Hive projects ( any
  critical jiras etc.) ?  It would be great if any of the devs (dev list
  cc'ed) can do that on behalf of ASF .
 
  It would be a great opportunity for  many students to contribute patches
  to Hadoop and Hive and make their summer vacation fruitful.
 
  [1] http://google-melange.appspot.com/gsoc/events/google/gsoc2012
 
  Thanks and Regards,
  Bharath .V
  w:http://researchweb.iiit.ac.in/~bharath.v
 




-- 
Ing. Alexis de la Cruz Toledo.
*Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro Zacatenco. México,
D.F, 07360 *
*CINVESTAV, DF.*


Re: Hive projects for Google Summer of code 2012 ?

2012-02-04 Thread Ashutosh Chauhan
Hi Alexis,

Great to see your interest. Feel free to come up with concrete proposal and
submit to GSoC. Its certainly heartening to see folks interested in making
contributions to the Hive Project.

Ashutosh
On Sat, Feb 4, 2012 at 10:48, Alexis De La Cruz Toledo
alexis...@gmail.comwrote:

 Hi Ashutosh, I'm interesting in hive,
 I'd like to improve the compilation process,
 I have been that the plan query tree generated
 by Hive can be optimized, and I'd like
 to participate in Google Summer of code 2012.
 What do you say?

 Regards.


 El 4 de febrero de 2012 12:29, Ashutosh Chauhan hashut...@apache.org
 escribió:

  Hey Bharath,
 
  Great to see your enthusiasm for Hive! I would be happy to mentor you for
  the project.  For the start, you can take a look at
  https://cwiki.apache.org/confluence/display/Hive/Roadmap for a list of
  open
  projects in Hive. The document is bit dated, so some of those projects
 may
  not be relevant. But, its a good source to start with to see if any of
  these projects excite you.
 
  Hope it helps,
  Ashutosh
 
  On Sat, Feb 4, 2012 at 08:47, bharath vissapragada 
  bharathvissapragada1...@gmail.com wrote:
 
   Hey list, devs,
  
   Google summer of code, 2012 's notification [1] has been released and
   mentoring organizations can submit their proposals to Google for
  opensource
   projects.
  
   Any of the devs interested in mentoring students on Hive projects ( any
   critical jiras etc.) ?  It would be great if any of the devs (dev list
   cc'ed) can do that on behalf of ASF .
  
   It would be a great opportunity for  many students to contribute
 patches
   to Hadoop and Hive and make their summer vacation fruitful.
  
   [1] http://google-melange.appspot.com/gsoc/events/google/gsoc2012
  
   Thanks and Regards,
   Bharath .V
   w:http://researchweb.iiit.ac.in/~bharath.v
  
 



 --
 Ing. Alexis de la Cruz Toledo.
 *Av. Instituto Politécnico Nacional No. 2508 Col. San Pedro Zacatenco.
 México,
 D.F, 07360 *
 *CINVESTAV, DF.*