Kudu JIRA has been moved to issues.apache.org/jira/browse/KUDU

2016-02-24 Thread Mike Percy
Hi everyone,
We have moved the Kudu JIRA to Apache Software Foundation (ASF)
infrastructure. You can now find all of the Kudu tickets migrated to
https://issues.apache.org/jira/browse/KUDU

The previous Kudu JIRA project on issues.cloudera.org is now retired and
has been marked READ ONLY. Going forward, please use the Kudu ASF JIRA
instance exclusively.

We set up some mappings for user ids -- so if we were able to find your
existing account at issues.apache.org then we linked it up and all of your
comments and assignments should have been carried over to that account.

If we couldn't find a userid mapping for you then, an account was
automatically created with the userid you were using on issues.cloudera.org.
In that case, you will need to go through the password reset process to get
access access to your account, since passwords were not migrated over. Here
is the link for that:
https://issues.apache.org/jira/secure/ForgotLoginDetails.jspa

Please let me know if you have any questions or if you see anything that
was missed during the migration.

Thanks!
Mike


Re: Spark on Kudu

2016-02-24 Thread Jean-Daniel Cryans
The DStream stuff isn't there at all. I'm not sure if it's needed either.

The kuduRDD is just leveraging the MR input format, ideally we'd use scans
directly.

The SparkSQL stuff is there but it doesn't do any sort of pushdown. It's
really basic.

The goal was to provide something for others to contribute to. We have some
basic unit tests that others can easily extend. None of us on the team are
Spark experts, but we'd be really happy to assist one improve the
kudu-spark code.

J-D

On Wed, Feb 24, 2016 at 3:41 PM, Benjamin Kim  wrote:

> J-D,
>
> It looks like it fulfills most of the basic requirements (kudu RDD, kudu
> DStream) in KUDU-1214. Am I right? Besides shoring up more Spark SQL
> functionality (Dataframes) and doing the documentation, what more needs to
> be done? Optimizations?
>
> I believe that it’s a good place to start using Spark with Kudu and
> compare it to HBase with Spark (not clean).
>
> Thanks,
> Ben
>
>
> On Feb 24, 2016, at 3:10 PM, Jean-Daniel Cryans 
> wrote:
>
> AFAIK no one is working on it, but we did manage to get this in for 0.7.0:
> https://issues.cloudera.org/browse/KUDU-1321
>
> It's a really simple wrapper, and yes you can use SparkSQL on Kudu, but it
> will require a lot more work to make it fast/useful.
>
> Hope this helps,
>
> J-D
>
> On Wed, Feb 24, 2016 at 3:08 PM, Benjamin Kim  wrote:
>
>> I see this KUDU-1214  targeted
>> for 0.8.0, but I see no progress on it. When this is complete, will this
>> mean that Spark will be able to work with Kudu both programmatically and as
>> a client via Spark SQL? Or is there more work that needs to be done on the
>> Spark side for it to work?
>>
>> Just curious.
>>
>> Cheers,
>> Ben
>>
>>
>
>


Re: Spark on Kudu

2016-02-24 Thread Benjamin Kim
J-D,

It looks like it fulfills most of the basic requirements (kudu RDD, kudu 
DStream) in KUDU-1214. Am I right? Besides shoring up more Spark SQL 
functionality (Dataframes) and doing the documentation, what more needs to be 
done? Optimizations?

I believe that it’s a good place to start using Spark with Kudu and compare it 
to HBase with Spark (not clean).

Thanks,
Ben


> On Feb 24, 2016, at 3:10 PM, Jean-Daniel Cryans  wrote:
> 
> AFAIK no one is working on it, but we did manage to get this in for 0.7.0: 
> https://issues.cloudera.org/browse/KUDU-1321 
> 
> 
> It's a really simple wrapper, and yes you can use SparkSQL on Kudu, but it 
> will require a lot more work to make it fast/useful.
> 
> Hope this helps,
> 
> J-D
> 
> On Wed, Feb 24, 2016 at 3:08 PM, Benjamin Kim  > wrote:
> I see this KUDU-1214  targeted 
> for 0.8.0, but I see no progress on it. When this is complete, will this mean 
> that Spark will be able to work with Kudu both programmatically and as a 
> client via Spark SQL? Or is there more work that needs to be done on the 
> Spark side for it to work?
> 
> Just curious.
> 
> Cheers,
> Ben
> 
> 



Re: Spark on Kudu

2016-02-24 Thread Jean-Daniel Cryans
AFAIK no one is working on it, but we did manage to get this in for 0.7.0:
https://issues.cloudera.org/browse/KUDU-1321

It's a really simple wrapper, and yes you can use SparkSQL on Kudu, but it
will require a lot more work to make it fast/useful.

Hope this helps,

J-D

On Wed, Feb 24, 2016 at 3:08 PM, Benjamin Kim  wrote:

> I see this KUDU-1214  targeted
> for 0.8.0, but I see no progress on it. When this is complete, will this
> mean that Spark will be able to work with Kudu both programmatically and as
> a client via Spark SQL? Or is there more work that needs to be done on the
> Spark side for it to work?
>
> Just curious.
>
> Cheers,
> Ben
>
>


Spark on Kudu

2016-02-24 Thread Benjamin Kim
I see this KUDU-1214  targeted 
for 0.8.0, but I see no progress on it. When this is complete, will this mean 
that Spark will be able to work with Kudu both programmatically and as a client 
via Spark SQL? Or is there more work that needs to be done on the Spark side 
for it to work?

Just curious.

Cheers,
Ben



Re: Unsubscribe

2016-02-24 Thread Todd Lipcon
Please email user-unsubscribe@

-Todd

On Wed, Feb 24, 2016 at 10:48 AM, Andrea Ferretti
 wrote:
>



-- 
Todd Lipcon
Software Engineer, Cloudera


Unsubscribe

2016-02-24 Thread Andrea Ferretti