[ 
https://issues.apache.org/jira/browse/CASSANDRA-747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vijay closed CASSANDRA-747.
---------------------------

    Assignee: Vijay

Closing this and merging it into #342.

> Need an additional method for Hadoop Range Queries.
> ---------------------------------------------------
>
>                 Key: CASSANDRA-747
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-747
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Core
>    Affects Versions: 0.6
>         Environment: Need to add an additional method for Range queries from 
> Hadoop.
>            Reporter: Vijay
>            Assignee: Vijay
>            Priority: Minor
>
> Hadoop integration might need the following:
> 1) An API to return the list of splits, given the number of splits.
> Using these tokens we can spawn an equal number of MR jobs (with a
> configuration setting in the MR job, chosen according to the complexity of
> the processing, that says how many map tasks to run per partition) and
> spawn those processes.
> 2) A "start token to stream" API.
> Input will be Range(String startKey, Token start, Token finish, int 
> limit).... return will be 
>     If startKey is empty we will use the start token as the starting point
> for the stream; otherwise we will use startKey to specify the key to start
> with. Does that make sense?
> So each MR job will get its range of data from Cassandra and process it; it
> can also stream the data and doesn't need to fetch all of it.
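
To make item 1 concrete, here is a minimal sketch of how a token range could be cut into a requested number of contiguous sub-ranges. It is only an illustration: TokenSplits, TokenRange and split are hypothetical names, not Cassandra APIs. It assumes tokens can be treated as non-negative integers (as with a hash-based partitioner), that the range does not wrap around the ring, and it divides the token space evenly, which is not the same as dividing the data evenly.

```java
import java.math.BigInteger;
import java.util.ArrayList;
import java.util.List;

public class TokenSplits {

    /** Hypothetical token range (start, end]; not a Cassandra class. */
    public static final class TokenRange {
        public final BigInteger start;
        public final BigInteger end;

        public TokenRange(BigInteger start, BigInteger end) {
            this.start = start;
            this.end = end;
        }

        @Override
        public String toString() {
            return "(" + start + ", " + end + "]";
        }
    }

    /**
     * Cuts (start, end] into numSplits contiguous sub-ranges of nearly equal
     * width. Assumes start < end, i.e. no wrap-around on the ring.
     */
    public static List<TokenRange> split(BigInteger start, BigInteger end, int numSplits) {
        if (numSplits < 1 || start.compareTo(end) >= 0) {
            throw new IllegalArgumentException("need numSplits >= 1 and start < end");
        }
        BigInteger width = end.subtract(start);
        BigInteger n = BigInteger.valueOf(numSplits);
        List<TokenRange> splits = new ArrayList<>(numSplits);
        BigInteger prev = start;
        for (int i = 1; i <= numSplits; i++) {
            // The last boundary is pinned to 'end' so no tokens are dropped by rounding.
            BigInteger next = (i == numSplits)
                    ? end
                    : start.add(width.multiply(BigInteger.valueOf(i)).divide(n));
            splits.add(new TokenRange(prev, next));
            prev = next;
        }
        return splits;
    }

    public static void main(String[] args) {
        // One sub-range per line; each could be handed to one map task.
        for (TokenRange r : split(BigInteger.ZERO, BigInteger.valueOf(100), 4)) {
            System.out.println(r);
        }
    }
}
```

Each returned sub-range would then correspond to the (start, finish) pair of the Range input described in item 2; the startKey and limit handling is left out of this sketch.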

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
