[ 
https://issues.apache.org/jira/browse/CASSANDRA-1227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bryan Tower updated CASSANDRA-1227:
-----------------------------------

    Attachment: trunk-1227.txt

This patch file includes all of the changes needed to allow the 
ColumnInputFormat and the ColumnOutputFormat to be configured independently.

> Input and Output column families should be configured independently
> -------------------------------------------------------------------
>
>                 Key: CASSANDRA-1227
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1227
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Hadoop
>    Affects Versions: 0.7
>            Reporter: Bryan Tower
>             Fix For: 0.7
>
>         Attachments: trunk-1227.txt
>
>
> I would like to use a ColumnFamilyInputFormat  and a ColumnFamilyRecordReader 
> to map a bunch of data from Cassandra to a job and then I would like to do 
> some operations on the data and in the Reducer write out some summary of the 
> work that I have done.  Both the ColumnFamilyInputFormat and the 
> ColumnFamilyOutputFormat read the column family from the same configuration 
> property in the job configuration object (they both use the 
> ConfigHelper.COLUMNFAMILY_CONFIG property).  This means that I can not read 
> from one Cassandra column family and write out to different one in the same 
> job with the existing code.
> I changed the ColumnFamilyOutputFormat to read from 
> "cassandra.output.columnfamily" instead of the "cassandra.input.columnfamily" 
> that it was using before.
> I changed the COLUMNFAMILY_CONFIG property and related methods to include the 
> word input.  I also added corresponding Output versions of each of the 
> relevant properties that should be configured for the 
> ColumnFamilyOutputFormat.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to