[
https://issues.apache.org/jira/browse/CASSANDRA-4912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13494513#comment-13494513
]
Michael Kjellman commented on CASSANDRA-4912:
---------------------------------------------
I think also another difference in behavior between CFOF and BOF is that when a
new BulkRecordWriter(Configuration conf) is created it creates the directory
for the sstables. It calls ConfigHelper here to get the name of the column
family so it can create the directory. The only call to getOutputColumnFamily
is RangeClient in CFOF.
Normally, without MultipleOutputs the job config would include a
setOutputColumnFamily(). I don't understand what calls setOutputColumnFamily
when you add a new named MultipleOutput. I presume this is where the problem is.
> BulkOutputFormat should support Hadoop MultipleOutput
> -----------------------------------------------------
>
> Key: CASSANDRA-4912
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4912
> Project: Cassandra
> Issue Type: New Feature
> Components: Hadoop
> Affects Versions: 1.2.0 beta 1
> Reporter: Michael Kjellman
> Attachments: Example.java
>
>
> Much like CASSANDRA-4208 BOF should support outputting to Multiple Column
> Families. The current approach takken in the patch for COF results in only
> one stream being sent.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira