[ 
https://issues.apache.org/jira/browse/CASSANDRA-4912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13492993#comment-13492993
 ] 

Michael Kjellman edited comment on CASSANDRA-4912 at 11/8/12 6:21 AM:
----------------------------------------------------------------------

so obviously this is due to the handling in the close() function in 
BulkRecordWriter. So far i've been unable to get BOF to work in Local mode thru 
eclipse with MultipleOutput. ConfigHelper is happy on the first check of the 
job config, but when the reducer is instantiated the column family output names 
don't seem to be set. close() is pretty simple in BulkRecordWriter though, 
looks like the sstable is first closed, and then streamed to the nodes. I'm 
guessing that either close() is only being called on one of the sstables/named 
outputs (i do see in a fully distributed cluster the sstables get created for 
multiple column families).
                
      was (Author: mkjellman):
    so obviously this is due to the handling in the close() function in 
BulkRecordWriter. So far i've been unable to get BOF to work in Local mode thru 
eclipse with multipleoutput. ConfigHelper is happy on the first check, but when 
the reducer is created the column family output names don't seem to be set. 
close() is pretty simple, looks like the sstable is first closed, and then 
streamed to the nodes. I'm guessing that either close is only being close on 
one of the sstables (i do see in a fully distributed cluster the sstables get 
created for multiple column families) but maybe we don't close it thus it never 
streams to the nodes?
                  
> BulkOutputFormat should support Hadoop MultipleOutput
> -----------------------------------------------------
>
>                 Key: CASSANDRA-4912
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4912
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Hadoop
>    Affects Versions: 1.2.0 beta 1
>            Reporter: Michael Kjellman
>
> Much like CASSANDRA-4208 BOF should support outputting to Multiple Column 
> Families. The current approach takken in the patch for COF results in only 
> one stream being sent.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to