[ 
https://issues.apache.org/jira/browse/CASSANDRA-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13670577#comment-13670577
 ] 

Alex Liu edited comment on CASSANDRA-5234 at 6/17/13 4:30 PM:
--------------------------------------------------------------

pull @ https://github.com/alexliu68/cassandra/pull/3

If you use CassandraStorage for a cql3 table, you will get composite columns in
the "columns" bag.

Use CqlStorage for any cql3 table.
{code}
cassandra://[username:password@]<keyspace>/<columnfamily>[?[page_size=<size>]
[&columns=<col1,col2>][&output_query=<prepared_statement>]
[&where_clause=<clause>][&split_size=<size>][&partitioner=<partitioner>]]
{code}

where
  page_size is the number of cql3 rows per page (optional, default 1000)

  columns is the list of column names for the cql3 select query (optional)

  where_clause is the user-defined where clause on the indexed column
(optional)

  split_size is the number of C* rows per split, which can be used to tune the
number of mappers

  output_query is the prepared query for inserting data into a cql3 table
(replace = with @ and ? with #, because Pig can't take = and ? as URL
parameter values)
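
As an illustration of the URL template and the output_query substitution described above, here is a minimal Python sketch. The helper names (encode_output_query, build_location) are hypothetical and not part of the patch; the sketch leaves out the optional username:password@ part and assumes no escaping is needed beyond the = to @ and ? to # replacement.

```python
from typing import Optional, Sequence


def encode_output_query(query: str) -> str:
    """Swap '=' for '@' and '?' for '#', as the comment describes."""
    return query.replace("=", "@").replace("?", "#")


def build_location(
    keyspace: str,
    column_family: str,
    page_size: Optional[int] = None,
    columns: Optional[Sequence[str]] = None,
    output_query: Optional[str] = None,
    where_clause: Optional[str] = None,
    split_size: Optional[int] = None,
    partitioner: Optional[str] = None,
) -> str:
    """Assemble a location string following the URL template (hypothetical helper)."""
    params = []
    if page_size is not None:
        params.append(f"page_size={page_size}")
    if columns:
        params.append("columns=" + ",".join(columns))
    if output_query is not None:
        # Only output_query is rewritten; other values are passed through as-is.
        params.append("output_query=" + encode_output_query(output_query))
    if where_clause is not None:
        params.append("where_clause=" + where_clause)
    if split_size is not None:
        params.append(f"split_size={split_size}")
    if partitioner is not None:
        params.append("partitioner=" + partitioner)
    url = f"cassandra://{keyspace}/{column_family}"
    return url + ("?" + "&".join(params) if params else "")
```

Every parameter is optional, so build_location("ks", "cf") yields just the bare keyspace and column family location.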

Output rows are in the following format:
{code}
(((name, value), (name, value)), (value ... value), (value...value))
{code}

where the name and value tuples are key name and value pairs.


The input schema is ((name, value), (name, value), (name, value)), with the
key columns first.
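
To make the two tuple shapes concrete, here is a small Python sketch that models them as nested tuples. The function names are hypothetical, and reading each trailing (value ... value) tuple as one group of values is an interpretation of the format string above, not something the patch confirms.

```python
def make_output_row(keys, *value_groups):
    """Build (((name, value), ...), (value, ...), ...) from key pairs and value groups."""
    key_pairs = tuple((name, value) for name, value in keys)
    return (key_pairs,) + tuple(tuple(group) for group in value_groups)


def split_input_row(row, key_count):
    """Split an input row of (name, value) pairs into key pairs and the remaining pairs.

    Assumes the first key_count pairs are the keys, since keys come first.
    """
    return row[:key_count], row[key_count:]


# Output shape: one tuple of key pairs, followed by one tuple of values.
out_row = make_output_row([("id", 1), ("ts", 2)], [10, "x"])

# Input shape: a flat tuple of (name, value) pairs with the keys first.
keys, rest = split_input_row((("id", 1), ("ts", 2), ("v", 3)), key_count=2)
```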
                
      was (Author: alexliu68):
    pull @ https://github.com/alexliu68/cassandra/pull/3

Use CassandraStorage for any cql3 tables, you will have composite columns in 
"columns" bag

Use CQL3Storage for any cql3 table.
{code}
cassandra://[username:password@]<keyspace>/<columnfamily>[?[page_size=<size>]
[&columns=<col1,col2>][&output_query=<prepared_statement>]
[&where_clause=<clause>][&split_size=<size>][&partitioner=<partitioner>]]
{code}

where 
  page_size is the number of cql3 rows per page (the default is 1000, it's 
optional)

  columns is the column names for the cql3 select query, it's optional
 
  where_clause is the user defined where clause on the indexed column, it's 
optional

  split_size is the number of C* rows per split which can be used to tune the 
number of mappers

  output_query is the prepared query for inserting data to cql3 table (replace 
the = by @ and ? by #,
      because Pig can't take = and ? as url parameter values)

Output row are in the following format
{code}
(((name, value), (name, value)), (value ... value), (value...value))
{code}

where the name and value tuples are key name and value pairs.


The input schema: ((name, value), (name, value), (name, value)) where keys are 
in the front.
                  
> Table created through CQL3 are not accessble to Pig 0.10
> --------------------------------------------------------
>
>                 Key: CASSANDRA-5234
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5234
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Hadoop
>    Affects Versions: 1.2.1
>         Environment: Red hat linux 5
>            Reporter: Shamim Ahmed
>             Fix For: 1.2.6
>
>         Attachments: 5234-1.2-patch.txt, 5234.tx
>
>
> Hi,
>   I hit a bug when creating a table through CQL3 and trying to load data
> through Pig 0.10, as follows:
> java.lang.RuntimeException: Column family 'abc' not found in keyspace 'XYZ'
>       at org.apache.cassandra.hadoop.pig.CassandraStorage.initSchema(CassandraStorage.java:1112)
>       at org.apache.cassandra.hadoop.pig.CassandraStorage.setLocation(CassandraStorage.java:615)
> This affects everything from simple tables to tables with a compound key.

