[
https://issues.apache.org/jira/browse/CASSANDRA-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13670577#comment-13670577
]
Alex Liu edited comment on CASSANDRA-5234 at 6/17/13 4:30 PM:
--------------------------------------------------------------
pull @ https://github.com/alexliu68/cassandra/pull/3
Use CassandraStorage for any cql3 table; you will get the composite columns in
the "columns" bag.
Use CqlStorage for any cql3 table.
{code}
cassandra://[username:password@]<keyspace>/<columnfamily>[?[page_size=<size>]
[&columns=<col1,col2>][&output_query=<prepared_statement>]
[&where_clause=<clause>][&split_size=<size>][&partitioner=<partitioner>]]
{code}
where
page_size is the number of cql3 rows per page (optional; the default is 1000)
columns is the list of column names for the cql3 select query (optional)
where_clause is the user-defined where clause on the indexed column (optional)
split_size is the number of C* rows per split, which can be used to tune the
number of mappers
output_query is the prepared query for inserting data into the cql3 table
(replace = with @ and ? with #, because Pig can't take = and ? as URL
parameter values)
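A minimal sketch of how such a location string could be assembled, including the = to @ and ? to # substitution for output_query. The helper names (encode_output_query, build_location) are hypothetical and not part of the patch; where_clause is passed through unchanged because only the output_query substitution is described above.

```python
def encode_output_query(statement):
    """Replace '=' with '@' and '?' with '#', as described for output_query."""
    return statement.replace("=", "@").replace("?", "#")


def build_location(keyspace, table, page_size=None, columns=None,
                   output_query=None, where_clause=None, split_size=None,
                   partitioner=None, username=None, password=None):
    """Assemble a location string following the URL template above.

    Hypothetical helper for illustration only; it does no escaping beyond
    the output_query substitution.
    """
    auth = f"{username}:{password}@" if username and password else ""
    params = []
    if page_size is not None:
        params.append(f"page_size={page_size}")
    if columns:
        params.append("columns=" + ",".join(columns))
    if output_query:
        params.append("output_query=" + encode_output_query(output_query))
    if where_clause:
        # Passed through as-is (assumption: no substitution is needed here).
        params.append(f"where_clause={where_clause}")
    if split_size is not None:
        params.append(f"split_size={split_size}")
    if partitioner:
        params.append(f"partitioner={partitioner}")
    query = "?" + "&".join(params) if params else ""
    return f"cassandra://{auth}{keyspace}/{table}{query}"
```

For example, build_location("ks", "t", page_size=500, columns=["a", "b"]) gives cassandra://ks/t?page_size=500&columns=a,b (the keyspace and table names here are placeholders).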
Output rows are in the following format:
{code}
(((name, value), (name, value)), (value ... value), (value...value))
{code}
where the (name, value) tuples are key name and value pairs.
The input schema is ((name, value), (name, value), (name, value)), with the
keys first.
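To make the two shapes concrete, here is a rough sketch that models them as nested Python tuples. The function names are hypothetical, and the value tuples are treated as opaque because their meaning is not spelled out above.

```python
def make_output_row(key_pairs, value_tuples):
    """Build (((name, value), ...), (value, ...), (value, ...)).

    key_pairs: iterable of (key_name, key_value) pairs.
    value_tuples: iterable of value sequences (treated as opaque here).
    """
    keys = tuple((name, value) for name, value in key_pairs)
    return (keys,) + tuple(tuple(values) for values in value_tuples)


def make_input_row(row, key_names):
    """Build ((name, value), ...) with the key columns placed first.

    row: dict mapping column name to value.
    key_names: key column names, in key order.
    """
    keys = [(name, row[name]) for name in key_names]
    rest = [(name, value) for name, value in row.items()
            if name not in key_names]
    return tuple(keys + rest)
```

With placeholder columns, make_input_row({"v": 9, "id": 1}, ["id"]) yields (("id", 1), ("v", 9)), and make_output_row([("id", 1)], [[10, 20]]) yields ((("id", 1),), (10, 20)).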
was (Author: alexliu68):
pull @ https://github.com/alexliu68/cassandra/pull/3
Use CassandraStorage for any cql3 tables, you will have composite columns in
"columns" bag
Use CQL3Storage for any cql3 table.
{code}
cassandra://[username:password@]<keyspace>/<columnfamily>[?[page_size=<size>]
[&columns=<col1,col2>][&output_query=<prepared_statement>]
[&where_clause=<clause>][&split_size=<size>][&partitioner=<partitioner>]]
{code}
where
page_size is the number of cql3 rows per page (the default is 1000, it's
optional)
columns is the column names for the cql3 select query, it's optional
where_clause is the user defined where clause on the indexed column, it's
optional
split_size is the number of C* rows per split which can be used to tune the
number of mappers
output_query is the prepared query for inserting data to cql3 table (replace
the = by @ and ? by #,
because Pig can't take = and ? as url parameter values)
Output row are in the following format
{code}
(((name, value), (name, value)), (value ... value), (value...value))
{code}
where the name and value tuples are key name and value pairs.
The input schema: ((name, value), (name, value), (name, value)) where keys are
in the front.
> Table created through CQL3 are not accessble to Pig 0.10
> --------------------------------------------------------
>
> Key: CASSANDRA-5234
> URL: https://issues.apache.org/jira/browse/CASSANDRA-5234
> Project: Cassandra
> Issue Type: Bug
> Components: Hadoop
> Affects Versions: 1.2.1
> Environment: Red hat linux 5
> Reporter: Shamim Ahmed
> Fix For: 1.2.6
>
> Attachments: 5234-1.2-patch.txt, 5234.tx
>
>
> Hi,
> I hit a bug when creating a table through CQL3 and trying to load its data
> through Pig 0.10, as follows:
> java.lang.RuntimeException: Column family 'abc' not found in keyspace 'XYZ'
> at org.apache.cassandra.hadoop.pig.CassandraStorage.initSchema(CassandraStorage.java:1112)
> at org.apache.cassandra.hadoop.pig.CassandraStorage.setLocation(CassandraStorage.java:615)
> This affects everything from simple tables to tables with compound keys.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira