Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Miguel Angel Martin junquera Mon, 02 Sep 2013 04:11:01 -0700

hi:

I test this in cassandra 1.2.9 new  version and the issue still persists .


:-(




Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.mar...@brainsins.com



2013/8/30 Miguel Angel Martin junquera <mianmarjun.mailingl...@gmail.com>

> I try this:
>
> *rows = LOAD
> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
> CqlStorage();*
>
> *dump rows;*
>
> *ILLUSTRATE rows;*
>
> *describe rows;*
>
> *
> *
>
> *values2= FOREACH rows GENERATE  TOTUPLE (id) as
> (mycolumn:tuple(name,value));*
>
> *dump values2;*
>
> *describe values2;*
> *
> *
>
> But I get this results:
>
>
>
> -------------------------------------------------------------
> | rows     | id:chararray   | age:int   | title:chararray   |
> -------------------------------------------------------------
> |          | (id, 6)        | (age, 30) | (title, QA)       |
> -------------------------------------------------------------
>
> rows: {id: chararray,age: int,title: chararray}
> 2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt -
> ERROR 1031: Incompatable field schema: left is
> "tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is
> "org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"
>
>
>
>
>
> or
>
>
>
> ....
>
> *values2= FOREACH rows GENERATE  TOTUPLE (id) ;*
> *dump values2;*
> *describe values2;*
>
>
>
>
> and  the results are:
>
>
> ...
> (((id,6)))
> (((id,5)))
> values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}
>
>
>
> Aggg!!!!!
>
>
> *
> *
>
>
>
>
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.mar...@brainsins.com
>
>
>
> 2013/8/28 Miguel Angel Martin junquera <mianmarjun.mailingl...@gmail.com>
>
>> hi:
>>
>> I can not understand why the schema is  define like 
>> *"id:chararray,age:int,title:chararray"
>>  and it does not define like tuples or bag tuples,  if we have pair
>> key-values  columns*
>> *
>> *
>> *
>> *
>> *I try other time to change schema  but it does not work.*
>> *
>> *
>> *any ideas ...*
>> *
>> *
>> *perhaps, is the issue in the definition cql3 tables ?*
>> *
>> *
>> *regards*
>>
>>
>> 2013/8/28 Miguel Angel Martin junquera <mianmarjun.mailingl...@gmail.com>
>>
>>> hi all:
>>>
>>>
>>> Regards
>>>
>>> Still i can resolve this issue. .....
>>>
>>> does anybody have this issue or try to test this simple example?
>>>
>>>
>>> i am stumped I can not find a solution working.
>>>
>>> I appreciate any comment or help
>>>
>>>
>>> 2013/8/22 Miguel Angel Martin junquera <mianmarjun.mailingl...@gmail.com
>>> >
>>>
>>>> hi all:
>>>>
>>>>
>>>>
>>>>
>>>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>>>>
>>>>
>>>> I am using this sample data test:
>>>>
>>>>
>>>> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>>>>
>>>> And I load and dump data Righ with this script:
>>>>
>>>> *rows = LOAD
>>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>>> CqlStorage();*
>>>> *
>>>> *
>>>> *dump rows;*
>>>> *describe rows;*
>>>> *
>>>> *
>>>>
>>>> *resutls:
>>>>
>>>> ((id,6),(age,30),(title,QA))
>>>>
>>>> ((id,5),(age,30),(title,QA))
>>>>
>>>> rows: {id: chararray,age: int,title: chararray}
>>>>
>>>>
>>>> *
>>>>
>>>>
>>>> But i can not  get  the column values
>>>>
>>>> I try to define   another schemas in Load like I used with
>>>> cassandraStorage()
>>>>
>>>>
>>>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>>>>
>>>>
>>>> example:
>>>>
>>>> *rows = LOAD
>>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>>> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>>>>
>>>>
>>>> and I get this error:
>>>>
>>>> *2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt
>>>> - ERROR 1031: Incompatable schema: left is
>>>> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
>>>> "id:chararray,age:int,title:chararray"*
>>>>
>>>>
>>>>
>>>>
>>>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
>>>> result:
>>>>
>>>> Example:
>>>>
>>>>
>>>>    - when I flatten , I get a set of tuples like
>>>>
>>>> *(title,QA)*
>>>>
>>>> *(title,QA)*
>>>>
>>>> *2013-08-22 12:42:20,673 [main] INFO
>>>>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
>>>> input paths to process : 1*
>>>>
>>>> *A: {title: chararray}*
>>>>
>>>>
>>>>
>>>> but i can get value QA
>>>>
>>>> Sustring only works with title
>>>>
>>>>
>>>>
>>>> example:
>>>>
>>>> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
>>>> *
>>>> *
>>>> *dump B;*
>>>> *describe B;*
>>>> *
>>>> *
>>>> *
>>>> *
>>>>
>>>> *results:*
>>>> *
>>>> *
>>>>
>>>> *(tle)*
>>>> *(tle)*
>>>> *B: {chararray}*
>>>>
>>>>
>>>>
>>>>
>>>> i try, this like ERIC LEE inthe other mail  and have the same results:
>>>>
>>>>
>>>>  Anyways, what I really what is the column value, not the name. Is
>>>> there a way to do that? I listed all of the failed attempts I made below.
>>>>
>>>>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>>>>    bounds.
>>>>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0;
>>>>    but all I got back were empty tuples
>>>>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling
>>>>    me data byte array can't be casted to tuple
>>>>
>>>>
>>>> Please, I will appreciate any help
>>>>
>>>>
>>>> Regards
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>> --
>>>
>>> Miguel Angel Martín Junquera
>>> Analyst Engineer.
>>> miguelangel.mar...@brainsins.com
>>> Tel. / Fax: (+34) 91 485 56 66
>>> *http://www.brainsins.com*
>>> Smart eCommerce
>>> *Madrid*: http://goo.gl/4B5kv
>>>  *London*: http://goo.gl/uIXdv
>>>  *Barcelona*: http://goo.gl/NZslW
>>>
>>> Antes de imprimir este e-mail, piense si es necesario.
>>> La legislación española ampara el secreto de las comunicaciones. Este
>>> correo electrónico es estrictamente confidencial y va dirigido
>>> exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
>>> ni copie la transmisión y nos lo notifique cuanto antes.
>>>
>>
>>
>>
>> --
>>
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.mar...@brainsins.com
>>
>>
>

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Reply via email to