Hi Dan,
I am currently working on implementing GORA-267 [0] Cassandra composite
primary key support within the context of the gora-cassandra module.
I agree with you that the physical mapping you see is not easy for
unpacking and parsing within Spark. We also permit use legacy super columns
within gora-cassandra which we should emigrate from.
I'll look into the GoraCassandra codebase soon enough and provide more
detail on what you/we would need to meet your requirements.
Thanks
Lewis
[0] https://issues.apache.org/jira/browse/GORA-267
On Fri, Dec 5, 2014 at 5:56 AM, Dan Hanley dan.han...@activestandards.com
wrote:
Hi
I’m using Gora (0.3) to pipe Nutch (2.2.1) data into Cassandra, eventually
I’m hoping to analyse it with Spark.
The Gora-Cassandra mapping puts everything in three legacy style Cassandra
tables, f, p and sc all created roughly like:
CREATE TABLE p (
key blob,
column1 blob,
value blob,
PRIMARY KEY ((key), column1)
) WITH COMPACT STORAGE AND….
This is not easy to parse as an RDD in Spark.
It would be easier if e.g. the mapping:
field name=title family=p qualifier=t/
field name=text family=p qualifier=c/
field name=signature family=p qualifier=sig/
field name=prevSignature family=p qualifier=psig/
Produced a table like:
CREATE TABLE p (
key blob,
title blob,
text blob,
signature blob,
prevSignature blob
PRIMARY KEY (key)
) ….
So my question – is this something that is possible in more recent
versions of Gora? Or if not would it be something I could reasonably expect
to develop myself (I have no familiarity with the Gora codebase… any
pointers would be welcome)
Best Regards
Dan
*Dan Hanley*
CTO, ActiveStandards
Direct: +44 (0)207 019 4718
Switchboard: +44 (0)20 7019 4700
dan.han...@activestandards.com
*www.activestandards.com http://www.activestandards.com*
--
*Driving Digital Transformation: *
ActiveStandards launches new enterprise digital governance solutions
https://activestandards.com/about-us/newsroom/driving-digital-transformation-activestandards-launches-new-enterprise-digital
--
ActiveStandards, Studio 1001 Highgate Studios, 53-79 Highgate Road,
London, NW5 1TL
Registered in England: No. 3592714, VAT No. 625574723
--
*Lewis*