Thanks Dave. I found that Pig 0.14 and Hadoop 2.6.0 still use Guava 11.x
which was causing issue. I replacing all of those locations with Guava 17
did not end the ordeal. Seems like Guava made some breaking changes (
https://issues.apache.org/jira/browse/HADOOP-11032) in v17. You need
version 16.0
The method
com.google.common.collect.Sets.newConcurrentHashSet()Ljava/util/Set;
should be available in guava from 15.0 on. So guava-16.0 should be fine.
It's possible guava is being picked up from somewhere else? have a
global classpath variable?
you might want to do
URL u =
-- Forwarded message --
From: Mark Lewandowski mark.e.lewandow...@gmail.com
Date: Jun 8, 2013 8:03 AM
Subject: Cassandra (1.2.5) + Pig (0.11.1) Errors with large column families
To: user@cassandra.apache.org
Cc:
I'm cur.rently trying to get Cassandra (1.2.5) and Pig (0.11.1) to
On 06/07/2013 06:02 PM, Mark Lewandowski wrote:
I'm currently trying to get Cassandra (1.2.5) and Pig (0.11.1) to play
nice together. I'm running a basic script:
rows = LOAD 'cassandra://keyspace/colfam' USING CassandraStorage();
dump rows;
This fails for my column family which has ~100,000
Yes, my cluster is working.
I didn't realize it at the time, but the StorageService link I listed is
already in 0.8.4, so yes the only file I had to patch was VersionedValue.
Not sure what was going on with the pig jars, but after more configuration
changes than I can count, I'm pretty sure
Thanks William - so you were able to get everything running correctly, right?
FWIW, we're in the process of upgrading to 0.8.4 and found that all we needed
was that first link you mentioned - the VersionedValue modification. It's
running fine on our staging cluster and we're in the process of
On Wed, Jul 6, 2011 at 2:48 PM, William Oberman ober...@civicscience.comwrote:
I have a few cassandra/hadoop/pig questions. I currently have things set
up in a test environment, and for the most part everything works. But,
before I start to roll things out to production, I wanted to check
That makes sense. The problem is I jumped directly to using pig, which is
abstracting some of the data flow from me. I guess I'll have to figure out
what it's doing under the covers, to know how to optimize/fix bottlenecks.
But for now, I'm taking this information to mean I should run datanodes
I have this working now with the following:
rows = LOAD 'cassandra://TwitterExample/User' using CassandraStorage();
*cols = FOREACH rows GENERATE FLATTEN((bag{tuple(chararray,chararray)})$1);*
users = FOREACH cols GENERATE $1;
Not sure if that operation with *cols *is correct or not, but it
On Sat, Oct 16, 2010 at 3:55 PM, Eric Lee e...@c11software.com wrote:
I have this working now with the following:
rows = LOAD 'cassandra://TwitterExample/User' using CassandraStorage();
*cols = FOREACH rows GENERATE
FLATTEN((bag{tuple(chararray,chararray)})$1);*
users = FOREACH cols
Ah nice, even better, thanks!
On Sat, Oct 16, 2010 at 2:31 PM, Brandon Williams dri...@gmail.com wrote:
On Sat, Oct 16, 2010 at 3:55 PM, Eric Lee e...@c11software.com wrote:
I have this working now with the following:
rows = LOAD 'cassandra://TwitterExample/User' using CassandraStorage();
I got one step further by cheating a bit, I just took all the Cassandra Jars
and dropped them into the Hadoop lib folder, so at least now I can run some
pig scripts over the data in Cassandra, but this is far from optimal since
it means I'd have to distribute my UDFs also to the Hadoop cluster, or
: Cassandra and Pig
I got one step further by cheating a bit, I just took all the Cassandra Jars
and dropped them into the Hadoop lib folder, so at least now I can run some
pig scripts over the data in Cassandra, but this is far from optimal since
it means I'd have to distribute my UDFs also
Message-
From: Stu Hood stu.h...@rackspace.com
Sent: Friday, August 13, 2010 2:31pm
To: user@cassandra.apache.org
Subject: Re: Cassandra and Pig
Hmm, the example code there may not have been run in distributed mode
recently, or perhaps Pig performs some magic to automatically register Jars
That error is coming from the frontend: the jars must also be on the local
classpath. Take a look at how contrib/pig/bin/pig_cassandra sets up
$PIG_CLASSPATH.
-Original Message-
From: Christian Decker decker.christ...@gmail.com
Sent: Friday, August 13, 2010 11:30am
To:
Wow, that was extremely quick, thanks Stu :-)
I'm still a bit unclear on what the pig_cassandra script does. It sets some
variables (PIG_CLASSPATH for one) and then starts the original pig binary
but injects some libraries in it (libthrift and pig-core) but strangely not
the cassandra loadfunc,
, 2010 12:16pm
To: user@cassandra.apache.org
Subject: Re: Cassandra and Pig
Wow, that was extremely quick, thanks Stu :-)
I'm still a bit unclear on what the pig_cassandra script does. It sets some
variables (PIG_CLASSPATH for one) and then starts the original pig binary
but injects some libraries
requires properties
to be set by your startup script.
-Original Message-
From: Stu Hood stu.h...@rackspace.com
Sent: Friday, August 13, 2010 2:31pm
To: user@cassandra.apache.org
Subject: Re: Cassandra and Pig
Hmm, the example code there may not have been run in distributed mode recently
18 matches
Mail list logo