Re: Exception in Hadoop Word Count sample

2011-09-16 Thread Tharindu Mathew
The example works against the 7.0 branch, not against trunk.

JIRA created at https://issues.apache.org/jira/browse/CASSANDRA-3215

On Thu, Sep 15, 2011 at 3:58 PM, Tharindu Mathew mcclou...@gmail.comwrote:

 Now I get this,

 Any help would be greatly appreciated.

 ./bin/word_count
 11/09/15 12:28:28 INFO WordCount: output reducer type: cassandra
 11/09/15 12:28:29 INFO jvm.JvmMetrics: Initializing JVM Metrics with
 processName=JobTracker, sessionId=
 11/09/15 12:28:30 INFO mapred.JobClient: Running job: job_local_0001
 11/09/15 12:28:30 INFO mapred.MapTask: io.sort.mb = 100
 11/09/15 12:28:30 INFO mapred.MapTask: data buffer = 79691776/99614720
 11/09/15 12:28:30 INFO mapred.MapTask: record buffer = 262144/327680
 11/09/15 12:28:30 WARN mapred.LocalJobRunner: job_local_0001
 java.lang.RuntimeException: java.lang.UnsupportedOperationException: no
 local connection available
  at
 org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:132)
 at
 org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:418)
  at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:620)
 at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
  at
 org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177)
 Caused by: java.lang.UnsupportedOperationException: no local connection
 available
  at
 org.apache.cassandra.hadoop.ColumnFamilyRecordReader.getLocation(ColumnFamilyRecordReader.java:176)
 at
 org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:113)
  ... 4 more
 11/09/15 12:28:31 INFO mapred.JobClient:  map 0% reduce 0%
 11/09/15 12:28:31 INFO mapred.JobClient: Job complete: job_local_0001
 11/09/15 12:28:31 INFO mapred.JobClient: Counters: 0
 11/09/15 12:28:31 INFO jvm.JvmMetrics: Cannot initialize JVM Metrics with
 processName=JobTracker, sessionId= - already initialized
 11/09/15 12:28:32 INFO mapred.JobClient: Running job: job_local_0002
 11/09/15 12:28:32 INFO mapred.MapTask: io.sort.mb = 100
 11/09/15 12:28:32 INFO mapred.MapTask: data buffer = 79691776/99614720
 11/09/15 12:28:32 INFO mapred.MapTask: record buffer = 262144/327680
 11/09/15 12:28:32 WARN mapred.LocalJobRunner: job_local_0002
 java.lang.RuntimeException: java.lang.UnsupportedOperationException: no
 local connection available
 at
 org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:132)
  at
 org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:418)
 at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:620)
  at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
 at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177)
 Caused by: java.lang.UnsupportedOperationException: no local connection
 available
 at
 org.apache.cassandra.hadoop.ColumnFamilyRecordReader.getLocation(ColumnFamilyRecordReader.java:176)
  at
 org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:113)
 ... 4 more
 11/09/15 12:28:33 INFO mapred.JobClient:  map 0% reduce 0%
 11/09/15 12:28:33 INFO mapred.JobClient: Job complete: job_local_0002
 11/09/15 12:28:33 INFO mapred.JobClient: Counters: 0
 11/09/15 12:28:33 INFO jvm.JvmMetrics: Cannot initialize JVM Metrics with
 processName=JobTracker, sessionId= - already initialized
 11/09/15 12:28:34 INFO mapred.JobClient: Running job: job_local_0003
 11/09/15 12:28:34 INFO mapred.MapTask: io.sort.mb = 100
 11/09/15 12:28:34 INFO mapred.MapTask: data buffer = 79691776/99614720
 11/09/15 12:28:34 INFO mapred.MapTask: record buffer = 262144/327680
 11/09/15 12:28:34 WARN mapred.LocalJobRunner: job_local_0003
 java.lang.RuntimeException: java.lang.UnsupportedOperationException: no
 local connection available
  at
 org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:132)
 at
 org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:418)
  at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:620)
 at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
  at
 org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177)
 Caused by: java.lang.UnsupportedOperationException: no local connection
 available
  at
 org.apache.cassandra.hadoop.ColumnFamilyRecordReader.getLocation(ColumnFamilyRecordReader.java:176)
 at
 org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:113)
  ... 4 more
 11/09/15 12:28:35 INFO mapred.JobClient:  map 0% reduce 0%
 11/09/15 12:28:35 INFO mapred.JobClient: Job complete: job_local_0003
 11/09/15 12:28:35 INFO mapred.JobClient: Counters: 0
 11/09/15 12:28:35 INFO jvm.JvmMetrics: Cannot initialize JVM Metrics with
 processName=JobTracker, sessionId= - already initialized
 11/09/15 12:28:36 INFO mapred.JobClient: Running job: job_local_0004
 11/09/15 12:28:36 INFO mapred.MapTask: io.sort.mb = 100
 11/09/15 12:28:37 INFO 

Re: Exception in Hadoop Word Count sample

2011-09-15 Thread Tharindu Mathew
Yes. That's the problem. Thanks Jonathan.

I'm actually using trunk against a 0.7. How can I generate the distro in
trunk?

Forgive my ignorance, I'm more used to maven.

On Thu, Sep 15, 2011 at 1:08 AM, Jonathan Ellis jbel...@gmail.com wrote:

 You're using a 0.8 wordcount against a 0.7 Cassandra?

 On Wed, Sep 14, 2011 at 2:19 PM, Tharindu Mathew mcclou...@gmail.com
 wrote:
  I see $subject. Can anyone help me to rectify this?
  Stacktrace:
  Exception in thread main org.apache.thrift.TApplicationException:
 Required
  field 'replication_factor' was not found in serialized data! Struct:
  KsDef(name:wordcount,
  strategy_class:org.apache.cassandra.locator.SimpleStrategy,
  strategy_options:{replication_factor=1}, replication_factor:0,
  cf_defs:[CfDef(keyspace:wordcount, name:input_words,
 column_type:Standard,
  comparator_type:AsciiType, default_validation_class:AsciiType),
  CfDef(keyspace:wordcount, name:output_words, column_type:Standard,
  comparator_type:AsciiType, default_validation_class:AsciiType),
  CfDef(keyspace:wordcount, name:input_words_count, column_type:Standard,
  comparator_type:UTF8Type, default_validation_class:CounterColumnType)])
  at
 
 org.apache.thrift.TApplicationException.read(TApplicationException.java:108)
  at
 
 org.apache.cassandra.thrift.Cassandra$Client.recv_system_add_keyspace(Cassandra.java:1531)
  at
 
 org.apache.cassandra.thrift.Cassandra$Client.system_add_keyspace(Cassandra.java:1514)
  at WordCountSetup.setupKeyspace(Unknown Source)
  at WordCountSetup.main(Unknown Source)
  --
  Regards,
 
  Tharindu
  blog: http://mackiemathew.com/
 



 --
 Jonathan Ellis
 Project Chair, Apache Cassandra
 co-founder of DataStax, the source for professional Cassandra support
 http://www.datastax.com




-- 
Regards,

Tharindu

blog: http://mackiemathew.com/


Re: Exception in Hadoop Word Count sample

2011-09-15 Thread Tharindu Mathew
Found it. 'ant artifacts'

On Thu, Sep 15, 2011 at 12:02 PM, Tharindu Mathew mcclou...@gmail.comwrote:

 Yes. That's the problem. Thanks Jonathan.

 I'm actually using trunk against a 0.7. How can I generate the distro in
 trunk?

 Forgive my ignorance, I'm more used to maven.


 On Thu, Sep 15, 2011 at 1:08 AM, Jonathan Ellis jbel...@gmail.com wrote:

 You're using a 0.8 wordcount against a 0.7 Cassandra?

 On Wed, Sep 14, 2011 at 2:19 PM, Tharindu Mathew mcclou...@gmail.com
 wrote:
  I see $subject. Can anyone help me to rectify this?
  Stacktrace:
  Exception in thread main org.apache.thrift.TApplicationException:
 Required
  field 'replication_factor' was not found in serialized data! Struct:
  KsDef(name:wordcount,
  strategy_class:org.apache.cassandra.locator.SimpleStrategy,
  strategy_options:{replication_factor=1}, replication_factor:0,
  cf_defs:[CfDef(keyspace:wordcount, name:input_words,
 column_type:Standard,
  comparator_type:AsciiType, default_validation_class:AsciiType),
  CfDef(keyspace:wordcount, name:output_words, column_type:Standard,
  comparator_type:AsciiType, default_validation_class:AsciiType),
  CfDef(keyspace:wordcount, name:input_words_count, column_type:Standard,
  comparator_type:UTF8Type, default_validation_class:CounterColumnType)])
  at
 
 org.apache.thrift.TApplicationException.read(TApplicationException.java:108)
  at
 
 org.apache.cassandra.thrift.Cassandra$Client.recv_system_add_keyspace(Cassandra.java:1531)
  at
 
 org.apache.cassandra.thrift.Cassandra$Client.system_add_keyspace(Cassandra.java:1514)
  at WordCountSetup.setupKeyspace(Unknown Source)
  at WordCountSetup.main(Unknown Source)
  --
  Regards,
 
  Tharindu
  blog: http://mackiemathew.com/
 



 --
 Jonathan Ellis
 Project Chair, Apache Cassandra
 co-founder of DataStax, the source for professional Cassandra support
 http://www.datastax.com




 --
 Regards,

 Tharindu

 blog: http://mackiemathew.com/




-- 
Regards,

Tharindu

blog: http://mackiemathew.com/


Re: Exception in Hadoop Word Count sample

2011-09-15 Thread Tharindu Mathew
Now I get this,

Any help would be greatly appreciated.

./bin/word_count
11/09/15 12:28:28 INFO WordCount: output reducer type: cassandra
11/09/15 12:28:29 INFO jvm.JvmMetrics: Initializing JVM Metrics with
processName=JobTracker, sessionId=
11/09/15 12:28:30 INFO mapred.JobClient: Running job: job_local_0001
11/09/15 12:28:30 INFO mapred.MapTask: io.sort.mb = 100
11/09/15 12:28:30 INFO mapred.MapTask: data buffer = 79691776/99614720
11/09/15 12:28:30 INFO mapred.MapTask: record buffer = 262144/327680
11/09/15 12:28:30 WARN mapred.LocalJobRunner: job_local_0001
java.lang.RuntimeException: java.lang.UnsupportedOperationException: no
local connection available
at
org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:132)
at
org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:418)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:620)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177)
Caused by: java.lang.UnsupportedOperationException: no local connection
available
at
org.apache.cassandra.hadoop.ColumnFamilyRecordReader.getLocation(ColumnFamilyRecordReader.java:176)
at
org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:113)
... 4 more
11/09/15 12:28:31 INFO mapred.JobClient:  map 0% reduce 0%
11/09/15 12:28:31 INFO mapred.JobClient: Job complete: job_local_0001
11/09/15 12:28:31 INFO mapred.JobClient: Counters: 0
11/09/15 12:28:31 INFO jvm.JvmMetrics: Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
11/09/15 12:28:32 INFO mapred.JobClient: Running job: job_local_0002
11/09/15 12:28:32 INFO mapred.MapTask: io.sort.mb = 100
11/09/15 12:28:32 INFO mapred.MapTask: data buffer = 79691776/99614720
11/09/15 12:28:32 INFO mapred.MapTask: record buffer = 262144/327680
11/09/15 12:28:32 WARN mapred.LocalJobRunner: job_local_0002
java.lang.RuntimeException: java.lang.UnsupportedOperationException: no
local connection available
at
org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:132)
at
org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:418)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:620)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177)
Caused by: java.lang.UnsupportedOperationException: no local connection
available
at
org.apache.cassandra.hadoop.ColumnFamilyRecordReader.getLocation(ColumnFamilyRecordReader.java:176)
at
org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:113)
... 4 more
11/09/15 12:28:33 INFO mapred.JobClient:  map 0% reduce 0%
11/09/15 12:28:33 INFO mapred.JobClient: Job complete: job_local_0002
11/09/15 12:28:33 INFO mapred.JobClient: Counters: 0
11/09/15 12:28:33 INFO jvm.JvmMetrics: Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
11/09/15 12:28:34 INFO mapred.JobClient: Running job: job_local_0003
11/09/15 12:28:34 INFO mapred.MapTask: io.sort.mb = 100
11/09/15 12:28:34 INFO mapred.MapTask: data buffer = 79691776/99614720
11/09/15 12:28:34 INFO mapred.MapTask: record buffer = 262144/327680
11/09/15 12:28:34 WARN mapred.LocalJobRunner: job_local_0003
java.lang.RuntimeException: java.lang.UnsupportedOperationException: no
local connection available
at
org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:132)
at
org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:418)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:620)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177)
Caused by: java.lang.UnsupportedOperationException: no local connection
available
at
org.apache.cassandra.hadoop.ColumnFamilyRecordReader.getLocation(ColumnFamilyRecordReader.java:176)
at
org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:113)
... 4 more
11/09/15 12:28:35 INFO mapred.JobClient:  map 0% reduce 0%
11/09/15 12:28:35 INFO mapred.JobClient: Job complete: job_local_0003
11/09/15 12:28:35 INFO mapred.JobClient: Counters: 0
11/09/15 12:28:35 INFO jvm.JvmMetrics: Cannot initialize JVM Metrics with
processName=JobTracker, sessionId= - already initialized
11/09/15 12:28:36 INFO mapred.JobClient: Running job: job_local_0004
11/09/15 12:28:36 INFO mapred.MapTask: io.sort.mb = 100
11/09/15 12:28:37 INFO mapred.MapTask: data buffer = 79691776/99614720
11/09/15 12:28:37 INFO mapred.MapTask: record buffer = 262144/327680
11/09/15 12:28:37 WARN mapred.LocalJobRunner: job_local_0004
java.lang.RuntimeException: java.lang.UnsupportedOperationException: no
local connection available
at

Re: Exception in Hadoop Word Count sample

2011-09-14 Thread Jonathan Ellis
You're using a 0.8 wordcount against a 0.7 Cassandra?

On Wed, Sep 14, 2011 at 2:19 PM, Tharindu Mathew mcclou...@gmail.com wrote:
 I see $subject. Can anyone help me to rectify this?
 Stacktrace:
 Exception in thread main org.apache.thrift.TApplicationException: Required
 field 'replication_factor' was not found in serialized data! Struct:
 KsDef(name:wordcount,
 strategy_class:org.apache.cassandra.locator.SimpleStrategy,
 strategy_options:{replication_factor=1}, replication_factor:0,
 cf_defs:[CfDef(keyspace:wordcount, name:input_words, column_type:Standard,
 comparator_type:AsciiType, default_validation_class:AsciiType),
 CfDef(keyspace:wordcount, name:output_words, column_type:Standard,
 comparator_type:AsciiType, default_validation_class:AsciiType),
 CfDef(keyspace:wordcount, name:input_words_count, column_type:Standard,
 comparator_type:UTF8Type, default_validation_class:CounterColumnType)])
 at
 org.apache.thrift.TApplicationException.read(TApplicationException.java:108)
 at
 org.apache.cassandra.thrift.Cassandra$Client.recv_system_add_keyspace(Cassandra.java:1531)
 at
 org.apache.cassandra.thrift.Cassandra$Client.system_add_keyspace(Cassandra.java:1514)
 at WordCountSetup.setupKeyspace(Unknown Source)
 at WordCountSetup.main(Unknown Source)
 --
 Regards,

 Tharindu
 blog: http://mackiemathew.com/




-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com