Bulk Import Question

2017-01-23 Thread Joe Olson
I am bulk importing a large number of sstables that I pre-generated using the 
bulk load process outlined at 

https://github.com/yukim/cassandra-bulkload-example 

I am using the 'sstableloader' utility to import them into a nine node 
Cassandra cluster. 

During the sstableloader execution, I sometimes get the following error in the 
logs of one of the nodes: 

ERROR [STREAM-OUT-/xx.xx.xx.xx:38544] 2017-01-19 13:38:52,148 
StreamSession.java:533 - [Stream #d90444c0-de7e-11e6-922a-e792f38c7245] 
Streaming error occurred on session with peer xx.xx.xx.xx through xx.xx.xx.xx 
java.io.IOException: Connection reset by peer 

I assume the load for that particular sstable failed, and the data within was 
compromised and needs to be re-loaded. 

My question: is there any way to trap this (and other streaming errors) when 
using sstableloader to bulk import data? 
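
For illustration, one client-side approach might be to wrap each sstableloader 
invocation in a small script that checks the exit status and scans the captured 
output for error text. The sketch below is only that, a sketch: it assumes 
sstableloader exits with a non-zero status when streaming fails, and that the 
error strings from the node log quoted above would also show up in the loader's 
own output, neither of which has been verified here. The function names 
run_loader and load_all are hypothetical. 

```python
import subprocess

# Substrings treated as failure markers. Both come from the node log quoted
# above; that they also appear in sstableloader's own output is an assumption.
ERROR_MARKERS = ("Streaming error occurred", "java.io.IOException")


def run_loader(cmd):
    """Run one loader command and return (ok, suspicious_lines).

    ok is True only if the process exited with status 0 and none of the
    captured output lines contained an error marker.
    """
    proc = subprocess.run(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # merge stderr into stdout so both are scanned
        text=True,
    )
    suspicious = [
        line
        for line in proc.stdout.splitlines()
        if any(marker in line for marker in ERROR_MARKERS)
    ]
    ok = proc.returncode == 0 and not suspicious
    return ok, suspicious


def load_all(directories, hosts, loader="sstableloader"):
    """Load each sstable directory in turn; return the ones that need a retry."""
    failed = []
    for directory in directories:
        # -d passes the initial contact hosts to sstableloader.
        ok, _ = run_loader([loader, "-d", hosts, directory])
        if not ok:
            failed.append(directory)
    return failed
```

Calling load_all with the list of sstable directories and a comma-separated 
host list would return the directories whose load looked suspect, so only those 
need to be re-run. 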


Bulk Import Question

2016-12-28 Thread Joe Olson
I'm following the example here for doing a bulk import into Cassandra: 
https://github.com/yukim/cassandra-bulkload-example 

Is there a way to get the number of rows written to an sstable set created via 
CQLSSTableWriter, without importing the sstable set into Cassandra? 

I'd like to do some QA on the converted sstables I have before importing them 
into Cassandra. 


Bulk Import Question

2016-11-23 Thread Joe Olson
I'm following the Cassandra bulk import example here: 
https://github.com/yukim/cassandra-bulkload-example 

Are the Cassandra data types inet, smallint, and tinyint supported by the bulk 
import CQLSSTableWriter? 

I can't seem to get them to work...