Re: missing rows while importing data using sstable loader

2016-02-05 Thread Victor Chen
Arindam, What can you share regarding the source from which you are importing data? Is it a separate cassandra cluster? If so, how many nodes and datacenters? What is RF (replication factor) of source cluster? How certain are you that the rows indeed exist in the set of sstables which you are

Re: missing rows while importing data using sstable loader

2016-02-05 Thread Jack Krupansky
I sent a message to DataStax Docs to add this nodetool flush suggestion to the doc for sstableloader. -- Jack Krupansky On Fri, Feb 5, 2016 at 3:35 AM, Romain Hardouin wrote: > > What is the best practise to create sstables? > > When you run a "nodetool flush" Cassandra

Re: missing rows while importing data using sstable loader

2016-02-01 Thread Arindam Choudhury
What is the best practise to create sstables? On 1 February 2016 at 15:21, Romain Hardouin wrote: > Did you run "nodetool flush" on the source node? If not, the missing rows > could be in memtables. >

Re: missing rows while importing data using sstable loader

2016-02-01 Thread Romain Hardouin
Did you run "nodetool flush" on the source node? If not, the missing rows could be in memtables.

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
Hi Romain, The RF was set to 2. I changed it to one. CREATE KEYSPACE mordor WITH replication = {'class' : 'SimpleStrategy', 'replication_factor' : 1} AND durable_writes = true; re-inserted the columns, still missing rows. Regards, Arindam On 29 January 2016 at 15:14, Romain Hardouin

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Romain Hardouin
Hi, I assume a RF > 1. Right?What is the consistency level you used? cqlsh use ONE by default. Try: cqlsh> CONSISTENCY ALLAnd run your query again. Best,Romain Le Vendredi 29 janvier 2016 13h45, Arindam Choudhury a écrit : Hi Kai, The table schema is:

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
I will check the output of nodetool cfstats. Its from version 2.1.2 to version 2.1.9. On 29 January 2016 at 16:02, Jack Krupansky wrote: > Are these sstables from an existing Cassandra cluster or generated by a > program? > > If the former, do a nodetool tablestats or

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Jack Krupansky
Are these sstables from an existing Cassandra cluster or generated by a program? If the former, do a nodetool tablestats or cfstats to get the sstable count and compare it to both the number of sstables that the loader is reading from and the number that end up in the target cluster. What

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
I am counting the rows with "select count(*) from mordor.things_values_meta;" I am doing one node cluster to one node cluster for testing. On 29 January 2016 at 16:20, Jack Krupansky wrote: > And how are you counting the rows? With a query? If, so, what is the >

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Kai Wang
Arindam, what's the table schema and what does your query to retrieve the rows look like? On Fri, Jan 29, 2016 at 7:33 AM, Arindam Choudhury < arindam.choudh...@ackstorm.com> wrote: > Hi, > > I am importing data to a new cassandra cluster using sstableloader. The > sstableloader runs without

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
Hi Kai, The table schema is: CREATE TABLE mordor.things_values_meta ( thing_id text, key text, bucket_timestamp timestamp, total_rows counter, PRIMARY KEY ((thing_id, key), bucket_timestamp) ) WITH CLUSTERING ORDER BY (bucket_timestamp ASC) AND bloom_filter_fp_chance =

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Arindam Choudhury
Why in cqlsh when I query "select count(*) from mordor.things_values_meta ;" it says: 4692 But in nodetool cfstats it says Number of keys (estimate): 4720? On 29 January 2016 at 16:25, Arindam Choudhury < arindam.choudh...@ackstorm.com> wrote: > I am counting the rows with "select count(*) from

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Jack Krupansky
I agree that there should be more clear doc on exactly how the estimation is calculated. When I inquired about this recently the response was that it should be within about 2% of the actual key count. I started looking at the code, but I ran out of time before I chased down all the subsidiary

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Jack Krupansky
And how are you counting the rows? With a query? If, so, what is the query. Using nodetool cfstats (estimated) key count? Or... what? Are the tokens for the missing rows is the same range and a distinct range from the rest of the data in the original cluster? How many nodes in the original