Piotr Kołaczkowski created CASSANDRA-4588:
---------------------------------------------

             Summary: CQL COPY ... FROM command is slow
                 Key: CASSANDRA-4588
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4588
             Project: Cassandra
          Issue Type: Bug
          Components: Tools
    Affects Versions: 1.1.4
         Environment: Ubuntu Linux 12.04, kernel 3.4.0
            Reporter: Piotr Kołaczkowski


1. created a csv file with 10,000,000 rows with two integer columns; saved it 
to an SSD disk, it took a few seconds, the file is 184 MB large. 
2. started a single local cassandra node from fresh empty data and commit log 
dirs
3. created a keyspace with simple strategy and RF=1
4. loading the file with COPY ... FROM command - it is over 15 minutes now and 
still loading

top reports about 50% CPU usage for java (cassandra) and 50% for python.
I/O is almost idle, iowait < 0.1%. 



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to