[ https://issues.apache.org/jira/browse/CASSANDRA-10938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15120077#comment-15120077 ]

Paulo Motta commented on CASSANDRA-10938:
-----------------------------------------

Sorry for the delay; reproducing this on Windows took longer than expected 
because of CASSANDRA-11073. Your investigation makes sense, and I agree 
{{NBHM}} might be the culprit here. However, I was still able to reproduce the 
issue after the patch, although the gap in record counts is much smaller now:
{noformat}
======================================================================
FAIL: test_bulk_round_trip_blogposts (cqlsh_tests.cqlsh_copy_tests.CqlshCopyTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "C:\Users\Paulo\Repositories\cassandra-dtest\dtest.py", line 733, in wrapped
    f(obj)
  File "C:\Users\Paulo\Repositories\cassandra-dtest\cqlsh_tests\cqlsh_copy_tests.py", line 2132, in test_bulk_round_trip_blogposts
    copy_to_options={'PAGETIMEOUT': 60, 'PAGESIZE': 1000})
  File "C:\Users\Paulo\Repositories\cassandra-dtest\cqlsh_tests\cqlsh_copy_tests.py", line 2098, in _test_bulk_round_trip
    self.assertEqual(num_records, sum(1 for _ in open(tempfile1.name)))
AssertionError: 99981 != 99903
{noformat} 

Tonight I will run with flight recorder enabled on Windows and try to extract a 
JFR for further investigation. It would also be useful to commit the 
JFR-enabled version to CI to confirm that it's the same issue happening there.

The new {{test_bulk_round_trip_blogposts_with_max_connections}} also failed 
once on Windows, but I suspect that is because the test could not keep up with 
failed connections:

{noformat}
======================================================================
FAIL: test_bulk_round_trip_blogposts_with_max_connections (cqlsh_tests.cqlsh_copy_tests.CqlshCopyTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "C:\Users\Paulo\Repositories\cassandra-dtest\dtest.py", line 733, in wrapped
    f(obj)
  File "C:\Users\Paulo\Repositories\cassandra-dtest\cqlsh_tests\cqlsh_copy_tests.py", line 2152, in test_bulk_round_trip_blogposts_with_max_connections
    copy_from_options={'NUMPROCESSES': 4})
  File "C:\Users\Paulo\Repositories\cassandra-dtest\cqlsh_tests\cqlsh_copy_tests.py", line 2098, in _test_bulk_round_trip
    self.assertEqual(num_records, sum(1 for _ in open(tempfile1.name)))
AssertionError: 100457 != 1611
{noformat}
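
To put the two failures side by side, here is a minimal sketch that turns each 
{{AssertionError}} into a count of missing records. It assumes, as the 
traceback suggests, that the first operand is the expected {{num_records}} and 
the second is the number of lines in the exported file. The helper names 
{{count_records}} and {{shortfall}} are hypothetical and are not part of 
cassandra-dtest.

```python
def count_records(path):
    """Count records the way the dtest assertion does: one record per line."""
    with open(path) as f:
        return sum(1 for _ in f)


def shortfall(expected, actual):
    """Return (missing_records, missing_percent) for a failed round trip."""
    missing = expected - actual
    return missing, 100.0 * missing / expected


# Counts copied from the two AssertionError lines in the tracebacks.
failures = [
    ("test_bulk_round_trip_blogposts", 99981, 99903),
    ("test_bulk_round_trip_blogposts_with_max_connections", 100457, 1611),
]

for name, expected, actual in failures:
    missing, pct = shortfall(expected, actual)
    print("%s: %d records missing (%.2f%%)" % (name, missing, pct))
```

Under that assumption the first run is short by 78 records (under 0.1%), 
while the max-connections run is short by 98846 records (over 98%), which is 
why the two failures look like different problems.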

The Python changes look good, and the new connection policy is much more 
resilient now.

> test_bulk_round_trip_blogposts is failing occasionally
> ------------------------------------------------------
>
>                 Key: CASSANDRA-10938
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-10938
>             Project: Cassandra
>          Issue Type: Sub-task
>          Components: Tools
>            Reporter: Stefania
>            Assignee: Stefania
>             Fix For: 2.1.x, 2.2.x, 3.0.x, 3.x
>
>         Attachments: 6452.nps, 6452.png, 7300.nps, 7300a.png, 7300b.png, 
> node1_debug.log, node2_debug.log, node3_debug.log, recording_127.0.0.1.jfr
>
>
> We get timeouts occasionally that cause the number of records to be incorrect:
> http://cassci.datastax.com/job/trunk_dtest/858/testReport/cqlsh_tests.cqlsh_copy_tests/CqlshCopyTest/test_bulk_round_trip_blogposts/



