[jira] [Resolved] (ACCUMULO-400) continuous random walk execution

Christopher Tubbs (Jira) Wed, 08 Feb 2023 01:47:06 -0800


     [ 
https://issues.apache.org/jira/browse/ACCUMULO-400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Christopher Tubbs resolved ACCUMULO-400.
----------------------------------------
    Resolution: Abandoned

Closing this old issue. If this is still a problem, please open a new issue or 
PR at https://github.com/apache/accumulo

> continuous random walk execution
> --------------------------------
>
>                 Key: ACCUMULO-400
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-400
>             Project: Accumulo
>          Issue Type: Improvement
>            Reporter: Adam Fuchs
>            Priority: Major
>
> Random walk is finding bugs like a boss, but we can anticipate future usage 
> in which the current setup will be limiting. In particular, with a larger 
> development team knocking off bugs and writing new tests we might get to the 
> point where the most obvious bug is the only one that we find in a given run 
> of all of the random walkers. Consider hundreds of random walkers walking 
> over all of the tests. Many of these tests will find bugs 
> non-deterministically. If we add one test that finds one bug with high 
> probability, all of the walkers will find that bug and halt. None of the 
> other bugs will be found until the one bug is fixed or the test is removed.
> Here are some things we could do to improve this situation and migrate to 
> more of a continual random walk setup:
> 1. Stop executing a test after some number of walkers have found a bug when 
> running it.
> 2. Store the random walk graph in a database and have the walkers re-query it 
> with some regularity. This will let us add new tests to running walkers.
> 3. Have the walkers snapshot the relevant parts of the overall system when 
> they find a bug. We currently rely on the walkers halting to preserve the 
> state of the system so that we can manually extract all of the relevant 
> details that may have led to the bug. Dynamically snapshotting the system 
> makes it possible to continue to run tests without rolling over logs and 
> forensic information. Exactly what information needs to be kept is TBD.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Resolved] (ACCUMULO-400) continuous random walk execution

Reply via email to