Re: [racket-dev] DrDr Feature Request
I noticed this functionality just now.. thanks a lot! On 08/08/2011 12:38 PM, Jay McCarthy wrote: > Your wish is my command. > > On Mon, Aug 8, 2011 at 10:00 AM, Robby Findler > wrote: >> PS: I'm also happy if this class of tests only emails the responsible >> person, and not the pusher. >> >> Robby >> >> On Mon, Aug 8, 2011 at 10:59 AM, Robby Findler >> wrote: >>> I like the two-times-in-a-row thought. >>> >>> FWIW, please try to avoid race conditions of the second kind. >>> >>> I think the drracket test suites are special because they fail >>> not-so-often and I don't actually know how to fix them. If either of >>> those weren't true then I'd say they should just not run in drdr. (So >>> the race-condition/using the same file thing fails this test.) >>> >>> Robby >>> >>> On Mon, Aug 8, 2011 at 10:56 AM, Vincent St-Amour >>> wrote: I love DrDr, but there's a small thing that annoys me about it. Some tests are prone to intermittent failures. For example, some benchmarks need to create a file, and several benchmarks share the same file, which leads to race conditions. Similarly, some DrRacket tests sometimes fail for focus reasons. So, whenever someone pushes, they may get failures from these tests, then have go look at the actual errors, and try to figure out if they actually broke something or not. (Or, they ignore these failures, which is bad.) Here are two potential solutions. Let's assume that I just pushed something, and a test started failing. - Have DrDr send me email for every push about the broken test for as long as it fails. If I get email more than once, it's likely that I actually broke something. If I only get email once, the problem went away on its own, and was likely an intermittent failure. - Have the possiblity to flag some tests as intermittent (something like `drdr:random'), and only report failures for these tests if they fail twice in a row. This would reduce the amount of noise, since I expect most of these tests to pass most of the time. Actual breakage would still be detected, since it's unlikely that such failures would go away on their own. Detection would happen one push late, but that shouldn't be too much of an issue. Or, maybe only notify the pusher after two failures in a row, but notify the responsible person right away. Any thoughts? Vincent _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev >> _ >> For list-related administrative tasks: >> http://lists.racket-lang.org/listinfo/dev > > _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
Re: [racket-dev] DrDr Feature Request
Your wish is my command. On Mon, Aug 8, 2011 at 10:00 AM, Robby Findler wrote: > PS: I'm also happy if this class of tests only emails the responsible > person, and not the pusher. > > Robby > > On Mon, Aug 8, 2011 at 10:59 AM, Robby Findler > wrote: >> I like the two-times-in-a-row thought. >> >> FWIW, please try to avoid race conditions of the second kind. >> >> I think the drracket test suites are special because they fail >> not-so-often and I don't actually know how to fix them. If either of >> those weren't true then I'd say they should just not run in drdr. (So >> the race-condition/using the same file thing fails this test.) >> >> Robby >> >> On Mon, Aug 8, 2011 at 10:56 AM, Vincent St-Amour >> wrote: >>> >>> I love DrDr, but there's a small thing that annoys me about it. >>> >>> Some tests are prone to intermittent failures. For example, some >>> benchmarks need to create a file, and several benchmarks share the >>> same file, which leads to race conditions. Similarly, some DrRacket >>> tests sometimes fail for focus reasons. >>> >>> So, whenever someone pushes, they may get failures from these tests, >>> then have go look at the actual errors, and try to figure out if they >>> actually broke something or not. >>> >>> (Or, they ignore these failures, which is bad.) >>> >>> Here are two potential solutions. Let's assume that I just pushed >>> something, and a test started failing. >>> >>> - Have DrDr send me email for every push about the broken test for as >>> long as it fails. If I get email more than once, it's likely that I >>> actually broke something. If I only get email once, the problem went >>> away on its own, and was likely an intermittent failure. >>> >>> - Have the possiblity to flag some tests as intermittent (something >>> like `drdr:random'), and only report failures for these tests if >>> they fail twice in a row. This would reduce the amount of noise, >>> since I expect most of these tests to pass most of the time. Actual >>> breakage would still be detected, since it's unlikely that such >>> failures would go away on their own. Detection would happen one push >>> late, but that shouldn't be too much of an issue. >>> >>> Or, maybe only notify the pusher after two failures in a row, but >>> notify the responsible person right away. >>> >>> Any thoughts? >>> >>> Vincent >>> _ >>> For list-related administrative tasks: >>> http://lists.racket-lang.org/listinfo/dev >>> >> > > _ > For list-related administrative tasks: > http://lists.racket-lang.org/listinfo/dev -- Jay McCarthy Assistant Professor / Brigham Young University http://faculty.cs.byu.edu/~jay "The glory of God is Intelligence" - D&C 93 _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
Re: [racket-dev] DrDr Feature Request
Could DrDr say "This build is not the latest" or "The latest push is 234234"? On 08/08/2011 11:37 AM, Jay McCarthy wrote: > It is useful to test all of them to find out when errors start. It > doesn't do the newest first, because then the calculation of "new > issue" wouldn't make any sense, because you wouldn't have the previous > push's tests. > > Jay > > On Mon, Aug 8, 2011 at 11:34 AM, Jon Rafkind wrote: >> Another request: could DrDr process the latest push first? Its a little >> annoying to get emails for tests that failed when the latest push fixes >> them but DrDr is so far behind. Is there any benefit to testing all the >> intermediate pushes? >> >> On 08/08/2011 09:56 AM, Vincent St-Amour wrote: >>> I love DrDr, but there's a small thing that annoys me about it. >>> >>> Some tests are prone to intermittent failures. For example, some >>> benchmarks need to create a file, and several benchmarks share the >>> same file, which leads to race conditions. Similarly, some DrRacket >>> tests sometimes fail for focus reasons. >>> >>> So, whenever someone pushes, they may get failures from these tests, >>> then have go look at the actual errors, and try to figure out if they >>> actually broke something or not. >>> >>> (Or, they ignore these failures, which is bad.) >>> >>> Here are two potential solutions. Let's assume that I just pushed >>> something, and a test started failing. >>> >>> - Have DrDr send me email for every push about the broken test for as >>> long as it fails. If I get email more than once, it's likely that I >>> actually broke something. If I only get email once, the problem went >>> away on its own, and was likely an intermittent failure. >>> >>> - Have the possiblity to flag some tests as intermittent (something >>> like `drdr:random'), and only report failures for these tests if >>> they fail twice in a row. This would reduce the amount of noise, >>> since I expect most of these tests to pass most of the time. Actual >>> breakage would still be detected, since it's unlikely that such >>> failures would go away on their own. Detection would happen one push >>> late, but that shouldn't be too much of an issue. >>> >>> Or, maybe only notify the pusher after two failures in a row, but >>> notify the responsible person right away. >>> >>> Any thoughts? >>> >>> Vincent >>> _ >>> For list-related administrative tasks: >>> http://lists.racket-lang.org/listinfo/dev >> _ >> For list-related administrative tasks: >> http://lists.racket-lang.org/listinfo/dev >> > > _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
Re: [racket-dev] DrDr Feature Request
It is useful to test all of them to find out when errors start. It doesn't do the newest first, because then the calculation of "new issue" wouldn't make any sense, because you wouldn't have the previous push's tests. Jay On Mon, Aug 8, 2011 at 11:34 AM, Jon Rafkind wrote: > Another request: could DrDr process the latest push first? Its a little > annoying to get emails for tests that failed when the latest push fixes > them but DrDr is so far behind. Is there any benefit to testing all the > intermediate pushes? > > On 08/08/2011 09:56 AM, Vincent St-Amour wrote: >> I love DrDr, but there's a small thing that annoys me about it. >> >> Some tests are prone to intermittent failures. For example, some >> benchmarks need to create a file, and several benchmarks share the >> same file, which leads to race conditions. Similarly, some DrRacket >> tests sometimes fail for focus reasons. >> >> So, whenever someone pushes, they may get failures from these tests, >> then have go look at the actual errors, and try to figure out if they >> actually broke something or not. >> >> (Or, they ignore these failures, which is bad.) >> >> Here are two potential solutions. Let's assume that I just pushed >> something, and a test started failing. >> >> - Have DrDr send me email for every push about the broken test for as >> long as it fails. If I get email more than once, it's likely that I >> actually broke something. If I only get email once, the problem went >> away on its own, and was likely an intermittent failure. >> >> - Have the possiblity to flag some tests as intermittent (something >> like `drdr:random'), and only report failures for these tests if >> they fail twice in a row. This would reduce the amount of noise, >> since I expect most of these tests to pass most of the time. Actual >> breakage would still be detected, since it's unlikely that such >> failures would go away on their own. Detection would happen one push >> late, but that shouldn't be too much of an issue. >> >> Or, maybe only notify the pusher after two failures in a row, but >> notify the responsible person right away. >> >> Any thoughts? >> >> Vincent >> _ >> For list-related administrative tasks: >> http://lists.racket-lang.org/listinfo/dev > > _ > For list-related administrative tasks: > http://lists.racket-lang.org/listinfo/dev > -- Jay McCarthy Assistant Professor / Brigham Young University http://faculty.cs.byu.edu/~jay "The glory of God is Intelligence" - D&C 93 _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
Re: [racket-dev] DrDr Feature Request
This is a rare event (playing catchup like this) so I think it is probably best if we just let it catch up. Should be just a couple of more days (maybe a week) by my sketchy guesstimationizing. Robby On Mon, Aug 8, 2011 at 12:34 PM, Jon Rafkind wrote: > Another request: could DrDr process the latest push first? Its a little > annoying to get emails for tests that failed when the latest push fixes > them but DrDr is so far behind. Is there any benefit to testing all the > intermediate pushes? > > On 08/08/2011 09:56 AM, Vincent St-Amour wrote: >> I love DrDr, but there's a small thing that annoys me about it. >> >> Some tests are prone to intermittent failures. For example, some >> benchmarks need to create a file, and several benchmarks share the >> same file, which leads to race conditions. Similarly, some DrRacket >> tests sometimes fail for focus reasons. >> >> So, whenever someone pushes, they may get failures from these tests, >> then have go look at the actual errors, and try to figure out if they >> actually broke something or not. >> >> (Or, they ignore these failures, which is bad.) >> >> Here are two potential solutions. Let's assume that I just pushed >> something, and a test started failing. >> >> - Have DrDr send me email for every push about the broken test for as >> long as it fails. If I get email more than once, it's likely that I >> actually broke something. If I only get email once, the problem went >> away on its own, and was likely an intermittent failure. >> >> - Have the possiblity to flag some tests as intermittent (something >> like `drdr:random'), and only report failures for these tests if >> they fail twice in a row. This would reduce the amount of noise, >> since I expect most of these tests to pass most of the time. Actual >> breakage would still be detected, since it's unlikely that such >> failures would go away on their own. Detection would happen one push >> late, but that shouldn't be too much of an issue. >> >> Or, maybe only notify the pusher after two failures in a row, but >> notify the responsible person right away. >> >> Any thoughts? >> >> Vincent >> _ >> For list-related administrative tasks: >> http://lists.racket-lang.org/listinfo/dev > > _ > For list-related administrative tasks: > http://lists.racket-lang.org/listinfo/dev > _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
Re: [racket-dev] DrDr Feature Request
Another request: could DrDr process the latest push first? Its a little annoying to get emails for tests that failed when the latest push fixes them but DrDr is so far behind. Is there any benefit to testing all the intermediate pushes? On 08/08/2011 09:56 AM, Vincent St-Amour wrote: > I love DrDr, but there's a small thing that annoys me about it. > > Some tests are prone to intermittent failures. For example, some > benchmarks need to create a file, and several benchmarks share the > same file, which leads to race conditions. Similarly, some DrRacket > tests sometimes fail for focus reasons. > > So, whenever someone pushes, they may get failures from these tests, > then have go look at the actual errors, and try to figure out if they > actually broke something or not. > > (Or, they ignore these failures, which is bad.) > > Here are two potential solutions. Let's assume that I just pushed > something, and a test started failing. > > - Have DrDr send me email for every push about the broken test for as > long as it fails. If I get email more than once, it's likely that I > actually broke something. If I only get email once, the problem went > away on its own, and was likely an intermittent failure. > > - Have the possiblity to flag some tests as intermittent (something > like `drdr:random'), and only report failures for these tests if > they fail twice in a row. This would reduce the amount of noise, > since I expect most of these tests to pass most of the time. Actual > breakage would still be detected, since it's unlikely that such > failures would go away on their own. Detection would happen one push > late, but that shouldn't be too much of an issue. > > Or, maybe only notify the pusher after two failures in a row, but > notify the responsible person right away. > > Any thoughts? > > Vincent > _ > For list-related administrative tasks: > http://lists.racket-lang.org/listinfo/dev _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
Re: [racket-dev] DrDr Feature Request
At Mon, 8 Aug 2011 11:06:30 -0500, Robby Findler wrote: > > On Mon, Aug 8, 2011 at 11:05 AM, Vincent St-Amour > wrote: > > At Mon, 8 Aug 2011 10:59:24 -0500, > > Robby Findler wrote: > >> FWIW, please try to avoid race conditions of the second kind. > > > > Some of these I can try to fix. But I don't think all intermittent > > failures fit in this category. > > Right. I'm saying this: if you know you have a race condition and you > know how to fix it, then it is better to fix it than to use a > mechanism like the one you're asking for. I agree. Vincent _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
Re: [racket-dev] DrDr Feature Request
On Mon, Aug 8, 2011 at 11:05 AM, Vincent St-Amour wrote: > At Mon, 8 Aug 2011 10:59:24 -0500, > Robby Findler wrote: >> FWIW, please try to avoid race conditions of the second kind. > > Some of these I can try to fix. But I don't think all intermittent > failures fit in this category. Right. I'm saying this: if you know you have a race condition and you know how to fix it, then it is better to fix it than to use a mechanism like the one you're asking for. Robby _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
Re: [racket-dev] DrDr Feature Request
At Mon, 8 Aug 2011 10:59:24 -0500, Robby Findler wrote: > FWIW, please try to avoid race conditions of the second kind. Some of these I can try to fix. But I don't think all intermittent failures fit in this category. > I think the drracket test suites are special because they fail > not-so-often and I don't actually know how to fix them. If either of > those weren't true then I'd say they should just not run in drdr. (So > the race-condition/using the same file thing fails this test.) Running these tests in DrDr has the benefit of detecting actual breakage when it happens, so I don't think we should give up on this. > PS: I'm also happy if this class of tests only emails the responsible > person, and not the pusher. I like that, and it's probably simpler to implement. Vincent _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
Re: [racket-dev] DrDr Feature Request
PS: I'm also happy if this class of tests only emails the responsible person, and not the pusher. Robby On Mon, Aug 8, 2011 at 10:59 AM, Robby Findler wrote: > I like the two-times-in-a-row thought. > > FWIW, please try to avoid race conditions of the second kind. > > I think the drracket test suites are special because they fail > not-so-often and I don't actually know how to fix them. If either of > those weren't true then I'd say they should just not run in drdr. (So > the race-condition/using the same file thing fails this test.) > > Robby > > On Mon, Aug 8, 2011 at 10:56 AM, Vincent St-Amour > wrote: >> >> I love DrDr, but there's a small thing that annoys me about it. >> >> Some tests are prone to intermittent failures. For example, some >> benchmarks need to create a file, and several benchmarks share the >> same file, which leads to race conditions. Similarly, some DrRacket >> tests sometimes fail for focus reasons. >> >> So, whenever someone pushes, they may get failures from these tests, >> then have go look at the actual errors, and try to figure out if they >> actually broke something or not. >> >> (Or, they ignore these failures, which is bad.) >> >> Here are two potential solutions. Let's assume that I just pushed >> something, and a test started failing. >> >> - Have DrDr send me email for every push about the broken test for as >> long as it fails. If I get email more than once, it's likely that I >> actually broke something. If I only get email once, the problem went >> away on its own, and was likely an intermittent failure. >> >> - Have the possiblity to flag some tests as intermittent (something >> like `drdr:random'), and only report failures for these tests if >> they fail twice in a row. This would reduce the amount of noise, >> since I expect most of these tests to pass most of the time. Actual >> breakage would still be detected, since it's unlikely that such >> failures would go away on their own. Detection would happen one push >> late, but that shouldn't be too much of an issue. >> >> Or, maybe only notify the pusher after two failures in a row, but >> notify the responsible person right away. >> >> Any thoughts? >> >> Vincent >> _ >> For list-related administrative tasks: >> http://lists.racket-lang.org/listinfo/dev >> > _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
Re: [racket-dev] DrDr Feature Request
I like the two-times-in-a-row thought. FWIW, please try to avoid race conditions of the second kind. I think the drracket test suites are special because they fail not-so-often and I don't actually know how to fix them. If either of those weren't true then I'd say they should just not run in drdr. (So the race-condition/using the same file thing fails this test.) Robby On Mon, Aug 8, 2011 at 10:56 AM, Vincent St-Amour wrote: > > I love DrDr, but there's a small thing that annoys me about it. > > Some tests are prone to intermittent failures. For example, some > benchmarks need to create a file, and several benchmarks share the > same file, which leads to race conditions. Similarly, some DrRacket > tests sometimes fail for focus reasons. > > So, whenever someone pushes, they may get failures from these tests, > then have go look at the actual errors, and try to figure out if they > actually broke something or not. > > (Or, they ignore these failures, which is bad.) > > Here are two potential solutions. Let's assume that I just pushed > something, and a test started failing. > > - Have DrDr send me email for every push about the broken test for as > long as it fails. If I get email more than once, it's likely that I > actually broke something. If I only get email once, the problem went > away on its own, and was likely an intermittent failure. > > - Have the possiblity to flag some tests as intermittent (something > like `drdr:random'), and only report failures for these tests if > they fail twice in a row. This would reduce the amount of noise, > since I expect most of these tests to pass most of the time. Actual > breakage would still be detected, since it's unlikely that such > failures would go away on their own. Detection would happen one push > late, but that shouldn't be too much of an issue. > > Or, maybe only notify the pusher after two failures in a row, but > notify the responsible person right away. > > Any thoughts? > > Vincent > _ > For list-related administrative tasks: > http://lists.racket-lang.org/listinfo/dev > _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev
[racket-dev] DrDr Feature Request
I love DrDr, but there's a small thing that annoys me about it. Some tests are prone to intermittent failures. For example, some benchmarks need to create a file, and several benchmarks share the same file, which leads to race conditions. Similarly, some DrRacket tests sometimes fail for focus reasons. So, whenever someone pushes, they may get failures from these tests, then have go look at the actual errors, and try to figure out if they actually broke something or not. (Or, they ignore these failures, which is bad.) Here are two potential solutions. Let's assume that I just pushed something, and a test started failing. - Have DrDr send me email for every push about the broken test for as long as it fails. If I get email more than once, it's likely that I actually broke something. If I only get email once, the problem went away on its own, and was likely an intermittent failure. - Have the possiblity to flag some tests as intermittent (something like `drdr:random'), and only report failures for these tests if they fail twice in a row. This would reduce the amount of noise, since I expect most of these tests to pass most of the time. Actual breakage would still be detected, since it's unlikely that such failures would go away on their own. Detection would happen one push late, but that shouldn't be too much of an issue. Or, maybe only notify the pusher after two failures in a row, but notify the responsible person right away. Any thoughts? Vincent _ For list-related administrative tasks: http://lists.racket-lang.org/listinfo/dev