On Mon, Apr 13, 2009 at 6:57 PM, Glenn Wilson <gwil...@chromium.org> wrote:
> Ok, I ran the script and created 211 bugs (in the ~10270 - 10500 range). > As far as I can tell, all the bugs were created successfully....the script > now picks up 0 tests without bugs. > Ojan, I sent you the review of test_expectations: > http://codereview.chromium.org/73026/show > Two things I noticed: > 1. The script didn't recognize layout tests in chrome/ and not in > LayoutTests/ -- it's only a handful, so updating those by hand shouldn't > take very long. > 2. The script re-arranged some of the flags, as expected. If it is > important that we keep the prior ordering of the flags, let me know. > No, order is not important. > Also, I'd love to get this script reviewed and checked in, if anyone would > be willing to do a Java review. > If I knew Java, much less had Java readability, I'd be glad to... - Pam > > Regards, > Glenn > > > On Mon, Apr 13, 2009 at 10:19 AM, Ojan Vafai <o...@google.com> wrote: > >> On Mon, Apr 13, 2009 at 9:35 AM, Pam Greene <p...@chromium.org> wrote: >> >>> On Mon, Apr 13, 2009 at 9:09 AM, Glenn Wilson <gwil...@chromium.org>wrote: >>> >>>> Yes, the intention is to have something maintainable that gets run as >>>> part of the regular testing/merging/etc. process -- which means eventually >>>> rewriting in python. I wrote this in Java because we had a java lib for >>>> automatically creating bugs in the issue tracker, and I wanted to get it up >>>> and running quickly. We should have a python API for doing the same thing. >>>> >>>> I can easily change the script so it marks DEFERs as P3s. However, >>>> currently, the script doesn't alter any entries in the file that already >>>> have bugs -- meaning something like "BUG1234 DEFER : ..." or "WONTFIX DEFER >>>> : ..." won't change. Is this the correct behavior? >>>> >>> >>> The end goal is to take DEFER out and let the bug priorities run the >>> show. But if we're not entirely confident that this will work (both the >>> script and the human tracking process afterward), it's easy enough to strip >>> the "DEFER" modifiers later. >>> >> >> Makes sense to me. Lets leave in the defer modifiers, but mark any *new* >> bugs for deferred tests as P3s. The current behavior of leaving existing >> bugs unaltered seems correct to me. >> >> BTW, doing it in Java for expediency sake is totally reasonable. >> >> Ojan >> >> >>> >>> - Pam >>> >>> >>>> >>>> I'm planning on running the script later today with the most recent >>>> version of test_expectations.txt -- there will be 200+ new bugs. Let me >>>> know if I should hold off on doing so. Ojan, I'll send you the review for >>>> test_expecations.txt. >>>> >>>> Regards, >>>> Glenn >>>> >>>> >>>> >>>> On Fri, Apr 10, 2009 at 4:36 PM, Ojan Vafai <o...@google.com> wrote: >>>> >>>>> I hate to ask this, but any chance we could rewrite it in python? It >>>>> would be a lot less code in python (one script file) and would match the >>>>> rest of our codebase (which currently does all scripting in python). I'm >>>>> mainly worried about usability and maintainability here. >>>>> That all said, I think you should go ahead and run this script as a one >>>>> off so we can get the test lists having bug IDs ASAP as that's kind of >>>>> urgent to us being able to track the current state of layout tests. Then >>>>> we >>>>> can worry about checking in a python version later if we decide it's >>>>> necessary. >>>>> >>>>> Pam, Darin, Jon, Mark: You might want to confirm that my request below >>>>> makes sense with respect to future handling/triaging of layout tests bugs. >>>>> >>>>> Also for the initial one-off version, can you make any tests that are >>>>> currently marked as DEFER be P3s? Then when you spit out the results to >>>>> the >>>>> test_expectations file, I don't think there's any need to include the >>>>> DEFER >>>>> anymore as we'll rely entirely on bug priorities to decide which tests >>>>> need >>>>> fixing for the next release (i.e. all P1 tests and maybe some P2 tests). >>>>> That way there will be less initial triaging to do in order to make sure >>>>> we >>>>> keep the tree shippable. Eventually we'll have to go through all the P3s >>>>> in >>>>> the "Untriaged" state and up some of them to P2s, but there is no pressing >>>>> need to do so right away. >>>>> >>>>> Ojan >>>>> >>>>> On Fri, Apr 10, 2009 at 3:42 PM, Glenn Wilson <gwil...@chromium.org>wrote: >>>>> >>>>>> (Ack...resending) >>>>>> Ok, I have the CL ready, if anyone with Java readability would be >>>>>> willing to do a review, please let me know. >>>>>> more inline... >>>>>> >>>>>> On Fri, Apr 10, 2009 at 2:07 PM, Pam Greene <p...@chromium.org> wrote: >>>>>> >>>>>>> >>>>>>> >>>>>>> On Thu, Apr 9, 2009 at 6:49 PM, Ojan Vafai <o...@google.com> wrote: >>>>>>> >>>>>>>> At a quick glance, this looks great. I didn't look over every bug, >>>>>>>> but the ones I did look at look good. >>>>>>> >>>>>>> >>>>>>> Yep. You'll want some sort of default description for the ones that >>>>>>> have none. >>>>>>> >>>>>> >>>>>> I ran the script on a small test file...an example bug is here: >>>>>> http://code.google.com/p/chromium/issues/detail?id=9991 >>>>>> >>>>>> There's at least a small bit of indication of where it came from. I >>>>>> also added the line numbers from test_expectations.txt that generated >>>>>> the >>>>>> bug. Additionally, I also wrote the script to then add the created bug >>>>>> numbers to the file. This ends up re-arranging some of the flags (DEBUG >>>>>> DEFER ... etc. --> DEFER DEBUG ...), but all data is retained. >>>>>> >>>>>> >>>>>>> 200+ bugs is certainly too many, but that's no reason not to file >>>>>>> them. (Sorry, couldn't resist. Seriously, yes, definitely file them. >>>>>>> Better >>>>>>> in the issue tracker than getting lost.) >>>>>>> >>>>>> >>>>>> There are probably some improvements to be had to better group some of >>>>>> the bugs, but it's probably not an order of magnitude improvement. I >>>>>> just >>>>>> didn't want to create tons of bugs that were not useful. >>>>>> >>>>>> >>>>>>> >>>>>>> >>>>>>>> It would be great to check in a version of this script that we could >>>>>>>> run when a number of tests fail (e.g. when doing a bad webkit merge). >>>>>>>> That >>>>>>>> way, we can add them all to the local test_expectations.txt file and >>>>>>>> have it >>>>>>>> spit out the new results. >>>>>>>> >>>>>>> >>>>>>> Fixing the file we have and moving forward are slightly different use >>>>>>> cases, but yes. In the long run, we shouldn't need any script to fix an >>>>>>> existing bug-less expectations file, only the part that adds bugs for >>>>>>> newly >>>>>>> added tests. I'm not sure whether that part should be controlled by >>>>>>> adding >>>>>>> the tests to the file and having the script file bugs, or making it a >>>>>>> fully >>>>>>> interactive "app": give it a list of files and a description, and it >>>>>>> both >>>>>>> changes the file and creates a bug. >>>>>>> >>>>>> >>>>>> There may be a short-term solution and a long-term solution. The >>>>>> short term solution seems to be for someone (assumedly the merger) to add >>>>>> the failing tests to the file, run the script, then commit the file. In >>>>>> the >>>>>> long term, this could be one step, or perhaps be driven right from a >>>>>> "roll >>>>>> DEPS to new version of WebKit" script...right? >>>>>> >>>>>> >>>>>>> >>>>>>>> Really, it would be great if the script filed bugs and then just >>>>>>>> modified test_expectations.txt directly (without committing it). Also, >>>>>>>> the >>>>>>>> script should remove any comments it moves into bug descriptions. We >>>>>>>> should >>>>>>>> get to a point where all the comments about layout tests are in the bug >>>>>>>> descriptions themselves instead of in this file. >>>>>>>> >>>>>>> >>>>>>> I disagree with this last part. I'd prefer a brief description to >>>>>>> remain in the file, with any details in the bug. Certainly that's >>>>>>> needed for >>>>>>> WONTFIX bugs, where we may not have a bug since there's no work to be >>>>>>> tracked, but it's helpful for others too. I've found it frustrating and >>>>>>> time-consuming to track down when I see big blocks of failures with no >>>>>>> explanations at all. Think of it like the svn checkin comment: enough to >>>>>>> have an idea what's going on right there where you need it, with more >>>>>>> detail >>>>>>> in the bug for when you're really digging. >>>>>>> >>>>>> >>>>>> Yep, I modified the script so that it will change >>>>>> test_expectations.txt to have the bug number (you'd just need to commit >>>>>> it >>>>>> afterwards). It's pretty easy to for the script to clip out the >>>>>> comments, >>>>>> but I'm reluctant to, because the script may be over/under zealous in >>>>>> what >>>>>> comments it associates with a layout test. Maybe I can add that >>>>>> behavior as >>>>>> another command-line flag. >>>>>> >>>>>> >>>>>>> >>>>>>> - Pam >>>>>>> >>>>>>> >>>>>>>> I think it would be good to get the script checked in first and then >>>>>>>> run it on the existing test_expectations.txt file. >>>>>>>> >>>>>>>> Ojan >>>>>>>> >>>>>>>> >>>>>>>> On Thu, Apr 9, 2009 at 6:42 PM, Glenn Wilson <gwil...@google.com> >>>>>>>> wrote: >>>>>>>> >>>>>>>>> Hi Pam & Ojan, >>>>>>>>> I wrote a script that would extract all of the layout tests from >>>>>>>>> test_expectations.txt that we haven't marked as WONTFIX and don't >>>>>>>>> have a bug >>>>>>>>> number. I also tried a simple heuristic to get the context of the >>>>>>>>> layout >>>>>>>>> test via nearby comments....it's not perfect, and I'll have to change >>>>>>>>> some >>>>>>>>> of them by hand, but many of the merge comments are getting picked up. >>>>>>>>> >>>>>>>>> I've also hooked up our library for creating demetrius bugs, so >>>>>>>>> getting bugs made for these should be a matter of running the script >>>>>>>>> with >>>>>>>>> different arguments (I hope). >>>>>>>>> >>>>>>>>> What are your thoughts? Is these as descriptive/accurate as we >>>>>>>>> need? Is 200+ bugs too many? >>>>>>>>> >>>>>>>>> Thanks! >>>>>>>>> Glenn >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>> >>>>> >>>> >>> >>> >>> >> >> >> >> > --~--~---------~--~----~------------~-------~--~----~ Chromium Developers mailing list: chromium-dev@googlegroups.com View archives, change email options, or unsubscribe: http://groups.google.com/group/chromium-dev -~----------~----~----~----~------~----~------~--~---