On Mon, Jan 28, 2013 at 6:48 AM, Igor Kovacevic <igor.kovacev...@gmail.com> wrote: > > I'm a master student and I'm writing my thesis on bug triaging in open > source project and I wondering if I can access to a big part of the > bug repository, > if I can, how to do it ? > Writing a crawler/parser for bugzilla or something else? > I need 5 to 8 years of development.
As you know, we use bugzilla for GCC. I don't know what facilities bugzilla provides for analysis of the bug database. That would be something to ask the bugzilla developers about. I'm willing to provide you with a dump of gcc's bugzilla database if you can give me the exact command to run. It's fine to crawl the bugzilla web pages as long as you throttle your crawler. If you download pages as fast as you can we will wind up blocking you. Ian