Hi! Yes, I think this discussion is leading us to a good project structure. It is very important not to let heuristics be placed in the central algorithm. For that reason, all heuristics will be separated from the application by an interface. This will also make testing much easier.
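To illustrate the separation, here is a minimal sketch in Java. All names in it (SearchHeuristic, MinimumLengthHeuristic, FragmentChecker) are hypothetical and only show the idea. The real interface may need different methods once we know what the heuristics have to decide.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical contract that every heuristic implements (name is an assumption). */
interface SearchHeuristic {
    /** Returns true if the given source fragment looks worth checking further. */
    boolean isSuspicious(String sourceFragment);
}

/** Example heuristic (assumption): flags fragments with at least minLines lines. */
class MinimumLengthHeuristic implements SearchHeuristic {
    private final int minLines;

    MinimumLengthHeuristic(int minLines) {
        this.minLines = minLines;
    }

    @Override
    public boolean isSuspicious(String sourceFragment) {
        // Count lines, keeping trailing empty lines.
        return sourceFragment.split("\n", -1).length >= minLines;
    }
}

/** Central algorithm: it knows only the interface, never a concrete heuristic. */
class FragmentChecker {
    private final List<SearchHeuristic> heuristics;

    FragmentChecker(List<SearchHeuristic> heuristics) {
        this.heuristics = new ArrayList<>(heuristics);
    }

    /** Keeps every fragment that at least one heuristic flags, in input order. */
    List<String> selectSuspicious(List<String> fragments) {
        List<String> selected = new ArrayList<>();
        for (String fragment : fragments) {
            for (SearchHeuristic heuristic : heuristics) {
                if (heuristic.isSuspicious(fragment)) {
                    selected.add(fragment);
                    break; // one positive heuristic is enough
                }
            }
        }
        return selected;
    }
}
```

Because FragmentChecker depends only on the interface, a test can pass in a trivial stub heuristic (for example a lambda that always returns true) and check the central algorithm on its own.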
About modules: the important ones are a module for reading source files (something like DirectoryWalker from RAT) and a module for generating the report. I think the report can be generated so that the biggest matched source parts are at the beginning of the list (there will be a lot of false alarms in the report, but a well-structured document will be easier to read). Besides the source code, the report will probably also contain a link to the web page where similar code was found. Because of that, I am thinking of a simple HTML report.
Regards, Marija
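
A rough sketch of the report idea, in Java: matches are sorted so the biggest come first, and each entry shows the code plus a link to the page where similar code was found. The names MatchedPart and HtmlReport are hypothetical, and measuring "biggest" by line count is only an assumption. The URLs in any real report would come from the search step.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Hypothetical record of one matched source part (all fields are assumptions). */
class MatchedPart {
    final String sourceFile; // file in which the part was found
    final String code;       // the matched source text
    final String url;        // web page where similar code was found

    MatchedPart(String sourceFile, String code, String url) {
        this.sourceFile = sourceFile;
        this.code = code;
        this.url = url;
    }

    /** Size of the match, measured in lines (assumption: bigger means more relevant). */
    int lineCount() {
        return code.split("\n", -1).length;
    }
}

class HtmlReport {
    /** Escapes characters that have a special meaning in HTML. */
    static String escape(String text) {
        return text.replace("&", "&amp;")
                   .replace("<", "&lt;")
                   .replace(">", "&gt;")
                   .replace("\"", "&quot;");
    }

    /** Renders the matches as a simple HTML page, biggest match first. */
    static String render(List<MatchedPart> matches) {
        List<MatchedPart> sorted = new ArrayList<>(matches);
        sorted.sort(Comparator.comparingInt(MatchedPart::lineCount).reversed());

        StringBuilder html = new StringBuilder();
        html.append("<html><body><h1>Matched source parts</h1>\n");
        for (MatchedPart match : sorted) {
            html.append("<h2>").append(escape(match.sourceFile))
                .append(" (").append(match.lineCount()).append(" lines)</h2>\n");
            html.append("<pre>").append(escape(match.code)).append("</pre>\n");
            html.append("<p><a href=\"").append(escape(match.url)).append("\">")
                .append(escape(match.url)).append("</a></p>\n");
        }
        html.append("</body></html>\n");
        return html.toString();
    }
}
```

Sorting by size means a reader who stops after the first few entries has still seen the matches least likely to be false alarms.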
