Hi, I've set up an experimental Sonar server at http://sonar.zitting.name/ with Tika quality metrics at http://sonar.zitting.name/project/index/3338.
Overall it looks like we are in a reasonably good shape, but the reports do highlight some areas that we should look at in more detail. To improve the report quality we'll need to clean up or exclude some obvious issues like the way the magic number warnings go crazy with the ICU4J charset patterns. BR, Jukka Zitting