All,

Some updates. First, please see Peter Wyatt's recent article on the refresh of the bug tracker corpus: https://twitter.com/PDFAssociation/status/1327237439732260865?s=20
* Successfully upgraded and rebooted the server.
* Finished running tika-eval's new FileProfile on the full corpus, and I've made the results available via datasette.
* Documented some useful queries in datasette: https://cwiki.apache.org/confluence/display/TIKA/TikaEvalDatasetteExamples
* Reported a bug with datasette (https://github.com/simonw/datasette/issues/1091). It looks like the base_url fix didn't work across all buttons, but it did get better.

Cheers,

Tim