What version of Solr? In Solr 8.2 there will be a tool to facilitate this kind of analysis - see SOLR-13512. In the meantime, if you’re on Solr 8.x you should be able to easily back port this change to your version (7x should be possible too, but with more changes).
> On 1 Jul 2019, at 11:23, Rob Audenaerde <rob.audenae...@gmail.com> wrote: > > Hello, > > We are currently trying to investigate an issue where in the index-size is > disproportionally large for the number of documents. We see that the .fdt > file is more than 10 times the regular size. > > Reading the docs, I found that this file contains the fielddata. > > I would like to find the documents and/or field names/contents with extreme > sizes, so we can delete those from the index without needing to re-index > all data. > > What would be the best approach for this? > > Thanks, > Rob Audenaerde --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org