Erick,

1. Yes, both indices are optimized. They use lucene_40 version for their Solr indices.
2. I discovered bloat in one of the indices, and I am trying to figure out what might have caused it. There are some schema changes on the bloated one, but I was wondering if there is a way to identify some smoking guns purely by using Luke or by analyzing the .pos files.
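For reference, this is roughly how the two index directories can be compared on disk. It is only a minimal sketch: it sums file sizes per extension (.doc, .pay, .pos, and so on) and reports the difference. It does not parse the .pos format itself, and the function names and directory arguments are hypothetical, not part of Solr or Luke.

```python
import os
from collections import defaultdict


def sizes_by_extension(index_dir):
    """Sum on-disk bytes per file extension in one index directory."""
    totals = defaultdict(int)
    for name in os.listdir(index_dir):
        path = os.path.join(index_dir, name)
        if os.path.isfile(path):
            # Files without an extension (e.g. segments_N) are keyed by name.
            ext = os.path.splitext(name)[1] or name
            totals[ext] += os.path.getsize(path)
    return dict(totals)


def compare_indices(dir_a, dir_b):
    """Return (extension, bytes_in_a, bytes_in_b, b_minus_a) rows, sorted by extension."""
    a = sizes_by_extension(dir_a)
    b = sizes_by_extension(dir_b)
    rows = []
    for ext in sorted(set(a) | set(b)):
        size_a, size_b = a.get(ext, 0), b.get(ext, 0)
        rows.append((ext, size_a, size_b, size_b - size_a))
    return rows
```

Calling `compare_indices(path_to_old_index, path_to_new_index)` on the two optimized indices gives one row per extension, which makes it easy to confirm that only the .pos total diverges.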
Thanks,
Nitin

On Mon, Jul 11, 2016 at 9:52 AM Erick Erickson <erickerick...@gmail.com> wrote:

> Why do you care? Is there a specific problem you're trying
> to diagnose? Since the merge logic wasn't identical
> (almost guaranteed), the sizes of the files are unreliable
> since they include deleted documents which are compacted
> on merging.
>
> So, you can try an optimize and compare immediately
> afterwards. That should give you a general sense.
> Otherwise, be absolutely sure that the schema definitions
> haven't changed. What versions of Solr? Be sure
> DocValues isn't different (that's recently become a
> default, and you haven't told us _which_ versions of
> Solr you're comparing).
>
> Best,
> Erick
>
> On Sun, Jul 10, 2016 at 9:27 PM, KNitin <nitin.t...@gmail.com> wrote:
> > Hi,
> >
> > I am trying to diff between 2 versions of solr index. Both the indices
> > have similar .doc, .pay file sizes but their .pos files are extremely
> > different. How do i dig deeper to understand what could be causing this
> > difference? Is there a way to just open/analyze .pos file/compare 2 .pos
> > files?
> >
> > Thanks in advance,
> > Nitin