Ok, I got it, thank you very much. On Mon, Jul 16, 2018 at 6:25 PM Erick Erickson <erickerick...@gmail.com> wrote:
> Terms are already sorted when you use TermsComponent. So you fetch the > first 1,000 from each > field and compare... if you're starting with the same prefix for both > fields the lists should be > comparable in a straightforward manner. > > Best, > Erick > > On Mon, Jul 16, 2018 at 9:10 AM, Vincenzo D'Amore <v.dam...@gmail.com> > wrote: > > Hi Alexandre, well... you're right. Sooner or later I had to create a > > collection with synthetic data where run my test. > > > > Well I have SolrCloud, I'm curious, could you please suggest me an > example > > with the streaming expression you're talking? > > > > On Mon, Jul 16, 2018 at 4:50 PM Alexandre Rafalovitch < > arafa...@gmail.com> > > wrote: > > > >> For the test, can't you just use synthetic data where you know the terms > >> from the start? > >> > >> Otherwise maybe something from streaming expressions will help, but it > >> needs SolrCloud. > >> > >> Regards, > >> Alex > >> > >> On Mon, Jul 16, 2018, 10:22 AM Vincenzo D'Amore, <v.dam...@gmail.com> > >> wrote: > >> > >> > Hi all, > >> > > >> > I have a question for you, Solr Gurus :) > >> > > >> > there is an index where there are two fields: short_title and > long_title. > >> > As the field names suggest, this two fields are very similar, the long > >> > title has just more terms in it. > >> > > >> > So, looking at all the documents I have in the index, I would like to > >> > extract all the terms that are present in the long_title title only. > >> > > >> > Could you suggest me, if it is possibile, how to figure out from this > >> > problem? > >> > > >> > I've tried with the term component, and it should return all the terms > >> > present in a field but what happens when I have millions of terms? > >> > > >> > I thought to use the termcomponent or luke, but the only doable way > I've > >> > found is download the entire list of terms present in both the fields > and > >> > remove a term that is present in both the lists. > >> > > >> > I need this because I would like to write a test that try few terms > >> present > >> > only in the long_title. > >> > > >> > Thanks for your time, > >> > Vincenzo > >> > > >> > -- > >> > Vincenzo D'Amore > >> > > >> > > > > > > -- > > Vincenzo D'Amore > -- Vincenzo D'Amore