Re: Help with new Join Functionallity in Solr 4.0

Erick Erickson Mon, 24 Sep 2012 04:51:35 -0700

NP, good luck!


On Sun, Sep 23, 2012 at 3:41 PM,  <milen.ti...@materna.de> wrote:
> Hello Erick,
>
> Thanks a lot for your reply! Your suggestion is actually exactly the 
> alternative solution we are thinking about and with your clarification on 
> Solr's performance we are going to go for it! Many thanks again!
>
> Milen
>
> ________________________________________
> Von: Erick Erickson [erickerick...@gmail.com]
> Gesendet: Sonntag, 23. September 2012 17:50
> An: solr-user@lucene.apache.org
> Betreff: Re: Help with new Join Functionallity in Solr 4.0
>
> The very first thing to try is flatten your data so you don't have to use 
> joins.
> I know that goes against your database instincts, but Solr easily handles
> millions and millions of documents. So if the cross-product of docs and 
> modules
> isn't prohibitive, that's what I'd do first. Then it's just a matter of
> forming a search without joins....
>
> Joins run into performance issues when the join field has many unique
> values, unfortunately the field people often want to join on is something
> like a <uniqueKey> (or PK in RDBMS terms), so be aware of that.
>
> Best
> Erick
>
> On Fri, Sep 21, 2012 at 5:46 AM,  <milen.ti...@materna.de> wrote:
>> Dear Solr community,
>>
>> I am rather new to Solr, however I already find it kind of attractive. We 
>> are developing a research application, which contains a Solr index with 
>> three different kinds of documents, here the basic idea:
>>
>>
>> -          A document of type "doc" consisting of fields id, docid, doctitle 
>> and some other metadata
>>
>> -          A document of type "module" consisting of fields id, modid and 
>> text
>>
>> -          A document of type "docmodule" consisting of fields id, docrefid, 
>> modrefid and some metadata about the relation between a document and a 
>> module; filed docrefid refers to the id of a "doc" document, while field 
>> modrefid contains the id of a "module" document
>>
>> In other words, in our model there are documents (type "doc") consisting of 
>> several modules and there is some characterization of each link between a 
>> document and a module.
>>
>> Almost all fields of a "doc" document are searchable, as well as the text of 
>> a module and the metadata of the "docmodule" entries.
>>
>> We are looking for a fast way to retrieve all modules containing a certain 
>> text and associated with a given document, preferably with a single query. 
>> This means we want to query the text from a "module" document while we set a 
>> restriction on the docrefid from a "docmodule" or the id from a "doc" 
>> document. Is this possible by means of the new pseudo joins? Any ideas are 
>> highly appreciated!
>>
>> Thanks in advance!
>>
>> Milen Tilev
>> Master of Science
>> Softwareentwickler
>> Business Unit Information
>> ________________________________________________
>>
>> MATERNA GmbH
>> Information & Communications
>>
>> Voßkuhle 37
>> 44141 Dortmund
>> Deutschland
>>
>> Telefon: +49 231 5599-8257
>> Fax: +49 231 5599-98257
>> E-Mail: milen.ti...@materna.de<mailto:milen.ti...@materna.de>
>>
>> www.materna.de<http://www.materna.de/> | 
>> Newsletter<http://www.materna.de/newsletter> | 
>> Twitter<http://twitter.com/MATERNA_GmbH> | 
>> XING<http://www.xing.com/companies/MATERNAGMBH> | 
>> Facebook<http://www.facebook.com/maternagmbh>
>> ________________________________________________
>>
>> Sitz der MATERNA GmbH: Voßkuhle 37, 44141 Dortmund
>> Geschäftsführer: Dr. Winfried Materna, Helmut an de Meulen, Ralph Hartwig
>> Amtsgericht Dortmund HRB 5839
>>

Re: Help with new Join Functionallity in Solr 4.0

Reply via email to