Absolutely. Solr will return the reference along the docs/results; those
references may be used to look-up the actual stuff. Such use cases aren't
hard to solve.

If the use case demands returning the actual stuff alongside the results,
it becomes non-trivial, especially during high loads.

To avoid this and do a quick implementation I can judiciously create stored
fields and see how it performs. I will need to figure out what happens if
the volume growth of stored fields is high, how much is the disk I/O and
what happens if we shard the index, like, what happens to the stored fields
then.

Best,
Sourajit




On Tue, Jun 4, 2013 at 5:31 PM, Erick Erickson <erickerick...@gmail.com>wrote:

> You have to index something with your Solr documents that
> has meaning in _your_ system so you can find the
> original record. You don't search this field, you just
> return it with the search results and then use it to get
> the original document.
>
> If you're storing the original in a DB, this can be the PK.
> If on a file system the path. etc.
>
> Essentially, since the association is specific to your environment
> you need to handle it explicitly...
>
> Best
> Erick
>
> On Mon, Jun 3, 2013 at 11:56 AM, Sourajit Basak
> <sourajit.ba...@gmail.com> wrote:
> > Consider the following use case.
> >
> > Certain words are extracted from a document and indexed. The exact
> sentence
> > containing the word cannot be stored alongside the extracted word because
> > of the volume at which the documents grow; How can the index and, lets
> call
> > it doc servers be separated ?
> >
> > An option is to store the sentences in MongoDB or a RDBMS. But there
> seems
> > to be a schema level design issue. Assuming 'word' to be a multivalued
> > field, how do we associate to it a reference to the corresponding entry
> in
> > the doc server.
> >
> > May create (word_1, ref_1) tuples. Is there any other in-built feature ?
> >
> > Any related project which separates index & doc servers ?
> >
> > Thanks,
> > Sourajit
>

Reply via email to