Hi.

Comments inline.

On Fri, Apr 24, 2009 at 1:00 PM, Grant Ingersoll <gsing...@apache.org>wrote:

> How often are you updating the rank?


The goal is to optimize the pagerank calculating algorithm so can have
continuous updates (10000 blogs at a time 24/7) but more likely we'll end up
refreshing the index once a weeks or so (hopefully each night).


>
>
> You might also be able to keep the rank info in a flat file via the
> ExternalFileField and the FileFloatSource and do FunctionQuery stuff that
> way.   However, I don't know how that handles refreshing data or if it would
> be efficient in your case.


Great! That seems like something that could work. Depends on how that field
get's re-read/indexed I guess. Or is it used at query time solely ? I feel
that googling ExternalFileField does not really give the "meat" I need to
narrow this down. Any pointers and/or pseudo code ?

>
>
> On Apr 24, 2009, at 1:52 AM, Marcus Herou wrote:
>
>  Hi.
>>
>> I've posted before but here it goes again:
>>
>> I have BlogData data which is more or less 100% static but one field is
>> not
>> - the PageRank.
>> I would like to sort on that field and on the Lucene list I got these
>> answers.
>>
>> 1. Use two indexes and a ParallellReader
>> 2. Use a FieldScoreQuery containing the PageRank field.
>> 3. Use a CustomScoreQuery which uses the FieldScoreQuery combined with
>> other
>> Queries (the actual search).
>>
>> I think I could use this pattern as well:
>> 1. Use two indexes and a ParallellReader
>> 2. Normal search and Sort on the PageRank column (perhaps consuming more
>> memory)
>>
>> Anyone have an idea of howto implement these patterns in SOLR ?
>> I have never extended SOLR but am not afraid of doing so if someone pushes
>> me in the right direction.
>>
>> Kindly
>>
>> //Marcus
>>
>>
>>
>>
>> --
>> Marcus Herou CTO and co-founder Tailsweep AB
>> +46702561312
>> marcus.he...@tailsweep.com
>> http://www.tailsweep.com/
>> http://blogg.tailsweep.com/
>>
>
> --------------------------
> Grant Ingersoll
> http://www.lucidimagination.com/
>
> Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using
> Solr/Lucene:
> http://www.lucidimagination.com/search
>
>
It seems to be a generic issue with Lucene since it is not really built in
the way that one would plugin an external scoring mechanism (very fast
internal one instead) but hopefully I'll sort this one out.

Thanks for the reply, really apprecciated.

Kindly

//Marcus



-- 
Marcus Herou CTO and co-founder Tailsweep AB
+46702561312
marcus.he...@tailsweep.com
http://www.tailsweep.com/
http://blogg.tailsweep.com/

Reply via email to