Filtering results

2008-09-16 Thread ristretto . rb
Hello All, I'm looking for a way to filter results by some ranking mechanism. For example... Suppose you have 30 docs in an index, and they are in groups of 10, like this A, 1 A, 2 : A, 10 B, 1 B, 2 : B, 10 C, 1 C, 2 : C, 10 I would like to get 3 records back such that I get a single, best,

How to copy a solr index to another index with a different schema collapsing stored data?

2008-09-16 Thread ristretto . rb
Is it possible to copy stored index data from index to another, but concatenating it as you go. Suppose 2 categories A and B both with 20 docs, for a total of 40 docs in the index. The index has a stored field for the content from the docs. I want a new index with only two docs in it, one for A

Re: How to copy a solr index to another index with a different schema collapsing stored data?

2008-09-16 Thread ristretto . rb
is it possible to query out the stored data as, uh, tokens I suppose. Then, index those tokens in the next index? thanks gene On Wed, Sep 17, 2008 at 1:14 PM, Gene Campbell [EMAIL PROTECTED] wrote: I was pretty sure you'd say that. But, I means lots that you take the time to confirm it.

How to set term frequency given a term and a value stating the frequency?

2008-09-17 Thread ristretto . rb
Hello, I'm looking through the wiki, so if it's there, I'll find it, and you can ignore this post. If this isn't documented, can anyone explain how to achieve this? Suppose I have two docs A and B that I want to index. I want to index these documents so that A has the equivalent of 100 copies

how to find terms on a page?

2008-09-17 Thread ristretto . rb
Hello, I haven't heard of or found a way to find the number of times a term is found on a page. Lucene uses it in scoring, I believe, (solr scoring: http://tinyurl.com/4tb55r) Basically, for a given page, I would like a list of terms on the page and number of times the terms appear on the page?

Re: Filtering results

2008-09-18 Thread ristretto . rb
Otis, Would be reasonable to run a query like this http://localhost:8280/solr/select/?q=terms_xversion=2.2start=0rows=0indent=on 10 times, one for each result from an initial category query on a different index. So, it's still 1+10, but I'm not returning values. This would give me the number

Re: Filtering results

2008-09-18 Thread ristretto . rb
Thanks Otis for reply! Always appreciated! That is indeed what we are looking for implementing. But, I'm running out of time to prototype or experiment for this release. I'm going to run the two index thing for now, unless I find something saying is really easy and sensible to run one and

Are facet searches slower on large indexes?

2008-09-19 Thread ristretto . rb
Hello I'm doing a facet search like the following. The content field schema is tokenizer class=solr.HTMLStripStandardTokenizerFactory/ filter class=solr.StandardFilterFactory/ filter class=solr.StopFilterFactory ignoreCase=true

Re: solr on ubuntu 8.04

2008-10-02 Thread ristretto . rb
I had absolutely not luck with the jetty-solr package on Ubuntu 8.04. I haven't tried Tomcat for solr. I do have it running on Ubuntu though. Here's what I did. Hope this helps. Don't do this unless you understand the steps. When I say things like 'remove contents' I don't know what you have

Re: solr on ubuntu 8.04

2008-10-02 Thread ristretto . rb
before. Hope this helps. gene On Fri, Oct 3, 2008 at 10:14 AM, ristretto. rb [EMAIL PROTECTED] wrote: I had absolutely not luck with the jetty-solr package on Ubuntu 8.04. I haven't tried Tomcat for solr. I do have it running on Ubuntu though. Here's what I did. Hope this helps. Don't do

How are multivalued fields used?

2008-10-13 Thread ristretto . rb
How does one use of this field type. Forums, wiki, Lucene in Action, all coming up empty. If there's a doc somewhere please point me there. I use pysolr to index. But, that's not a requirement. I'm not sure how one adds multivalues to a document. And once added, if you want to remove one how