Searching Textile Documents

2005-11-23 Thread Alan Chandler
text is on a search page. However I can easily retrieve the article from the database given its ID. Would I therefore make the ID of the article a keyword, and the body of it unstored? and would I build a special space separated string of the (undetermined number of) categories and make t

Re: Searching Textile Documents

2005-11-23 Thread Alan Chandler
On Wednesday 23 Nov 2005 20:30, Erik Hatcher wrote: > On 23 Nov 2005, at 14:30, Alan Chandler wrote: > > 1) The Analyser > > First you'll have to spell it the US English way :) You mean yet another corruption of my language:-) I am still having trouble with color rather t

Re: Searching Textile Documents

2005-11-23 Thread Alan Chandler
s the sort field. Is there any way to limit the number of hits returned in the search to 5 (or does it not generate any storage overhead to return them all and just use the first 5) -- Alan Chandler http://www.chandlerfamily.org.uk Open Source. It's the diff

Question of strategy with respect to multiple field types within one piece of a document

2005-11-26 Thread Alan Chandler
o one of the field names (presumably of type keyword - field name "content") and using field name "urls" of type (? can't see how to set the field type to indexed, not stored, not tokenized) -- Alan Chandler http://www.chandlerfami

Getting Dates Back out of lucene

2005-12-06 Thread Alan Chandler
I added a date field to a document with doc.add(Field.keyword("A Date",myDate)); How do I get it back out again as a date? -- Alan Chandler http://www.chandlerfamily.org.uk Open Source. It's the difference between trus
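One way to picture the round trip asked about above is a minimal sketch, assuming the date is kept as a fixed-width UTC string so that string order matches date order. It uses only the JDK. The class name DateKeyword and the pattern are hypothetical, chosen for illustration, and this is not the encoding Lucene itself applies to a date keyword field. Lucene releases of this period also shipped a DateField helper class for converting between dates and strings, which is probably the more direct answer if it is available.

```java
import java.text.ParseException;
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.TimeZone;

// Hypothetical helper (not part of Lucene): stores a Date as a fixed-width,
// lexicographically sortable string and parses it back again.
final class DateKeyword {
    // Assumed layout: year, month, day, hour, minute, second, millisecond.
    private static final String PATTERN = "yyyyMMddHHmmssSSS";

    // SimpleDateFormat is not thread-safe, so build a fresh one per call.
    private static SimpleDateFormat newFormat() {
        SimpleDateFormat format = new SimpleDateFormat(PATTERN);
        format.setTimeZone(TimeZone.getTimeZone("UTC"));
        format.setLenient(false);
        return format;
    }

    // Turn a Date into the string that would be stored in the keyword field.
    static String encode(Date date) {
        return newFormat().format(date);
    }

    // Turn the stored string back into a Date.
    static Date decode(String stored) {
        try {
            return newFormat().parse(stored);
        } catch (ParseException e) {
            throw new IllegalArgumentException("Not a stored date: " + stored, e);
        }
    }
}
```

With this sketch the string returned by DateKeyword.encode(myDate) would be added as an ordinary keyword string, and DateKeyword.decode(doc.get("A Date")) would recover the date after a search.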

Re: Getting Dates Back out of lucene

2005-12-06 Thread Alan Chandler
On Tuesday 06 Dec 2005 10:10, Miles Barr wrote: > On Tue, 2005-12-06 at 09:35 +0000, Alan Chandler wrote: > > I added a date field to a document with > > > > doc.add(Field.keyword("A Date",myDate)); > > > > How do I get it back out again a

Is there any timing constraints with lucene.

2005-12-06 Thread Alan Chandler
I do this slowly it works, when it is running at full speed it doesn't. Is there any attempt to create some background thread to update the index that could cause the problem? -- Alan Chandler http://www.chandlerfamily.org.uk Open Source. It's the difference between

Re: Is there any timing constraints with lucene.

2005-12-06 Thread Alan Chandler
On Tuesday 06 Dec 2005 22:44, Alan Chandler wrote: > I have implemented a "Database" on top of lucene and am currently testing > it. > > When I manually step through my test suite in the debugger, it works fine. > > When I run it without the debugger it fails.

Confused about boolean query and how an IndexReader is associated with Hits

2005-12-06 Thread Alan Chandler
to me how hits are associated with a reader. Is it possible to close the reader and then re-open it later to access a document with the hit? -- Alan Chandler http://www.chandlerfamily.org.uk Open Source. It's the difference between trust and antitrust.

Re: Confused about boolean query and how an IndexReader is associated with Hits

2005-12-07 Thread Alan Chandler
Erik Hatcher writes: On Dec 7, 2005, at 2:38 AM, Alan Chandler wrote: Worse than that, when I attempt to access Hits.doc(0) I am getting an immediate IOException with the message "Bad file descriptor". I think ... You must keep your IndexSearcher instance alive and well wh

Re: Confused about boolean query and how an IndexReader is associated with Hits

2005-12-07 Thread Alan Chandler
Erik Hatcher writes: On Dec 7, 2005, at 7:06 AM, Alan Chandler wrote: Erik Hatcher writes: On Dec 7, 2005, at 2:38 AM, Alan Chandler wrote: Worse than that, when I attempt to access Hits.doc(0) I am getting an immediate IOException with the message "Bad file descriptor".

Re: Confused about boolean query and how an IndexReader is associated with Hits

2005-12-07 Thread Alan Chandler
Erik Hatcher writes: ... or use IndexReader to navigate to it. That is something I wanted to ask about IndexReader.termPositions(Term t) Returns an object which returns all occurrences of term. Is that what I use to find the actual position in my documents of the search item? -- Alan

Re: Confused about boolean query and how an IndexReader is associated with Hits

2005-12-07 Thread Alan Chandler
On Wednesday 07 Dec 2005 07:38, Alan Chandler wrote: > I am trying to construct, via individual query api, a query to search for > documents with a field name of "Category" and a value of either "Category1" > OR "Category2" (or both). > > My code to do

Re: Confused about boolean query and how an IndexReader is associated with Hits

2005-12-07 Thread Alan Chandler
ke this, filters can > be faster than BooleanQueries with negated clauses (especially if you know > that there will only ever be one of these config docs, you can short > circuit all the others) and they can be cached and reused, saving even more > time. I'll take a look - I

Re: Confused about boolean query and how an IndexReader is associated with Hits

2005-12-07 Thread Alan Chandler
assive sequence of building functionality on top of functionality. -- Alan Chandler http://www.chandlerfamily.org.uk Open Source. It's the difference between trust and antitrust.

Re: Confused about boolean query and how an IndexReader is associated with Hits

2005-12-07 Thread Alan Chandler
ave left an indexWriter open, or not re-opened the searcher after some updates. -- Alan Chandler http://www.chandlerfamily.org.uk Open Source. It's the difference between trust and antitrust.

Re: Confused about ... [SOLVED]

2005-12-08 Thread Alan Chandler
e to locate the problem in a simple typo of the querying I was doing. Dooh! -- Alan Chandler http://www.chandlerfamily.org.uk Open Source. It's the difference between trust and antitrust.

Confused again ... Getting at results

2005-12-09 Thread Alan Chandler
suspect it has something to do with FilterIndexReader and its nested class FilterTermPositions, but I can't see how to link these to the search I have just done. How is it done? -- Alan Chandler http://www.chandlerfamily.org.uk Open Source. It's the difference between trust and

Re: Confused again ... Getting at results

2005-12-09 Thread Alan Chandler
think that is doing something like this, so am going to experiment with that. Initial attempt failed to compile because I think it was assuming a later version of the lucene code, but it looks like I just cut out the offending class and it's ok. -- Alan Chandler http:

Re: Searching Textile Documents

2006-01-28 Thread Alan Chandler
there are a few rough edges to deal with. I have to say a database based on lucene seems very small (the directory with most of the current articles in was about 150K bytes) and it seems to run very fast. URL below is where it's running -- Alan Chandler http://www.chandlerfamily.org.uk