Re: Lexicon access questions

2006-06-03 Thread eks dev
Thanks Chuck, I have to try it with example(s). Use case one: Documents: D1 == "John Doe", D2 == "sky scraper", D3 == "blue sky LTD". Imagine the name "John" is ultra frequent => low IDF weight, and "sky" is super low frequency => very high weight. So the query Q: "sky john" will give the order: D2, D3, D1
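The ordering in this use case can be reproduced with a small sketch. It is not Lucene's actual scoring code: it assumes a simplified score of tf * idf summed over query terms and divided by the square root of the document length, and the corpus size and document frequencies (NUM_DOCS, DOC_FREQ) are made-up numbers chosen only to make "john" very common and "sky" very rare, as the message asks us to imagine. The class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class IdfRankingSketch {
    // Assumed corpus statistics (not from the thread): "john" is in almost
    // every document, "sky" in almost none.
    static final long NUM_DOCS = 1_000_000L;
    static final Map<String, Long> DOC_FREQ = Map.of("john", 900_000L, "sky", 10L);

    // Smoothed inverse document frequency: rare terms get a large weight.
    static double idf(String term) {
        long df = DOC_FREQ.getOrDefault(term, 1L);
        return 1.0 + Math.log((double) NUM_DOCS / (df + 1));
    }

    // Simplified score: sum of tf * idf over the query terms,
    // normalized by the square root of the document length.
    static double score(String query, String doc) {
        List<String> docTerms = Arrays.asList(doc.toLowerCase().split("\\s+"));
        double sum = 0.0;
        for (String q : query.toLowerCase().split("\\s+")) {
            long tf = docTerms.stream().filter(q::equals).count();
            sum += tf * idf(q);
        }
        return sum / Math.sqrt(docTerms.size());
    }

    // Returns document ids sorted by descending score.
    static List<String> rank(String query, Map<String, String> docs) {
        List<String> ids = new ArrayList<>(docs.keySet());
        ids.sort((a, b) -> Double.compare(score(query, docs.get(b)),
                                          score(query, docs.get(a))));
        return ids;
    }

    // The three documents from the use case above.
    static Map<String, String> exampleDocs() {
        Map<String, String> docs = new LinkedHashMap<>();
        docs.put("D1", "John Doe");
        docs.put("D2", "sky scraper");
        docs.put("D3", "blue sky LTD");
        return docs;
    }

    public static void main(String[] args) {
        System.out.println(rank("sky john", exampleDocs()));
    }
}
```

Under these assumptions D2 and D3 both match the rare term "sky" and outrank D1, which matches only the common term "john"; D2 comes ahead of D3 because it is the shorter document.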

[jira] Reopened: (LUCENE-587) Explanation.toHtml outputs invalid HTML

2006-06-03 Thread Daniel Naber (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-587?page=all ] Daniel Naber reopened LUCENE-587: - Sorry, I think the fix is not correct. Output now starts with "", but "" must always be inside "". > Explanation.toHtml outputs invalid HTML >

Re: [jira] Reopened: (LUCENE-587) Explanation.toHtml outputs invalid HTML

2006-06-03 Thread Chris Hostetter
: Sorry, I think the fix is not correct. Output now starts with "", : but "" must always be inside "". Daniel, I must be misunderstanding your comment, because I can't see any situation in which Explanation.toHtml() can ever output an "" tag except immediately following a "" tag -- if anything,

IndexWriter multithreaded?

2006-06-03 Thread Robert Engels
I understand that there can only be a single active IndexWriter per Directory, but is the IndexWriter multithreaded (it seems to be)? It would appear that on a multiprocessor box using multiple threads to add documents from a queue would be much faster (due to the parallel tokenization)? Do I h

Structure of Lucene (OO Design)

2006-06-03 Thread Hayden Melton
Hi all, I am a PhD student at the University of Auckland, New Zealand. As part of my research I have performed an empirical study on a large corpus of open-source Java software. Several of the applications in the corpus (Ant, Tomcat, JMeter, POI) are from the Apache Software Foundation. If you ar