cost of proximity question

2004-11-15 Thread Anson Lau
Hi all, Does anyone know what's the performance cost of a Nutch like proximity query that looks like this: (+Hello +World +\Hello world\~p^a)x ? or just how in general how much processing does proximity add to a query? Thanks, Anson

RE: Lucene vs. MySQL Full-Text

2004-07-21 Thread Anson Lau
Depending on what MySQL Full-text search support you probably will lose some of the advance things you get for free from Lucene, such as proximity search, wildcard search, search term and search field boosting, scoring of the documents, etc. Afterall it depends on what you need to do. In our dev

RE: speeding up lucene search

2004-07-21 Thread Anson Lau
:43 PM To: Lucene Users List Subject: Re: speeding up lucene search In general, yes. By splitting up a large index into smaller indicies, you are linearizing the search time. Furthermore, that allows you to make your search distributable. -John On Wed, 21 Jul 2004 13:00:28 +1000, Anson Lau [EMAIL

RE: Weighting database fields

2004-07-21 Thread Anson Lau
] Sent: Thursday, July 22, 2004 12:52 AM To: Lucene Users List Subject: Re: Weighting database fields On Jul 21, 2004, at 10:09 AM, Anson Lau wrote: Apply boost factor to fields when you do a lucene search. Or... set the boost on the Field during indexing. Erik Anson -Original

speeding up lucene search

2004-07-20 Thread Anson Lau
Hello guys, What are some general techniques to make lucene search faster? I'm thinking about splitting up the index. My current index has approx 1.8 million documents (small documents) and index size is about 550MB. Am I likely to get much gain out of splitting it up and use a

RE: Scoring without normalization!

2004-07-14 Thread Anson Lau
If you don't mind hacking the source: In Hits.java In method getMoreDocs() // Comment out the following //float scoreNorm = 1.0f; //if (length 0 scoreDocs[0].score 1.0f) { // scoreNorm = 1.0f / scoreDocs[0].score; //} // And just set scoreNorm to 1. int

Re: Pool of IndexReaders or Pool of Searchers?

2004-07-11 Thread Anson Lau
Hi, When I did some load testing on a lucene powered search app, using a pool of index searchers doesn't give me any more search per second than just using a singleton index searcher. Anson Quoting [EMAIL PROTECTED]: Hi, I have multiple threads reading an index. Should they all be using

RE: best ways of using IndexSearcher

2004-06-29 Thread Anson Lau
--- Anson Lau [EMAIL PROTECTED] wrote: Hi Guys, What's the recommended way of using IndexSearcher? Should IndexSearcher be a singleton or pooled? Would pooling provide a more scalable solution by allowing you to decide how many IndexSearcher to use based on say how many CPU u have

best ways of using IndexSearcher

2004-06-28 Thread Anson Lau
Hi Guys, What's the recommended way of using IndexSearcher? Should IndexSearcher be a singleton or pooled? Would pooling provide a more scalable solution by allowing you to decide how many IndexSearcher to use based on say how many CPU u have on ur server? Thanks, Anson

RE: using boost factor

2004-06-23 Thread Anson Lau
I hope this helps. Somebody else will probably be able to give more information, but this should get you started while you wait. Otis --- Anson Lau [EMAIL PROTECTED] wrote: Hi guys, Lets say I want to search the term hello world over 3 fields with different boost: ((hello:field1

RE: a list of matching search term

2004-06-02 Thread Anson Lau
Thanks Erik I'll give that a try. Anson -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Wednesday, June 02, 2004 7:28 PM To: Lucene Users List Subject: Re: a list of matching search term On Jun 1, 2004, at 9:19 PM, Anson Lau wrote: Further to my previous email

a list of matching search term

2004-06-01 Thread Anson Lau
Hi All, Eg. Lets say someone do a search on the terms 'apple orange banana'. In the search results, is it possible to find out for each hit, which of those terms did match? Ie. The document with the highest score has all three words so the matching terms are all of those words. A lesser

RE: a list of matching search term

2004-06-01 Thread Anson Lau
Further to my previous email: The highlighter package should be able to pick up the matching search terms. Can some experienced highlighter package users tell me if I should look down that line? Thanks a lot. Anson -Original Message- From: Anson Lau [mailto:[EMAIL PROTECTED] Sent

field boost factor

2004-05-14 Thread Anson Lau
Hi all, Is it possible to set different boost factor to different fields when you do a search, rather than when you index? Thanks, Anson - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL

RE: field boost factor

2004-05-14 Thread Anson Lau
I think I found it in Query API... Thanks, Anson -Original Message- From: Anson Lau [mailto:[EMAIL PROTECTED] Sent: Friday, May 14, 2004 4:27 PM To: [EMAIL PROTECTED] Subject: field boost factor Hi all, Is it possible to set different boost factor to different fields when you do

looking for developer

2004-03-28 Thread Anson Lau
Hi All, Our company is looking for 2 java developer with strong Lucene experience to do some contract work. We're in Sydney, Australia. If anyone is interested plesaes email me direct ([EMAIL PROTECTED]). Thanks, Anson - To

RE: looking for developer

2004-03-28 Thread Anson Lau
bites maybe we can talk about telecommuting the occasional day trip? Esmond Pitt FACS 0400 139 869 - Original Message - From: Anson Lau [EMAIL PROTECTED] To: [EMAIL PROTECTED] Sent: Monday, March 29, 2004 12:23 AM Subject: looking for developer Hi All, Our company is looking for 2

RE: looking for developer

2004-03-28 Thread Anson Lau
Ops - sorry should reply to Esmond direct. Pls ignore the previous msg. Anson -Original Message- From: Anson Lau [mailto:[EMAIL PROTECTED] Sent: Monday, March 29, 2004 1:29 PM To: 'Lucene Users List' Subject: RE: looking for developer Esmond, Thanks a lot for your email

RE : Lucene scalability/clustering

2004-02-24 Thread Anson Lau
RBP, I'm implementing a search engine for a project at work. It's going to index approx 1.5 rows in a database. I am trying to get a feel of what my options are when scalability becomes an issue. I also want to know if those options require me to implement my app in a different way right from

RE: RE : Lucene scalability/clustering

2004-02-23 Thread Anson Lau
I'm trying to see what are some common ways to scale lucene onto multiple boxes. Is RMI based search and using a MultiSearcher the general approach? There doesn't seem to be many articles on the web on how to implement a lucene search cluster. If anyone knows a good article can you please post

RE: Lucene scalability/clustering

2004-02-22 Thread Anson Lau
Further on this topic - has anyone tried implementing a distributed search with Lucene? How does it work and does it work well? Anson -Original Message- From: Hamish Carpenter [mailto:[EMAIL PROTECTED] Sent: Monday, February 23, 2004 5:24 AM To: Lucene Users List Subject: Re: Lucene

number of fields, size of fields

2004-02-17 Thread Anson Lau
Hi All, I'm a beginner with Lucene. I would like to know if there are general guidelines on: 1. the number of field a document can have 2. size of unindexed fields 3. size of a stored text field I just want to get a feel for what are the good practices. Thanks, Anson Lau