Don't throw away the stopwords? :-) Lucene can't score something it doesn't know exists. I suppose you could try to get fancy with payloads, attaching one to a token wherever a stopword next to it was dropped, but I am just thinking out loud there.
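
To make the first point concrete, here is a toy Python sketch. It is not Lucene code and makes several assumptions: the analyzer is a plain lowercase/whitespace split, token positions are ignored, and the scores 1.0 and 2.0 are arbitrary weights rather than anything Lucene computes. It models two views of the same text, one with stopwords removed and one with them kept, and adds a bonus when the user's phrase matches the view that kept them.

```python
STOPWORDS = {"the"}  # assumption: "the" is the only stopword, as in the question


def analyze(text, drop_stopwords):
    """Toy analyzer: lowercase, split on whitespace, optionally drop stopwords."""
    tokens = text.lower().split()
    if drop_stopwords:
        tokens = [t for t in tokens if t not in STOPWORDS]
    return tokens


def contains_phrase(tokens, phrase):
    """True if `phrase` occurs as a contiguous run inside `tokens`."""
    n = len(phrase)
    return any(tokens[i:i + n] == phrase for i in range(len(tokens) - n + 1))


def score(doc_text, user_query):
    """Base score for a match without stopwords, bonus for a match with them."""
    total = 0.0
    # View 1: stopwords removed. Documents A and B look identical here.
    if contains_phrase(analyze(doc_text, True), analyze(user_query, True)):
        total += 1.0
    # View 2: stopwords kept. Only a document that really contains "the" matches.
    if contains_phrase(analyze(doc_text, False), analyze(user_query, False)):
        total += 2.0  # arbitrary boost, chosen only for illustration
    return total


docs = {"A": "search the database", "B": "search database"}
scores = {name: score(text, "search the database") for name, text in docs.items()}
ranked = sorted(scores, key=scores.get, reverse=True)
print(scores, ranked)
```

In Lucene terms this would roughly correspond to indexing the content a second time into a field whose analyzer keeps stopwords, and querying both, for example content:"search database"~1 content_exact:"search the database"^2. The field name content_exact and the boost value are hypothetical; the original thread does not prescribe this setup.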

On Mar 21, 2008, at 9:20 PM, Chris Lu wrote:

Let's say "the" is considered a stopword, and take two documents as an example:
document A, content: "... search the database..."
document B, content: "... search database..."

So when the user's input is "search the database", searching with the
query content:"search database"~1 returns both documents.
But is there any way to translate that input into a query that ranks
document A higher than document B?

Thanks!

--
Chris Lu
-------------------------
Instant Scalable Full-Text Search On Any Database/Application
site: http://www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes:
http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes
DBSight customer, a shopping comparison site, (anonymous per request)
got 2.6 Million Euro funding!

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


--------------------------
Grant Ingersoll
http://www.lucenebootcamp.com
Next Training: April 7, 2008 at ApacheCon Europe in Amsterdam

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ





