Robert,

>======================================
>
>We are having problems making Verity search like the documentation says.
>
>Using CFSERACH with type=simple (and case-insensitive), we find the
>following:
>
>- searching for the phrase, american painting, returns all records with
>either american or painting
>- searching for american AND painting returns only records with both
>- searching for "american painting" (with quotes) returns no rows
>
>According to the documentation, the first case should have returned only
>rows with the phrase "american painting" in it. I have confirmed that there
>are 4 records in the collection that have that phrase, so it should be
>returning something.
>
>======================================

What is the exact <cfsearch /> tag syntax you're using?

I do think that a simple unquoted string is treated more like a <near>
search, rather than an exact quote. So if the words are in close proximity,
they will also find matches. 


The docs also state that the default behavior of the "simple" search type
includes the <stem> and <many> operators:

http://livedocs.macromedia.com/coldfusion/7/htmldocs/00001324.htm#wp1212738

<quote>
Stemming in simple queries

By default, Verity interprets words in a simple query as if you entered the
STEM operator (and MANY modifier). The STEM operator searches for words that
derive from a common stem. For example, a search for instructional returns
files that contain instruct, instructs, instructions, and so on.
</quote>

This is why the simple search for american painting (without the quotes)
pulls back derivatives of the word "american" and "painting".

<quote>
The STEM operator works on words, not word fragments. A search for
"instrument" returns documents containing "instrument," "instruments,"
"instrumental," and "instrumentation," whereas a search for "instru" does
not. (A wildcard search for instru* returns documents with these words, and
also those with instruct, instructional, and so on.)

Note: The MANY modifier presents the files returned in the search as a list
based on a relevancy score. A file with more occurrences of the search word
has a higher score than a file with fewer occurrences. As a result, the
search engine ranks files according to word density as it searches for the
word that you specify, as well as words that have the same stem. For more
information on the MANY modifier, see Modifiers.
</quote>

-Dan


~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|
Introducing the Fusion Authority Quarterly Update. 80 pages of hard-hitting,
up-to-date ColdFusion information by your peers, delivered to your door four 
times a year.
http://www.fusionauthority.com/quarterly

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:254233
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: 
http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.4

Reply via email to