Lars Eggert wrote:

I've made a small change locally that (1) throws out any non-word characters from the name (\W) and (2) throws out any single-character words from the name. This seems to produce much better matches.

In the example above, it would search for "dvd fellowship ext" and find it, instead of searching for "dvd [fellowship ext d 1]".
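The two filtering steps can be sketched roughly as follows. This is only an illustration in Python, not the actual local change; the function name clean_query is made up for the example, and the input string is the one quoted above.

```python
import re

def clean_query(name: str) -> str:
    """Hypothetical helper: keep only the significant words of a name."""
    # (1) Replace every non-word character (\W) with a space,
    #     then split on whitespace to get the individual words.
    words = re.sub(r"\W", " ", name).split()
    # (2) Throw out any single-character words.
    return " ".join(w for w in words if len(w) > 1)

print(clean_query("dvd [fellowship ext d 1]"))  # prints: dvd fellowship ext
```

Note that \W treats the underscore as a word character, so names containing underscores would need an extra substitution.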

I should add that the idea here is to feed IMDb more significant words for searching, instead of interpreting the matches it returns. Their search algorithm doesn't seem to be too smart about weighting terms.

For the same reason, it may make sense to strip other common short words (in, the, for, not, a, an, of, etc.) from the search string.
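A minimal sketch of that idea, again in Python and purely illustrative: the STOP_WORDS set holds just the words listed above (the "etc." is left open), and strip_stop_words is a made-up name, not part of any existing code.

```python
# Assumed stop-word list: only the words named in this message.
STOP_WORDS = {"in", "the", "for", "not", "a", "an", "of"}

def strip_stop_words(query: str) -> str:
    """Hypothetical helper: drop common short words from a search string."""
    # Compare case-insensitively so "The" and "the" are both removed.
    kept = [w for w in query.split() if w.lower() not in STOP_WORDS]
    return " ".join(kept)

print(strip_stop_words("the fellowship of the ring"))  # prints: fellowship ring
```

One caveat with this approach: a title made up entirely of such words would be reduced to an empty string, so a real version would probably want to fall back to the unfiltered name in that case.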

Lars
--
Lars Eggert <[EMAIL PROTECTED]>           USC Information Sciences Institute
