David Balmain wrote:
> The core of ferret is character encoding agnostic. It treats all
> strings as an array of bytes so it doesn't matter what you put in. You
> could store JPEGs in the index if you wanted to.

On which subject, I happen to have chucked some bmp files into my index, and was really quite amazed to see them being returned in search results. Not only that, but the results were accurate.

For example, if I have a bmp which contains the word "Sheep" (when viewed as an image) and I search the index for "Sheep", the bmp is returned.

I am adding documents using the standard analyser and file.readlines to add the contents. If I open the bmp in a text editor and search for "Sheep", that word is not contained within the file.

So how come ferret can read the bmp?

Cheers,
Steven
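
A text editor can hide or mangle bytes in a binary file, so one way to double-check whether the word really is absent is to scan the raw bytes directly. Below is a minimal Ruby sketch of that check. It assumes nothing about Ferret's internals and does not reproduce what the standard analyser does; `ascii_words` and `contains_term?` are hypothetical helper names made up for illustration, and the sample bytes are invented rather than taken from a real bmp.

```ruby
# Hypothetical helper (not part of Ferret): collect the runs of ASCII
# letters found in raw bytes, lowercased and de-duplicated. This is only a
# rough stand-in for what a letter-based tokenizer might pull out of
# binary data.
def ascii_words(bytes, min_length = 3)
  bytes.dup.force_encoding(Encoding::BINARY)
       .scan(/[A-Za-z]{#{min_length},}/n)
       .map(&:downcase)
       .uniq
end

# Hypothetical helper: true if the term occurs anywhere in the raw bytes,
# ignoring ASCII case.
def contains_term?(bytes, term)
  haystack = bytes.dup.force_encoding(Encoding::BINARY).downcase
  haystack.include?(term.downcase)
end

# Invented sample data standing in for file contents. For a real file,
# read it in binary mode instead, e.g. File.binread("some.bmp").
sample = "BM\x00\x00Sheep\xFF\x01Wool".b

p ascii_words(sample)             # letter runs of length >= 3
p contains_term?(sample, "sheep") # case-insensitive byte search
```

If `contains_term?` returns false for the real file, the match is probably coming from somewhere other than the file contents field, for example a file name or path stored alongside it in the same document; that is a guess worth checking rather than a diagnosis.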

