David Balmain wrote:
> The core of ferret is character encoding agnostic. It treats all
> strings as an array of bytes so it doesn't matter what you put in. You
> could store JPEGs in the index if you wanted to.

On which subject, I happen to have chucked some bmp files into my index, 
and was really quite amazed to see them being returned on search 
results. Not only that, but the results were accurate.

For example, if I have a bmp which contains the word "Sheep" (when 
viewed as an image) and I search the index for "Sheep" - the bmp is 
returned.

I am adding documents using the standard analyser and file.readlines to 
add the contents.

If I open the bmp in a text editor and search for "Sheep" - that word is 
not contained within the file.

So how come ferret can read the bmp?

Cheers,
Steven

-- 
Posted via http://www.ruby-forum.com/.
_______________________________________________
Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk

Reply via email to