David Balmain wrote:
> On 9/6/06, Kent Sibilev <[EMAIL PROTECTED]> wrote:
>> end of a token. Does it mean that '_' character is considered
>> alphanumeric?
> 
> irb(main):001:0> require 'rubygems'
> irb(main):002:0> require 'ferret'
> irb(main):004:0> i = Ferret::I.new(:analyzer =>
> Ferret::Analysis::AsciiLetterAnalyzer.new)
> irb(main):005:0> i << "test_case"
> irb(main):006:0> i.search("case")
> => #<struct Ferret::Search::TopDocs total_hits=1, hits=[#<struct
> Ferret::Search::Hit doc=0, score=0.191783010959625>],
> max_score=0.191783010959625>
> irb(main):007:0>
> 
> So no, '_' is not considered alphanumeric (or in this case alpha, as
> AsciiLetterAnalyzer won't match numbers)
> 
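
For illustration, the result in the irb session above can be approximated in plain Ruby. This is only a sketch, assuming a letter-only analyzer emits maximal runs of ASCII letters and lowercases them. `ascii_letter_tokens` is a hypothetical helper, not part of Ferret's API.

```ruby
# Hypothetical helper, not part of Ferret: split text into maximal runs
# of ASCII letters, treating every other character (including "_" and
# digits) as a separator. Lowercasing is an assumption of this sketch.
def ascii_letter_tokens(text)
  text.scan(/[A-Za-z]+/).map(&:downcase)
end

p ascii_letter_tokens("test_case")  # "_" splits the input into two tokens
p ascii_letter_tokens("abc123def")  # digits also act as separators
```

Under that assumption "test_case" yields the separate tokens "test" and "case", which would explain why searching for "case" finds the document.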

Yes. It seems to work correctly, but I've noticed that index.search_each
doesn't return more than 10 documents. Is there an option to change it?


>> you initialize three VALUE objects but never mark them for the garbage
>> collector. Eventually they are being freed behind my back. What you
>> should do is to keep the type of the stream in TokenStream structure and
>> rework frt_ts_mark method.
>>
>> Hope that helps,
>> Kent
> 
> Actually, frt_rets_mark already marks the three VALUE objects
> correctly. What would really help would be if you could give me an
> example script that segfaults. If you can do this I'll fix it and get
> a new gem out as soon as possible.
> 

I guess I didn't look carefully at the code.

-- 
Posted via http://www.ruby-forum.com/.