David Balmain wrote:
> On 9/6/06, Kent Sibilev <[EMAIL PROTECTED]> wrote:
>> end of a token. Does it mean that '_' character is considered
>> alphanumeric?
>
> irb(main):001:0> require 'rubygems'
> irb(main):002:0> require 'ferret'
> irb(main):004:0> i = Ferret::I.new(:analyzer =>
> Ferret::Analysis::AsciiLetterAnalyzer.new)
> irb(main):005:0> i << "test_case"
> irb(main):006:0> i.search("case")
> => #<struct Ferret::Search::TopDocs total_hits=1, hits=[#<struct
> Ferret::Search::Hit doc=0, score=0.191783010959625>],
> max_score=0.191783010959625>
> irb(main):007:0>
>
> So no, '_' is not considered alphanumeric (or in this case alpha, as
> AsciiLetterAnalyzer won't match numbers)
>
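To illustrate the behaviour in the transcript above, here is a rough plain-Ruby sketch of letter-only tokenizing. It is not Ferret's tokenizer: `ascii_letter_tokens` is a hypothetical helper, and the regular expression is only an assumption about what "ASCII letters only" means. It shows why a search for "case" can match a document containing "test_case".

```ruby
# Hypothetical stand-in for a letter-only tokenizer (NOT part of Ferret).
# Every maximal run of ASCII letters becomes one token; any other
# character, including '_' and digits, acts as a token boundary.
def ascii_letter_tokens(text)
  text.scan(/[A-Za-z]+/)
end

# '_' is not a letter, so "test_case" yields two separate tokens.
p ascii_letter_tokens("test_case")   # => ["test", "case"]
```

Under this assumption "case" is indexed as a token of its own, which is consistent with the single hit reported in the irb session.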
Yes. It seems to work correctly, but I've noticed that index.search_each
doesn't return more than 10 documents. Is there an option to change it?

>> you initialize three VALUE objects but never mark them for garbage
>> collector. Eventually they are being freed behind my back. What you
>> should do is to keep the type of the stream in TokenStream structure and
>> rework frt_ts_mark method.
>>
>> Hope that helps,
>> Kent
>
> Actually, frt_rets_mark already marks the three VALUE objects
> correctly. What would really help would be if you could give me an
> example script that segfaults. If you can do this I'll fix it and get
> a new gem out as soon as possible.
>

I guess I didn't look carefully at the code.

--
Posted via http://www.ruby-forum.com/.
_______________________________________________
Ferret-talk mailing list
Ferret-talk@rubyforge.org
http://rubyforge.org/mailman/listinfo/ferret-talk