[Ferret-talk] indexing large tokens

Justin Kan Fri, 16 Jun 2006 16:15:16 -0700

Hi,

I'm using the StandardAnalyzer to build an index, and passing in Documents that have Fields containing large tokens (22+ characters) interspersed with normal English words. This seems to cause the IndexWriter to slow to a crawl. Is this a known issue, or am I doing something wrong?

If this is a known issue, I don't have any problem with simply not indexing tokens longer than a certain length, but what's the best way to eliminate them? Using a TokenFilter in my own Analyzer? Sorry for the newbish questions; I'm new to Ferret and have never used Lucene. Thanks in advance,

Justin
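
For illustration, the length-based filtering idea asked about above could look roughly like the sketch below. It is plain Ruby and deliberately does not call Ferret: it assumes only a token stream that hands out tokens one at a time through a `next` method and returns nil when exhausted. The names `SketchToken`, `ArrayTokenStream`, `MaxLengthFilter`, and `filtered_words` are hypothetical and are not part of Ferret's API; hooking such a filter into a real Analyzer would need to follow Ferret's own documentation.

```ruby
# Hypothetical stand-in for an analyzer token; only the text matters here.
SketchToken = Struct.new(:text)

# Hypothetical stand-in for a token stream: returns one token per call
# to #next, and nil once the input is exhausted.
class ArrayTokenStream
  def initialize(words)
    @tokens = words.map { |word| SketchToken.new(word) }
  end

  def next
    @tokens.shift
  end
end

# Wraps any stream that responds to #next and silently skips tokens
# whose text is longer than max_length characters.
class MaxLengthFilter
  def initialize(stream, max_length)
    @stream = stream
    @max_length = max_length
  end

  def next
    while (token = @stream.next)
      return token if token.text.length <= @max_length
    end
    nil # underlying stream is exhausted
  end
end

# Convenience helper for the sketch: split on whitespace, filter, and
# collect the surviving token texts.
def filtered_words(text, max_length)
  stream = MaxLengthFilter.new(ArrayTokenStream.new(text.split), max_length)
  words = []
  while (token = stream.next)
    words << token.text
  end
  words
end
```

With a cutoff of 21 characters, `filtered_words("quick a3f9c2e1b7d4a3f9c2e1b7d4 fox", 21)` would keep only the two short words. Wrapping the stream, rather than pre-processing the raw text, keeps the cutoff applied after tokenization, so it measures the same tokens the index would see.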

_______________________________________________
Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk
