[Zope-dev] Bug Or Feature in ZopeSplitter
Hi all, After playing with Zope 2.4.3 and TextIndexes i found out that the TextIndex does not Index Digits( 12345 ). Thatswhy i looked in the sources and i saw in next_word, that there is the caracter checked with isalnum, but in check_synstop the caracter is checked with isalpha? This means to me that every DIGIT is a stop-word and not a real character. If have changed the isalpha with isalnum and it seems to work... Can anyone explain this... thanks as ___ Zope-Dev maillist - [EMAIL PROTECTED] http://lists.zope.org/mailman/listinfo/zope-dev ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope )
Re: [Zope-dev] Bug Or Feature in ZopeSplitter
This is correct. The current splitter implementation ignores digits. You can make your own implementation that does not. Andre Schubert wrote: Hi all, After playing with Zope 2.4.3 and TextIndexes i found out that the TextIndex does not Index Digits( 12345 ). Thatswhy i looked in the sources and i saw in next_word, that there is the caracter checked with isalnum, but in check_synstop the caracter is checked with isalpha? This means to me that every DIGIT is a stop-word and not a real character. If have changed the isalpha with isalnum and it seems to work... Can anyone explain this... thanks as ___ Zope-Dev maillist - [EMAIL PROTECTED] http://lists.zope.org/mailman/listinfo/zope-dev ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope ) ___ Zope-Dev maillist - [EMAIL PROTECTED] http://lists.zope.org/mailman/listinfo/zope-dev ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope )
Re: [Zope-dev] Bug Or Feature in ZopeSplitter
I hope the changes I made will work. :) mfg as Chris McDonough schrieb: This is correct. The current splitter implementation ignores digits. You can make your own implementation that does not. Andre Schubert wrote: Hi all, After playing with Zope 2.4.3 and TextIndexes i found out that the TextIndex does not Index Digits( 12345 ). Thatswhy i looked in the sources and i saw in next_word, that there is the caracter checked with isalnum, but in check_synstop the caracter is checked with isalpha? This means to me that every DIGIT is a stop-word and not a real character. If have changed the isalpha with isalnum and it seems to work... Can anyone explain this... thanks as ___ Zope-Dev maillist - [EMAIL PROTECTED] http://lists.zope.org/mailman/listinfo/zope-dev ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope ) ___ Zope-Dev maillist - [EMAIL PROTECTED] http://lists.zope.org/mailman/listinfo/zope-dev ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope ) ___ Zope-Dev maillist - [EMAIL PROTECTED] http://lists.zope.org/mailman/listinfo/zope-dev ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope )