Hi there,
I've been building a search platform for a custom database I have, using
Zend_Search_Lucene, and I've run into a couple of oddities.
The prototype version I have gets its index rebuilt from scratch
occasionally, currently approximately 3,000 documents. All of the
documents are being added in code, the input source from which
everything is being input is all UTF-8 XML files,
1) Occasionally I'll find that Luke 0.81 will report "read past EOF"
when searching for a term, and ZSL itself will report no matches, even
when I know the term is in the database (Luke shows it amongst the word
list).
2) Sometimes I'll get a bunch of PHP notices like the following:
PHP Notice: Uninitialized string offset: 628 in
C:\public_html\.common\Zend\Search\Lucene\Index\DictionaryLoader.php on
line 116, referer: [deleted]
PHP Notice: Uninitialized string offset: 629 in
C:\public_html\.common\Zend\Search\Lucene\Index\DictionaryLoader.php on
line 124, referer: [deleted]
PHP Notice: Uninitialized string offset: 630 in
C:\public_html\.common\Zend\Search\Lucene\Index\DictionaryLoader.php on
line 183, referer: [deleted]
PHP Notice: Uninitialized string offset: 631 in
C:\public_html\.common\Zend\Search\Lucene\Index\DictionaryLoader.php on
line 191, referer: [deleted]
PHP Notice: Uninitialized string offset: 632 in
C:\public_html\.common\Zend\Search\Lucene\Index\DictionaryLoader.php on
line 199, referer: [deleted]
PHP Notice: Uninitialized string offset: 633 in
C:\public_html\.common\Zend\Search\Lucene\Index\DictionaryLoader.php on
line 208, referer: [deleted]
PHP Notice: Uninitialized string offset: 634 in
C:\public_html\.common\Zend\Search\Lucene\Index\DictionaryLoader.php on
line 230, referer: [deleted]
The offsets are never the same, though.
Is it something I'm doing wrong?
Many thanks in advance,
Pete