DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG�
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=33702>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND�
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=33702





------- Additional Comments From [EMAIL PROTECTED]  2005-06-02 04:51 -------
solprovider wrote on the mailing list:

On 6/1/05, Gregor J. Rothfuss <[EMAIL PROTECTED]> wrote:

>> Michael Wechner wrote:
>
>>> > connect to a Search engine API in order to allow incremental indexing
>>> > of content being changed within Lenya.
>>> > Also refer to http://opensearch.a9.com/
>
>> agreed. we can bundle this into:
>> * port search to usecase framework
>> * switch crawler to nutch
>> * update index incrementally


Um, I already made search a usecase.  That was necessary to maintain
visitor language information.

This code is available at:
http://lenya.apache.org/1_2_x/how-to/search.html
The original, and comments about using on Linux are at:
http://solprovider.com/lenya/search

While the crawler is good if you want to index non-Lenya websites, it
is poor design for Lenya.  It feels like someone got lazy and used the
DefaultIndexer for HTML rather than use the ConfigurableIndexer for
true integration with Lenya, or maybe there are historical reasons
such as the ConfigurableIndexer not being available when Lucene was
first integrated.  I expect the website crawler to disappear, rather
than be a priority improvement.

My project is moving into production.  Incremental indexing and
integration with the CMS GUI are priorities.  I want the index to be
updated x minutes after updates are completed, delaying if another
save happens before it starts.  The indexer needs to be started from a
scheduler.  I need to integrate with the scheduler, and from the
number of issues about the scheduler in the mailing lists, may need to
revise it.

-- 
Configure bugmail: http://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to