Re: [Catalyst] Re: [OT] Search Solution

Octavian Rasnita Fri, 09 Nov 2007 13:59:30 -0800

From: "Peter Karman" <[EMAIL PROTECTED]>

Do you know if it can index the html documents without parsing them with
other tools, or possibly other type of files like pdf, doc?
Xapian is a library. The related Omega project has support for parsingdocs of various formats.

Oh yes, Omega seems to be nice. Too bad it doesn't allow indexing theauto-generated web pages, but only the static content.

Do you have a recommendation for a good perl module that can be used easylyfor creating a spider that should index a web site?

Octavian


_______________________________________________
List: [email protected]
Listinfo: http://lists.scsys.co.uk/cgi-bin/mailman/listinfo/catalyst
Searchable archive: http://www.mail-archive.com/[EMAIL PROTECTED]/
Dev site: http://dev.catalyst.perl.org/

Re: [Catalyst] Re: [OT] Search Solution

Reply via email to