[Sean] > Has anyone here got any experience integrating django with lucence, > sphinx, or anything like that? > Any tips, recommendations? Im particularly interested in engines that > can index documents (word,excel,pdf, etc) as well as web pages.
Take a look at SOLR, which is a J2EE indexing server wrapped around Lucene. You talk to it through a HTTP interface, using XML, or JSON, which means that you don't have to integrate or link Lucene into your codebase. AFAIR, SOLR has decomposers for Word, Excel, PDF, etc. It doesn't have a webcrawler, but can integrate with Nutch. http://lucene.apache.org/solr/ Al. --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Python Ireland" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.ie/group/pythonireland?hl=en -~----------~----~----~----~------~----~------~--~---
