Hi everybody, I'm working on a document/information management system with web-based search for a small legal aid organization (~30 workstations in office + ~10 remote users). Most of it is in place -- files accessible via Samba on the internal network, accessible via webdav from outside, high availability, etc. We've considered using third party search tools, but I'd like something light, OS-independent, globally accessible (secure remote access is a major priority) and tailored to the needs of our organization (such as having special indexes for shared cases and so forth).
1) Would pyLucene + Apache2/mod_python running on Debian Stable on a P4 server with 2gb ram would be a good direction to go in from a performance and stability standpoint? We're talking on the order of 50-100 searches per user per day, with the concentration being during the eight hour workday window. To be liberal, let's say we're looking about 4000 searches during the workday and maybe 500 or so outside that time window. Will pyLucene be sufficient? Furthermore, how much more could it scale on a similar platform? If this project is successful, I'd like to deploy it at a startup I'm consulting for, which will have about 60 workstations or so. My consideration of pyLucene mainly revolves around my comfort with Python (I love Python... probably as a result of my undergrad work in physics and my grad school work in statistical social science) and lack of in-depth experience with JSP/servlets, as well as keeping things simple from an administrative standpoint. Not having to run Tomcat means one less service to keep track of, particularly since we're already using an Apache2/mod_python set-up for other things. It also feels to me like the pyLucene community is a smart and friendly bunch. 2) Using pyLucene on Linux, are there filesystem considerations? I've become a huge fan of ReiserFS on my mail servers, where I'm using Maildirs -- the performance is noticably better than Ext3. I'm guessing ResierFS would have similarly happy results for pyLucene (or regular old Lucene). Anyone know about this? 3) I've been looking through the archives -- I'll make sure to compile pyLucene with the latest gcj! Thanks a lot, David www.viewfromtheground.com _______________________________________________ pylucene-dev mailing list [email protected] http://lists.osafoundation.org/mailman/listinfo/pylucene-dev
