Hi everybody,

I'm working on a document/information management system with web-based search 
for a small legal aid organization (~30 workstations in office + ~10 remote 
users).  Most of it is in place -- files accessible via Samba on the internal 
network, accessible via webdav from outside, high availability, etc.  We've 
considered using third party search tools, but I'd like something light, 
OS-independent, globally accessible (secure remote access is a major 
priority) and tailored to the needs of our organization (such as having 
special indexes for shared cases and so forth).

1) Would pyLucene + Apache2/mod_python running on Debian Stable on a P4 server 
with 2gb ram would be a good direction to go in from a performance and 
stability standpoint? 

We're talking on the order of 50-100 searches per user per day, with the 
concentration being during the eight hour workday window.   To be liberal, 
let's say we're looking about 4000 searches during the workday and maybe 500 
or so outside that time window.  Will pyLucene be sufficient?  

Furthermore, how much more could it scale on a similar platform?  If this 
project is successful, I'd like to deploy it at a startup I'm consulting for, 
which will have about 60 workstations or so.

My consideration of pyLucene mainly revolves around my comfort with Python (I 
love Python... probably as a result of my undergrad work in physics and my 
grad school work in statistical social science) and lack of in-depth 
experience with JSP/servlets, as well as keeping things simple from an 
administrative standpoint.  Not having to run Tomcat means one less service 
to keep track of, particularly since we're already using an 
Apache2/mod_python set-up for other things.  It also feels to me like the 
pyLucene community is a smart and friendly bunch.

2) Using pyLucene on Linux, are there filesystem considerations?  I've become 
a huge fan of ReiserFS on my mail servers, where I'm using Maildirs -- the 
performance is noticably better than Ext3.  I'm guessing ResierFS would have 
similarly happy results for pyLucene (or regular old Lucene).  Anyone know 
about this?

3) I've been looking through the archives -- I'll make sure to compile 
pyLucene with the latest gcj!

Thanks a lot,
David
www.viewfromtheground.com
_______________________________________________
pylucene-dev mailing list
[email protected]
http://lists.osafoundation.org/mailman/listinfo/pylucene-dev

Reply via email to