Dear Jose, On Tue, 2007-06-19 at 07:10, Jose Blanco wrote:
> 3. We will also have the need to update these records periodically, > and so it seems like following a similar architecture as the one > DSpace uses, it would take a very unreasonable amount of time to > update 12 million records. Just last week I used the ItemImporter to > load 3,600 records and I believe it took about 7 hours for the load to > complete. I'm assuming that the reason it took so long was because my > repository already has about 35,000 items and the inserts to the > database were taking time, more than anything related to lucene. When > I loaded the same number of records in my development instance it took > less than 2 hours and I have very few records there ( probably about > 1000 ). Any thoughts on this? This issue has been raised many times on the various DSpace lists, but I am yet to see any substantive action on the part of the core developers to address it. Actually setting up some dedicated test servers with a decent amount of representative and scalable test data would be a start. One would take this for granted with any well organised test and release cycle, but to my knowledge DSpace releases are not subjected to serious performance profiling, scalability or stress testing. I have been hoping for some time that this deficiency would be corrected, but I am beginning to doubt that it will be addressed in the medium term. > 4. Has any one out there had to do something like this, and if so what > have you found that works. One solution that comes to mind is Zebra. > It is suppose to handle large repositories quite well. Are there any > users of Zebra out there that might have an opinion on this? I definitely suggest that you mail your requirements to the IndexData list. http://www.indexdata.dk/zebra/ http://lists.indexdata.dk/cgi-bin/mailman/listinfo/zebralist Over the years I have found IndexData's developers to be extremely helpful and responsive. An anecdote: Only the other day I found a bug in `yaz-client'. Not only was it fixed within a couple of days, but after consultation, new functionality was added. Just splendid. If only this attitude was more widespread. Best regards, Richard MAHONEY -- Richard MAHONEY | internet: http://indica-et-buddhica.org/ Littledene | telephone/telefax (man.): +64 3 312 1699 Bay Road | cellular: +64 27 482 9986 OXFORD, NZ | email: [EMAIL PROTECTED] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Indica et Buddhica: Materials for Indology and Buddhology Repositorium: http://indica-et-buddhica.org/repositorium/ Philologica: http://indica-et-buddhica.org/philologica/ Subscriptions: http://subscriptions.indica-et-buddhica.org/ ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ DSpace-tech mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspace-tech

