I am forwarding one of Kimbro's old e-mails...
--- Begin Message ---
While working out some issues with the CORBA system, I've also been working on getting larger document sets into the server. My largest set right now is 149,025 documents in a single collection. The server can easily handle more documents; this is just the largest dataset I have available right now.

Here are some stats to give us a better idea of where we stand. These were run against the current CVS version, with one exception: I used OpenORB for the server ORB instead of JacORB. JacORB was still used for the client. It's likely we'll need to switch to OpenORB overall, as even the latest JacORB leaks memory on the server.
Computer: 750 MHz P3 laptop with 256MB RAM, running Mandrake Linux 8
JDK: Sun 1.3.0_04
Dataset size: 149,025 documents 601MB
Insertion time (no indexes): 1 hour 45 minutes, which is roughly 1,424 docs per minute or 24 per second.
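The throughput figures can be rechecked with a few lines of arithmetic. This is only a sketch: it assumes "1 hour 45 minutes" means exactly 105 minutes, which gives a rate slightly below the 1,424 docs per minute quoted, so the measured time was presumably a little under 105 minutes. The function name `throughput` is made up for illustration.

```python
DOCS = 149_025          # documents inserted (from the message)
ELAPSED_MINUTES = 105   # assumption: "1 hour 45 minutes" taken as exactly 105 minutes


def throughput(docs, minutes):
    """Return (docs per minute, docs per second) for an insertion run."""
    per_minute = docs / minutes
    return per_minute, per_minute / 60


per_minute, per_second = throughput(DOCS, ELAPSED_MINUTES)
print(f"{per_minute:.0f} docs/minute, {per_second:.1f} docs/second")
# prints "1419 docs/minute, 23.7 docs/second"
```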
Collection size: 657MB
Document retrieval: 2 seconds (including VM startup which is most of the time)
Full collection scan query /disc[id = '11041c03']: 12 minutes
Index creation: 13.5 minutes
Index based query /disc[id = '11041c03']: 2.12 seconds (including VM startup which is most of that time)
Index size: 164MB
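To put the numbers above side by side, the following sketch derives three ratios from them: how much faster the indexed query is than the full scan, how large the index is relative to the collection, and how much larger the collection is than the raw dataset. All inputs are the figures quoted in this message; the variable names are mine. Because the 2.12 seconds includes VM startup, the speedup computed here should be read as a lower bound.

```python
DATASET_MB = 601        # raw dataset size
COLLECTION_MB = 657     # collection size after insertion
INDEX_MB = 164          # index size
SCAN_SECONDS = 12 * 60  # full collection scan: 12 minutes
INDEXED_SECONDS = 2.12  # index-based query, including VM startup

# How many times faster the indexed query is than the full scan.
speedup = SCAN_SECONDS / INDEXED_SECONDS

# Index size as a fraction of the collection it covers.
index_ratio = INDEX_MB / COLLECTION_MB

# Extra space the collection needs compared with the raw documents.
storage_overhead = (COLLECTION_MB - DATASET_MB) / DATASET_MB

print(f"speedup: about {speedup:.0f}x")
print(f"index size: {index_ratio:.0%} of the collection")
print(f"storage overhead: {storage_overhead:.1%}")
```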
The data set consists of documents similar to the following.
<?xml version="1.0"?>
<disc>
<id>11041c03</id>
<length>1054</length>
<title>Orchestral Manoeuvres In The Dark / The OMD Remixes (Single)</title>
<genre>cddb/misc</genre>
<track index="1" offset="150">Enola Gay (OMD vs Sash! Radio Edit)</track>
<track index="2" offset="18790"> (2)Souvenir (Moby Remix)</track>
<track index="3" offset="39790"> (3)Electricity (The Micronauts Remix)</track>
</disc>
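As a rough illustration of why the index-based query is so much faster than the full scan, here is a toy Python sketch. It is not how dbXML implements indexes; it simply contrasts parsing every document to test its `id` with looking the value up in a prebuilt dictionary. The helper names and the second document ID (`deadbeef`) are hypothetical.

```python
import xml.etree.ElementTree as ET

SAMPLE = """<?xml version="1.0"?>
<disc>
<id>11041c03</id>
<length>1054</length>
<title>Orchestral Manoeuvres In The Dark / The OMD Remixes (Single)</title>
<genre>cddb/misc</genre>
<track index="1" offset="150">Enola Gay (OMD vs Sash! Radio Edit)</track>
</disc>"""


def disc_id(xml_text):
    """Parse one document and return the text of its <id> child."""
    return ET.fromstring(xml_text).findtext("id")


def scan(collection, wanted_id):
    """Full collection scan: parse every document and test its <id>."""
    return [doc for doc in collection if disc_id(doc) == wanted_id]


def build_index(collection):
    """Parse each document once, mapping id value -> positions in the collection."""
    index = {}
    for pos, doc in enumerate(collection):
        index.setdefault(disc_id(doc), []).append(pos)
    return index


def indexed_lookup(collection, index, wanted_id):
    """Index-based query: one dictionary lookup, no parsing at query time."""
    return [collection[pos] for pos in index.get(wanted_id, [])]


# Hypothetical three-document collection; "deadbeef" is a made-up id.
other = SAMPLE.replace("11041c03", "deadbeef")
collection = [other, SAMPLE, other]
index = build_index(collection)

assert scan(collection, "11041c03") == indexed_lookup(collection, index, "11041c03")
```

The scan does work proportional to the size of the collection on every query, while the index pays that cost once at creation time, which matches the shape of the timings reported above.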
Kimbro Staken
The dbXML Group L.L.C. - http://www.dbxmlgroup.com/
Embedded XML Database Software and Services
_______________________________________________
dbXML-Core-Devel mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/dbxml-core-devel
--- End Message ---
