Dear Mark,
 
I am coming back to the tests by Kimbro you mentioned some time ago (see below).  Is there any additional info available?  I haven't seen any follow-up on this.
 
For general intrest, I give here some numbers relative to a single query on a 2.4 MB collection composed of 787 documents.
Here is the code performing the query:

String xpath = "//EmdResult[//Event/DateTime>'2000-12-31' and //Event/DateTime<'2001-02-01']";

XPathQueryService service =(XPathQueryService) col.getService("XPathQueryService", "1.0");

ResourceSet resultSet = service.query(xpath);

A typical XML document looks as follows and takes about 2.5 kB:

 <EmdResult>
 <Patient>
  <Id>B261D1W</Id>
  <LastName>THE GREAT</LastName>
  <FirstName>ALEXANDER</FirstName>
  <Street>BIG BANG STREET 25</Street>
  <Zip>3456789900</Zip>
  <City>MARS</City>
  <Sex>V</Sex>
  <BirthDate>1706-10-14</BirthDate>
  <Age Unit="jaren">275</Age>
  </Patient>
 <Request>
  <RequestDateTime>3001-15-22</RequestDateTime>
 <RequestingHcActor Oid="OrganizationSrv::THcActor(u80803)">
  <Name>DR. FRANKENSTEIN</Name>
  <OgNr>1766</OgNr>
  <PersId>M751BL00R</PersId>
  </RequestingHcActor>
 <RequestingService>
  <Id>CGYN</Id>
  </RequestingService>
 <Events>
 <Event Oid="emd::CBXMLEvent(B261014CYTOLOGIE0_LABALABA|0)">
 <PerformingHcActor Oid="OrganizationSrv::THcActor(1000006)">
  <Name>JULIUS CAESAR</Name>
  <OgNr />
  <PersId>V621</PersId>
  </PerformingHcActor>
 <ResponsableHcActor Oid="OrganizationSrv::THcActor(5981)">
  <Name>DR. FEELGOOD</Name>
  <OgNr>1981</OgNr>
  <PersId>9BN00H</PersId>
  </ResponsableHcActor>
  <DateTime>3001-24-06</DateTime>
...
  </Event>
  </Events>
  </Request>
  </EmdResult>
 
I have put an index on the Event and DateTime tags.
The query takes about 26 seconds using Xindice.  Using the commercial Tamino Server, I get about half of these timings.
I have seen similar figures in earlier discussions in this list.
For the kind of queries shown above, we aim at timings below 5 seconds.  Is this realistic?
 
Patrick.
 
=======================================================================
 
Are there any stats that could be dug up and posted about the testing that
Kimbro did?  I'm currently creating some adhoc ones on my own, to try to see
what the limitations are. 

Dan


-----Original Message-----
From: Mark J. Stang [mailto:[EMAIL PROTECTED]
Sent: Saturday, May 04, 2002 4:32 PM
To: [email protected]
Subject: Re: Performance with large databases


It should.   Kimbro has done some tests and it looks fast.   How fast do you
need to be?   Is it a multi-user project?

Mark

Dan Barron wrote:

> I'm considering Xindice for a project here. It's so easy to use and does
all I need it to do. But my concern is performance. The project I'm thinking
of will require a DB with in the neighborhood of 5 million records, each and
XML file of 5-15K. I don't know too much about our hardware other than they
are Sun and are maintained by our IS department so I'm sure they're
up-to-date hardware. Performance is crucial because the database will serve
dynamic data live to a website.
>
> I was wondering if anyone with experience could tell me if Xinidice would
handle this. Any one out there with large databases?
>
> Any help would be appreciated.
>
> dan
>
> ____________________________________________________________________
> Daniel W. Barron
> Senior Systems Analyst/Application Developer
> American College of Physicians-American Society of Internal Medicine
> Tel: (215) 351-2617     Tel: (800) 523-1546 x2617
> Fax: (215) 351-2644    E-mail: [EMAIL PROTECTED]

Reply via email to