Re: [basex-talk] Performance Question

2012-12-13 Thread Thomas Kaltofen
: Andreas Weiler [mailto:andreas.wei...@uni-konstanz.de] Sent: Wednesday, December 12, 2012 2:33 PM To: Thomas Kaltofen Cc: Christian Grün; basex-talk@mailman.uni-konstanz.de Subject: Re: [basex-talk] Performance Question Hi Thomas, did you try with different hard disks? Probably the disk

Re: [basex-talk] Performance Question

2012-12-13 Thread Thomas Kaltofen
-konstanz.de Subject: Re: [basex-talk] Performance Question Hi Thomas, as Andreas indicated, it looks as if the hard disks need to re-adjust to your query patterns after longer breaks; after all, I doubt that this is something that could be fixed within BaseX. Instead, it may help to have

Re: [basex-talk] Performance Question

2012-12-13 Thread Johannes.Lichtenberger
On 12/13/2012 08:16 PM, Christian Grün wrote: Hi Thomas, some years ago, we did experiments with nio that didn’t differ too much from conventional I/O, but we may have overlooked issues, so your input is welcome. Note, however, that nio file channels are limited to 2GB (see e.g. [1]). As a

Re: [basex-talk] Performance Question

2012-12-12 Thread Thomas Kaltofen
.org] Sent: Tuesday, December 11, 2012 9:21 PM To: Thomas Kaltofen Cc: basex-talk@mailman.uni-konstanz.de Subject: Re: [basex-talk] Performance Question On Tue, 2012-12-11 at 17:07 +, Thomas Kaltofen wrote: I leave the server running for several hours without touching the database

Re: [basex-talk] Performance Question

2012-12-12 Thread Andreas Weiler
To: Thomas Kaltofen Cc: basex-talk@mailman.uni-konstanz.de Subject: Re: [basex-talk] Performance Question Hi Thomas, P.S. When (approx.) do you plan to release the next version of BaseX? It's only a few days left! As a little hint, I can already disclose that the release will be nicknamed

[basex-talk] Performance Question

2012-12-11 Thread Thomas Kaltofen
Hi, I have a question regarding performance, because my database shows somewhat strange behavior. I have a database with around 3000 documents; the size of each document does not really matter, because I tried it with small and big documents and there is no real difference (total size of test

Re: [basex-talk] Performance Question

2012-12-11 Thread Christian Grün
Hi Thomas, P.S. When (approx.) do you plan to release the next version of BaseX? It's only a few days left! As a little hint, I can already disclose that the release will be nicknamed »BaseXMas Edition«. I have a question regarding performance, because my database shows a somewhat strange

Re: [basex-talk] Performance Question

2012-12-11 Thread Liam R E Quin
On Tue, 2012-12-11 at 17:07 +, Thomas Kaltofen wrote: I leave the server running for several hours without touching the database at all (e.g. over the night). Most servers (especially Unix/Linux/Solaris) schedule checks overnight that often visit every file on the system, and the

Re: [basex-talk] performance question

2012-01-16 Thread Christian Grün
Thanks. I guess, I cannot do everything directly within XQuery, e.g., extending marked elements to continuous marking, to make <mark>Korb</mark> <mark>geben</mark> to be <mark>Korb geben</mark> -- it will be more important for queries with ftand or ftor. Currently, the ft:mark() and ft:extract()

Re: [basex-talk] performance question

2012-01-13 Thread Cerstin Mahlow
Quoting Cerstin Mahlow cerstin.mah...@unibas.ch: I will try the Beta 7.1. And now everything runs smoothly: I had to use the bigger machine for creating and indexing the collection (now consisting of 677 documents with 1.9 GB input size resulting in a 2.1 GB collection). Opening the

Re: [basex-talk] performance question

2012-01-13 Thread Maximilian Gärber
Hi Cerstin, for testing purposes you could use the ft:search function (see http://docs.basex.org/wiki/Full-Text_Module#ft:search ). This automatically applies the correct options. @Christian: I could not find it on the wiki but if I remember correctly, the full-text would not be used, if the

Re: [basex-talk] performance question

2012-01-12 Thread Christian Grün
Dear Cerstin, thanks for your e-mail, and the detailed information on your use case. To make inspection of results easier, I added ft:mark. A collection with only a dozen texts of about 71 MB with full text index for German, optimized, etc. works quite well. However, the example query