Re: [basex-talk] Issue with Full Text Retrieval

2017-09-18 Thread Christian Grün
Hi Ron, With mixed-content, it can be beneficial if element boundaries are ignored. An example: Hello world! contains text 'hello' If you set the CHOP option to false before creating a database, whitespaces will be included in your database. As Fabrice has pointed out, however, it is

Re: [basex-talk] A few general questions about BaseX

2017-09-18 Thread Christian Grün
Hi Anastasiou (and thanks Bridger), >> BaseX fails with a message along the lines “This is too big for one >> database”. >> >> 1) Are there any logs, beyond the DB logs? If yes, where can I find >> them? >> > I'm not sure how to enable more verbose logging with the GUI -- hopefully > one of

Re: [basex-talk] Basex Inner Workings

2017-09-18 Thread Anastasiou A .
Must be the day today, sorry, please see below: No, but it did not make any difference. But I will tell you what did make a difference, forcing everything to be a string and hard coding the names of the tags. That’s a ~3-4 sec query to return ~5 million items. I was led to this by what you

Re: [basex-talk] Is there any documentation on the narrow limits of XQuery index optimization in BaseX?

2017-09-18 Thread Omar Siam
Hi! Interesting ideas. I don't like the pragma idea that much because there is already sth. like that with xquery:eval. The thing I miss most is a function like xquery:eval that accepts a function as an argument but also takes a context and does that runtime optimization. Or a way to convert

Re: [basex-talk] Server Variables, cached vars, etc

2017-09-18 Thread Christian Grün
Hi Erik, I think that Xavier-Laurent, Marco, Fabrice and Kendall have already given excellent feedback. In our own projects, we store all global data in databases, or in local configuration files. One advantage is that this data requires no initialization and will automatically be available

Re: [basex-talk] Basex Inner Workings

2017-09-18 Thread Fabrice ETANCHAUD
Hi Athanasios, Could you please give us a idea of your resulting document size after 1.5 minutes of BaseX time ? Best regards, Fabrice De : basex-talk-boun...@mailman.uni-konstanz.de [mailto:basex-talk-boun...@mailman.uni-konstanz.de] De la part de Anastasiou A. Envoyé : lundi 18 septembre

Re: [basex-talk] retrieving the name of the archive?

2017-09-18 Thread Graydon Saunders
Thank you! Someday I will get it through my head that it's not really a file system down there. :) On Mon, Sep 18, 2017 at 11:00 AM, Christian Grün wrote: > Hi Graydon, > > > the config switch is "ARCHIVENAME = true" > > Exactly, that’s the option you’ll need to

Re: [basex-talk] Is there any documentation on the narrow limits of XQuery index optimization in BaseX?

2017-09-18 Thread Christian Grün
Hi Omar, Our current XQuery optimizer opens the addressed database in order to find out if it has the required index structures, and if these are up-to-date. Moreover, the cheapest index lookup will be selected if there are several index candidates. For example, in the following query, it will be

[basex-talk] retrieving the name of the archive?

2017-09-18 Thread Graydon Saunders
Hello -- BaseX will happily consume zip archives; this is just splendid for loading up a bunch of docx files. Now I find myself wanting the name of the docx file -- the original name of the archive -- and I don't know how to retrieve that. (or if it can be!) But I think it must be there

Re: [basex-talk] Basex Inner Workings

2017-09-18 Thread Anastasiou A .
Hello Many thanks, Dirk, Fabrice and Graydon. I was going to look up ways of enabling the server to run as fast as possible anyway later on, so it is always good to know how is BaseX “thinking”. I can see what you mean Graydon. This is a simple nested `for` to denormalise some of the

Re: [basex-talk] Startup hooks or persisting jobs

2017-09-18 Thread Christian Grün
Hi Kendall, Coincidentally, we had a similar discussion in our team. I have added a little issue; more feedback is welcome. Cheers, Christian PS: Thanks everyone for keeping the mailing list alive! [1] https://github.com/BaseXdb/basex/issues/1498 On Mon, Aug 28, 2017 at 9:28 PM, Kendall

Re: [basex-talk] retrieving the name of the archive?

2017-09-18 Thread Christian Grün
Hi Graydon, > the config switch is "ARCHIVENAME = true" Exactly, that’s the option you’ll need to enable to get the archive names included in your database paths. It can also be passed on to XQuery functions (db:create, db:add, etc.). > This gets me the behaviour I was expecting would happen,

Re: [basex-talk] Basex Inner Workings

2017-09-18 Thread Anastasiou A .
Hello Fabrice Given: [a number][a string] … [a number][a string] The size is ~5m `item`. (Depending on the query, we are talking about a few million items) If I don’t add any external additional structure, which here is defined by the `item`, `items` elements, then the

Re: [basex-talk] Basex Inner Workings

2017-09-18 Thread Anastasiou A .
No, but it did not make any difference. But I will tell you what did make a difference, forcing everything to be a string and hard coding the names of the tags. That’s a ~3-4 sec query to return ~5 million items. I was led to this by what you said about computed elements because it makes

Re: [basex-talk] Fwd: Re: using apply-function within an updating function

2017-09-18 Thread Christian Grün
Hi Rob, the official XQuery functions are limited to non-updating function arguments. Enabling MIXUPDATES is the only way out, because you would otherwise be able to write code that is both updating and non-updating: for $f in (db:output#1, count#1) return apply($f, [1]) Hope this helps,

Re: [basex-talk] Server Variables, cached vars, etc

2017-09-18 Thread Marco Lettere
Hi Christian, welcome back to the list! Having the possibility to run a script or call any sort of cuntvionality at startup is quite a useful thing. Doing it in a declarative way would be great maybe exploiting the .basex file or the web.xml? Btw, hooking up on Server variables, a colleague

Re: [basex-talk] Basex Inner Workings

2017-09-18 Thread Graydon Saunders
Sorry for the fumble-fingers; let me try that again. Have you tried creating literal elements? Computed elements have overhead; it's presumptively akin to why you don't want to create untyped variables in XSLT 2.0 and 3.0 (an untyped variable might be anything and needs a whole document node to

Re: [basex-talk] Simple Map Operator in oXygen XML (XQJ)

2017-09-18 Thread Christian Grün
Hi Omar, > Ok. Interesting. Please someone note this in the Wiki so there are no > surprises. I have added a little hint in our Wiki [1]. Cheers, Christian [1] http://docs.basex.org/wiki/Developing For the ! syntax I can just stick with for in return but missing > map {} is a reason to

Re: [basex-talk] Basex Inner Workings

2017-09-18 Thread Graydon Saunders
Have you tried creating literal elements? Computed elements have overhead; it's presumptively akin to why you don't want to create untyped variables in XSLT 2.0 and 3.0 (an untyped variable might be anything and needs a whole document node to exist in, and this is expensive). In this case, I'd

Re: [basex-talk] Is there any documentation on the narrow limits of XQuery index optimization in BaseX?

2017-09-18 Thread Christian Grün
> The thing I miss most is a function > like xquery:eval that accepts a function as an argument but also takes a > context and does that runtime optimization. I assume you are looking for something like the following query? xquery:eval-func( function($db, $name) { db:open($db)//person[name

Re: [basex-talk] Server Variables, cached vars, etc

2017-09-18 Thread Fabrice ETANCHAUD
Hello Christian ! Yes, a -c option for the basexhttp would help, as mentioned earlier, for example creating a shared mainmem collection. Best regards, Fabrice -Message d'origine- De : basex-talk-boun...@mailman.uni-konstanz.de [mailto:basex-talk-boun...@mailman.uni-konstanz.de] De la

Re: [basex-talk] retrieving the name of the archive?

2017-09-18 Thread Graydon Saunders
Ok, so it looks like: 1. find where BaseX is really getting its config files ($HOME in my case); http://docs.basex.org/wiki/Configuration#Configuration_Files says "Q{org.basex.util.Prop}USERHOME()" which is exceedingly helpful! 2. add, to .basex (NOT .basexgui) AFTER # Local Options

Re: [basex-talk] java.lang.NoClassDefFoundError: Could not initialize class java.nio.file.FileSystems$DefaultFileSystemHolder

2017-09-18 Thread Liam R. E. Quin
On Mon, 2017-09-18 at 23:31 +, Kendall Shaw wrote: > [...] maybe using the oracle JDK would work. Thank you for such a quick answer - it wasn't what i did, but it helped me find the problem. It seems an OS upgrade had upgraded the Java runtime, and that class is now neither provided nor

[basex-talk] java.lang.NoClassDefFoundError: Could not initialize class java.nio.file.FileSystems$DefaultFileSystemHolder

2017-09-18 Thread Liam R. E. Quin
It seems with the latest Java 1.8 - java-1.8.0-openjdk-headless-1.8.0.144-0.b01.el7_4.x86_64 on Centos 7, I can no longer drop a database, any ideas? This is with both 8.5.3 and 8.6.6, and also with the latest snapshot, BaseX867-20170824.195627.zip [[ $ bin/basexclient -p 1994 Username: admin

Re: [basex-talk] java.lang.NoClassDefFoundError: Could not initialize class java.nio.file.FileSystems$DefaultFileSystemHolder

2017-09-18 Thread Kendall Shaw
I don’t know if this helps. It could be because of openjdk vs oracle. In FileSystems.java in the oracle JDK getDefault loads: sun.nio.fs.DefaultFileSystemProvider.create() I imagine that openjdk wouldn’t use that, perhaps. So, maybe using the oracle JDK would work. Kendall On 9/18/17, 4:06

Re: [basex-talk] java.lang.NoClassDefFoundError: Could not initialize class java.nio.file.FileSystems$DefaultFileSystemHolder

2017-09-18 Thread Kendall Shaw
Or build basex using openjdk. On 9/18/17, 4:30 PM, "basex-talk-boun...@mailman.uni-konstanz.de on behalf of Kendall Shaw" wrote: I don’t know if this helps. It could be because of openjdk vs oracle. In

Re: [basex-talk] Issue with Full Text Retrieval

2017-09-18 Thread Ron Katriel
Hi Christian, Yes, this helps. By index rewritings, are you referring to the indices created when FTINDEX is set to true? Thanks, Ron On September 18, 2017 at 11:12:54 AM, Christian Grün ( christian.gr...@gmail.com) wrote: Hi Ron, With mixed-content, it can be beneficial if element boundaries