Re: [basex-talk] Feature Request: serializable functions

2014-11-06 Thread Christian Grün
Hi Marco, You may be surprised (or not) that I thought about a similar functionality. Indeed I wondered if we could allow function items as input for client:query and xquery:evaluate. I like your idea of providing an explicit serialization function. For several reasons, we cannot guarantee that

Re: [basex-talk] Dynamic Evaluation of XQUERY

2014-11-06 Thread Christian Grün
Hi Mansi, ~/Downloads/basex/bin/basex -bn='/Archives/*//class[contains(@name,abc) and contains(@name,pqr)]' get_paths.xq Stopped at /Users/mansiadmin/Documents/Research-Projects/BigData, 1/4: [XPDY0002] and: no context value bound. It seems that and was interpreted as XPath step, so it seems

[basex-talk] Fwd: Preventing unescaping of entities

2014-11-06 Thread Christian Grün
Hi James, Beforehand: Please send your mails to basex-talk@mailman.uni-konstanz.de; otherwise, they won't be directed to everyone. Currently, there is no way to prevent the unescaping the entities (the XML and XQuery specifications do not foresee this). One way to enforce the escaping of certain

Re: [basex-talk] Feature Request: serializable functions

2014-11-06 Thread Christian Grün
Hi Carl, In the BaseX query parser, tokenization and parsing is a joint operation. in the past, we thought about splitting these steps, as the tokens could also be used in the GUI to provide better highlighting. However, we'd probably need some more coders interested in making that happen..

Re: [basex-talk] Feature Request: serializable functions

2014-11-06 Thread Christian Grün
One more thing we have to be aware of: In XQuery, we have lots of different function items. I wonder if we should support all of these? declare function local:f() { 123 }; inspect:serialize(map {}), inspect:serialize(local:f#0), inspect:serialize(substring#2),

Re: [basex-talk] Out Of Memory

2014-11-06 Thread Mansi Sheth
This would need a lot of details, so bear with me below: Briefly my XML files look like: A name= B name= C name= D name= E name=/ A can contain B, C or D and B, C or D can contain E. We have 1000s (currently 3000 in my test data set) of such xml files, of

Re: [basex-talk] Out Of Memory

2014-11-06 Thread Fabrice Etanchaud
Hi Mansi, Here you have a natural partition of your data : the files you ingested. So my first suggestion would be to query your data on a file basis: for $doc in db:open(‘your_collection_name’) let $file-name := db:path($doc) return file:write( $file-name, names

Re: [basex-talk] Out Of Memory

2014-11-06 Thread Mansi Sheth
Interesting idea, I thought of using db partition, but didn't pursue it further, mainly due to below thought process. Currently, I am ingesting ~3000 xml files, storing ~50 xml files per db, which would be growing quickly. So, below approach would lead to ~3000 more files (which would be

Re: [basex-talk] Out Of Memory

2014-11-06 Thread Graydon Saunders
Hi Mansi -- Just out of habitual paranoia about the performance of *// in XPath, I might try replacing /A/*//E/@name/string() with E[ancestor::A[not(parent::*)]/@name and not worry about stringifying the resulting sequence of attribute nodes until the next step, whatever that might be. It might

Re: [basex-talk] Out Of Memory

2014-11-06 Thread Fabrice Etanchaud
The solution depends on the usage you will have of your extraction. May I ask you what is your extraction for ? Best regards, Fabrice De : Mansi Sheth [mailto:mansi.sh...@gmail.com] Envoyé : jeudi 6 novembre 2014 17:11 À : Fabrice Etanchaud Cc : Christian Grün; BaseX Objet : Re: [basex-talk] Out

Re: [basex-talk] Feature Request: serializable functions

2014-11-06 Thread Christian Grün
To complement this: In a straightforward implementation, the following query… declare function local:f() { 123 }; inspect:serialize(map {}), inspect:serialize(local:f#0), inspect:serialize(substring#2), inspect:serialize(function() {1}) …would yield the following result: map { }

Re: [basex-talk] Out Of Memory

2014-11-06 Thread Mansi Sheth
I would be doing tons of post processing. I never use UI. I either use REST thru cURL or command line. I would basically need data in below format: XML File Name, @name I am trying to whitelist picking up values for only starts-with(@name,pqr). where pqr is a list of 150 odd values. My file

Re: [basex-talk] Out Of Memory

2014-11-06 Thread Graydon Saunders
Hi Mansi -- If you use for x in E/@name[starts-with(.,'pqr')] return (tokenize(base-uri($x),'/')[last()],string($x)) for each of the 150-odd values (you may want to generate the query :) it will more likely work. It's not just the size of the database, it's the size of the result, too; keeping

Re: [basex-talk] Feature Request: serializable functions

2014-11-06 Thread Andy Bunce
In some case, it's even not possible because the optimizer creates new expressions that have no counterpart in the original XQuery language Do you mean they invoke internal BaseX functions? Also to help me understand what BaseX optimization involves: If these serializations are post

Re: [basex-talk] Tomcat - Multiple BaseX Services - Startup Problems

2014-11-06 Thread Bridger Dyson-Smith
Hi all, On Tue, Nov 4, 2014 at 10:31 AM, Bridger Dyson-Smith bdysonsm...@gmail.com wrote: My apologies -- I accidentally clicked 'send'. :/ On Tue, Nov 4, 2014 at 10:12 AM, Bridger Dyson-Smith bdysonsm...@gmail.com wrote: Hi all, I know that there aren't many Tomcat users on this list,