[basex-talk] Accessing DOCTYPE information after DB creation?

2014-03-28 Thread Hondros, Constantine (ELS-AMS)
Hi all, I would really like to be able to query a large corpus of documents to get names and counts of the DTDs which are declared in the (somewhat old-fashioned now) DOCTYPE declaration: <?xml version="1.0" encoding="utf-8"?> <!DOCTYPE converted-article PUBLIC "-//ES//DTD journal article DTD version

Re: [basex-talk] Accessing DOCTYPE information after DB creation?

2014-03-28 Thread Christian Grün
Hi Constantine, unfortunately no, because this information is already consumed by the XML parser (i.e., we don’t get to see it at all when the database is being built). Suggestions from other users with similar problems are welcome. Christian

Re: [basex-talk] Accessing DOCTYPE information after DB creation?

2014-03-28 Thread Imsieke, Gerrit, le-tex
You can preprocess your documents with Andrew Welch’s LexEv parser: http://andrewjwelch.com/lexev/

Re: [basex-talk] Accessing DOCTYPE information after DB creation?

2014-03-28 Thread Hondros, Constantine (ELS-AMS)
Thanks all, Unfortunately this is legacy content – and there is an unbelievable amount of it too. So, I will probably pre-process the content and write the DTD info out into an element or PI node. org.basex.core.Command.setInput(org.xml.sax.InputSource is) looks like a probable place to do
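The pre-processing idea above can be sketched with the standard SAX `LexicalHandler` callback `startDTD`, which reports the DOCTYPE name, public ID and system ID before the parser discards them. This is only a minimal sketch, not BaseX code: the class `DoctypeProbe`, its methods and the `(no DOCTYPE)` label are hypothetical names chosen for illustration, and the resolver deliberately returns an empty source for every external entity so that the declared DTD is never fetched.

```java
import java.io.StringReader;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.ext.DefaultHandler2;

// Hypothetical helper, not part of BaseX: reads DOCTYPE details via SAX.
final class DoctypeProbe {

    /** Remembers what the parser reports in startDTD. */
    static final class Handler extends DefaultHandler2 {
        String name;
        String publicId;
        String systemId;

        @Override
        public void startDTD(String name, String publicId, String systemId) {
            this.name = name;
            this.publicId = publicId;
            this.systemId = systemId;
        }

        // Assumption: the DTD content itself is not needed, so every external
        // entity (including the external DTD subset) resolves to empty input.
        @Override
        public InputSource resolveEntity(String name, String publicId,
                                         String baseURI, String systemId) {
            return new InputSource(new StringReader(""));
        }
    }

    /** Returns {name, publicId, systemId}; entries are null if not declared. */
    static String[] readDoctype(String xml) throws Exception {
        XMLReader reader = SAXParserFactory.newInstance().newSAXParser().getXMLReader();
        Handler handler = new Handler();
        reader.setProperty("http://xml.org/sax/properties/lexical-handler", handler);
        reader.setEntityResolver(handler);
        reader.setContentHandler(handler);
        reader.parse(new InputSource(new StringReader(xml)));
        return new String[] { handler.name, handler.publicId, handler.systemId };
    }

    /** Tallies documents by declared public ID. */
    static Map<String, Integer> countPublicIds(List<String> documents) throws Exception {
        Map<String, Integer> counts = new TreeMap<>();
        for (String xml : documents) {
            String publicId = readDoctype(xml)[1];
            counts.merge(publicId == null ? "(no DOCTYPE)" : publicId, 1, Integer::sum);
        }
        return counts;
    }
}
```

The values returned by `readDoctype` could then be written into an element or processing instruction before the document is added to the database, as suggested in the message; how that is wired into the import is left open here.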

Re: [basex-talk] stream binary responses from http:module

2014-03-28 Thread Florent Georges
On 27 March 2014 13:57, Christian Grün wrote: Hi, * HTTP Module (our favorite): add additional functions (e.g. http:get(), http:post(), etc.) with xs:base64 as return type. I am wondering... The current API already supports xs:base64Binary, in the case of a binary type (technically, for

Re: [basex-talk] stream binary responses from http:module

2014-03-28 Thread Christian Grün
Hi Florent, Is there any intrinsic difference, from an implementation point of view, in the context of streamability of the returned content, between a function returning always an xs:base64Binary item, and another one returning sometimes an xs:base64Binary item, sometimes a string, and

Re: [basex-talk] Accessing DOCTYPE information after DB creation?

2014-03-28 Thread France Baril
We moved to schemas; this way I don't lose the schema declaration, and users who edit documents from Oxygen (WebDAV connection) can get all the advantages of editing documents that are linked to their model, including suggestions for enumerated attributes and indentation that respects spacing for mixed