On 11.07.2020 14:41, Giuseppe G. A. Celano wrote:

I am trying to perform a join operation between two large XML files
(~490 MB and ~40 MB), which are the result of the automatic conversion
of old SQL dumps into XML files. I created two databases for the files.
I believe the query I wrote to join them is correct, because it works when
I limit the join to just a few items, but it never finishes if I apply it
to all items:

Here is the XQuery:
https://git.informatik.uni-leipzig.de/celano/perseus_morpheus/-/blob/master/join_files.xq
Here is the first file:



Saxon EE seems to be able to handle it (loading the files with the doc
function, of course, rather than from a database), although it needs more
than 2 GB of memory.

I am not sure where BaseX struggles; someone on the BaseX team can
probably tell you soon.

Saxon HE also struggles, so it is presumably some of the advanced join
optimizations in EE that allow it to run that query in a reasonable time.
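
To illustrate what such a join optimization amounts to, here is a minimal
sketch of a hand-written hash join in XQuery 3.1. It is not the query linked
above and not necessarily what Saxon EE does internally. The file names
(lemmas.xml, parses.xml), the element name (row) and the attribute names
(id, lemma_id) are all hypothetical placeholders. The idea is to index the
smaller file once in a map and then look up each item of the larger file in
that map, instead of scanning the smaller file again for every item.

```xquery
xquery version "3.1";

(: Hypothetical names throughout: adapt file, element and attribute
   names to the actual data. :)

(: Build the index once: join key -> all matching rows of the smaller file.
   'combine' keeps every row when several rows share the same key. :)
let $index := map:merge(
  for $l in doc('lemmas.xml')//row
  return map:entry(string($l/@id), $l),
  map { 'duplicates': 'combine' }
)

(: Probe the index once per row of the larger file. :)
for $p in doc('parses.xml')//row
let $matches := $index(string($p/@lemma_id))
where exists($matches)
return <pair>{ $p, $matches }</pair>
```

With a map lookup per row, the work grows roughly with the sum of the two
input sizes rather than with their product, which is why a nested-loop
formulation of the same join can appear never to finish on inputs this large.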
