Bugs item #2586088, was opened at 2009-02-10 19:47
Message generated for change (Settings changed) made by vzzzbx
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=482468&aid=2586088&group_id=56967

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: PF/runtime
Group: MonetDB4 "stable"
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Wouter Alink (vzzzbx)
>Assigned to: Jan Flokstra (jflokstra)
Summary: XQ: large text nodes

Initial Comment:
(monetdb nov2008 sp2 on Linux)

The following occurred:

wal...@ldc:~/tmp> xmlwf tmp.xml # content is well-formed
wal...@ldc:~/tmp> cat tmp.xml | mclient -lxq -I oops5.xml
MAPI  = mone...@localhost:50000
ACTION= mapi_stream_into
ERROR = !ERROR: Detected an entity reference loop
        !ERROR: shredder_parse: XML input not well-formed.
        !ERROR: CMDshred_stream: operation failed.
wal...@ldc:~/tmp>

What happened is that there is a text-node in tmp.xml which contains more than 
8M characters.

In shred_characters() in shredder.mx the maximum text content buffer size is 
set at 8M (1<<23). It ignores everything after the 8Mth character. If the 8Mth 
character is in the middle of an entity (like "&quot;"), then the error above 
is returned.

I was able to reproduce a document with the features described above using the 
following python script:

wal...@ldc:~/tmp> cat createLargeTextField.py
i=0
print "<aap>"
while i < 10000000:
        print '&quot;'
        i+=1
print "</aap>"
wal...@ldc:~/tmp> python createLargeTextField.py > tmp.xml


p.s. another issue, not really a bug, is that for each (small) portion after 
the 8Mth character of a text-node a warning is issued. I would expect only 1 
warning to be issued for each text-node that is too large. different bug-report?


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=482468&aid=2586088&group_id=56967

------------------------------------------------------------------------------
Create and Deploy Rich Internet Apps outside the browser with Adobe(R)AIR(TM)
software. With Adobe AIR, Ajax developers can use existing skills and code to
build responsive, highly engaging applications that combine the power of local
resources and data with the reach of the web. Download the Adobe AIR SDK and
Ajax docs to start building applications today-http://p.sf.net/sfu/adobe-com
_______________________________________________
Monetdb-bugs mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/monetdb-bugs

Reply via email to