Bugs item #2722174, was opened at 2009-03-30 18:38
Message generated for change (Comment added) made by vzzzbx
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=482468&aid=2722174&group_id=56967

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: PF/loader
Group: Pathfinder "stable"
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Wouter Alink (vzzzbx)
Assigned to: Nobody/Anonymous (nobody)
Summary: XQ: shredding 70GB XML fails.

Initial Comment:
Shredding 1000000 (1 million) documents (about 70GB) using MonetDB/XQuery fails,
while shredding only the first 700000 documents of this collection succeeds. See
the error message below. More investigation is needed. To be continued.

MAPI  = mone...@localhost:52009
QUERY = for $i in doc("1M_docs.xml")//doc return
pf:add-doc($i,concat("1M_docs_",$i),"1M_collection.xml")
ERROR = !ERROR: BBPdecref: 1001729024_rid_size does not have pointer fixes.
        !ERROR: BBPdecref: 1001729024_rid_level does not have pointer fixes.
        !ERROR: BBPdecref: 1001729024_rid_prop does not have pointer fixes.
        !ERROR: BBPdecref: 1001729024_prop_text does not have pointer fixes.
        !ERROR: BBPdecref: 1001729024_prop_val does not have pointer fixes.

real    240m55.752s
user    0m0.004s
sys     0m0.004s
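
The 700000 versus 1000000 observation suggests narrowing down the smallest
document count that fails. A minimal Python sketch of how that could be
organised follows. It assumes that fn:subsequence is available in this
MonetDB/XQuery build and that failure is monotonic in the number of documents;
neither is confirmed in this report. The helper names build_shred_query and
bisect_failure are hypothetical, and how a query is actually run and judged as
failed is left to the caller.

```python
def build_shred_query(list_doc, collection, first_n, prefix="1M_docs_"):
    """Build an XQuery that shreds only the first `first_n` listed documents.

    Assumption: fn:subsequence is supported by the server; pf:add-doc is used
    with the same three arguments as in the report.
    """
    if first_n < 1:
        raise ValueError("first_n must be positive")
    return (
        f'for $i in subsequence(doc("{list_doc}")//doc, 1, {first_n}) return\n'
        f'pf:add-doc($i,concat("{prefix}",$i),"{collection}")'
    )


def bisect_failure(lo_ok, hi_fail, fails):
    """Return the smallest count for which fails(count) is true.

    Preconditions (assumed, not verified here): lo_ok succeeds, hi_fail fails,
    and every count above a failing count also fails.
    """
    while hi_fail - lo_ok > 1:
        mid = (lo_ok + hi_fail) // 2
        if fails(mid):
            hi_fail = mid   # failure boundary is at or below mid
        else:
            lo_ok = mid     # failure boundary is above mid
    return hi_fail
```

Each probe would be a full shredding run against a fresh database, so with a
wall-clock time of about four hours for the failing run, only a handful of
bisection steps are likely to be practical.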


----------------------------------------------------------------------

>Comment By: Wouter Alink (vzzzbx)
Date: 2009-04-02 16:22

Message:
In the initial posting of this bug, the '1001729024_rid_size' bat belonged
to the /some/path/1M_docs.xml document, which contains the names of the
documents to shred. This document was shredded on the fly (cached, and
not made persistent). The actual bug-query read:

for $i in doc("/some/path/1M_docs.xml")//doc return
pf:add-doc($i,concat("1M_docs_",$i),"1M_collection.xml")

One possible cause was that the 'temporary' document was thrown away too
early, but this doesn't seem to be the case.

To test this, the XML document containing the names of the million documents
can be added explicitly with add-doc first, using the following two queries:

pf:add-doc("/some/path/1M_docs.xml","1M_docs.xml")
<>
for $i in doc("1M_docs.xml")//doc return
pf:add-doc($i,concat("1M_docs_",$i),"1M_collection.xml")

This results in:

Mserver: gdk_bbp.mx:1705: decref: Assertion `0' failed.

Investigation showed that this time a 'ws' bat had too few refs.

So the 'temporary' document being thrown away too early is probably not the
cause after all.

------------------------------------------------------------------------------
_______________________________________________
Monetdb-bugs mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/monetdb-bugs
