Re: [MarkLogic Dev General] xpath string construction

Robert Koberg Wed, 15 Oct 2008 07:24:52 -0700


On Oct 15, 2008, at 10:09 AM, Eric Palmitesta wrote:

Rob,

I think so far we're talking about insertion, not editing.

OK, that wasn't what I was understanding. And sorry to keep comingback, but I want to understand what I am missing.

Assuming you don't mean inserting in an existing document (which Iunderstand to be editing), and you are just inserting a new document,how would you have an ID to compare against? And, why isn't a URI goodenough?


best,
-Rob

What you're referring to is a whole other can of worms. I'veimplemented something like a lock-less editor before (java-basedwebsite, nothing to do with xquery) which, upon saving an editeddocument, would check to see if the timestamp on the document haschanged while your editing was taking place. If so, it would holdonto the data and say "Hey, someone edited and saved the doc you'reediting and trying to save now. I've recovered your data though, wecan proceed from here". This was for a relatively low-traffic app,though.
I think someone described something similar to this not too long agoon this mailing list, although I can't find that email now.
Eric

Robert Koberg wrote:
Hi again,
To me, this is the same as locking the file, except that you arepossibly letting someone spend wasted time editing a doc only tolose their changes if not up-to-date. As you say it is rare, butjust wait till you hear from someone who spends 10 minutes editinga file only to see all the work lost.
best,
-Rob
On Oct 15, 2008, at 9:41 AM, Eric Palmitesta wrote:
Good morning all! Sorry to cause such a stir. Upon reading yourresponses, I feel you've gotten the wrong idea, which is probablydue to communication failure on my part.
My idea of sequential ids is one 'special' document, for example /id.xml, which contains nothing but <id>42</id>, and an id()function which exclusive-locks the file, yanks 42 out, incrementsit, replaces the text node with 43, and unlocks the file. Myenvironment is read-heavy, write-light, so although writeoperations which require a unique id would touch this file, Idon't think it would be an awful bottleneck. This guaranteedunique ids without having to ever worry about collisions.
Of course, the counter-argument is that since it's a write-lightenvironment, the chances of using random() and lighting strikingtwice, as Michael put it, are infinitesimally small. I don'ttruly have a problem with using random ids, I'm just saying it'sworth noting that it is *impossible* for lighting to strike twicewith sequential ids.
Eric

Wayne Feick wrote:
Hi Eric,
A disadvantage of sequential ids is that you can end up readlocking all of your documents in order to find the current maxid. You can address this partially by moving the next id into aseparate document, but that document can still become abottleneck if you have a high insertion rate. You could alsoaddress this by creating a range index on the id and usingcts:element-values() or cts:element-attribute-values() to findthe max.By switching to random ids, you get better parallelism since ourindexes can quickly determine if the id is already in use andwill lock at most one document (or 0 if your existing id searchis unfiltered). There is still a vanishingly small probabilitythat two competing threads would allocate the same random id atthe same moment in time, but that is improbable enough to beignored.
Wayne.
On Tue, 2008-10-14 at 13:07 -0400, Eric Palmitesta wrote:
Wow, thanks for the reply, Michael. I'll probably be using somevariation of one of your examples.
Michael Blakeley wrote:
> Many people ask about sequential ids. It is possible to modelan id > sequence as a database document. But as with RDBMSsequences, there are > serialization penalties. I don't see theadvantage of sequential ids, so > I rarely, if ever, use thisapproach.
Assuming the recursive check isn't feasible (it doesn't scalewell), the advantage of sequential ids is being able to sleep atnight knowing collisions are simply impossible, and are notreliant on a 'good-enough' random() function. I'm nit-pickingof course, I'm sure random() is fine. :)
Cheers,

Eric
_______________________________________________
General mailing list
General@developer.marklogic.com <mailto:General@developer.marklogic.com>
http://xqzone.com/mailman/listinfo/general
------------------------------------------------------------------------
_______________________________________________
General mailing list
General@developer.marklogic.com
http://xqzone.com/mailman/listinfo/general
_______________________________________________
General mailing list
General@developer.marklogic.com
http://xqzone.com/mailman/listinfo/general
_______________________________________________
General mailing list
General@developer.marklogic.com
http://xqzone.com/mailman/listinfo/general
_______________________________________________
General mailing list
General@developer.marklogic.com
http://xqzone.com/mailman/listinfo/general


_______________________________________________
General mailing list
General@developer.marklogic.com
http://xqzone.com/mailman/listinfo/general

Re: [MarkLogic Dev General] xpath string construction

Reply via email to