Re: [MarkLogic Dev General] xpath string construction

Robert Koberg Wed, 15 Oct 2008 06:57:21 -0700

Hi again,

To me, this is the same as locking the file, except that you are possibly letting someone spend wasted time editing a doc only to lose their changes if not up-to-date. As you say it is rare, but just wait till you hear from someone who spends 10 minutes editing a file only to see all the work lost.


best,
-Rob


On Oct 15, 2008, at 9:41 AM, Eric Palmitesta wrote:

Good morning all! Sorry to cause such a stir. Upon reading your responses, I feel you've gotten the wrong idea, which is probably due to communication failure on my part.
My idea of sequential ids is one 'special' document, for example /id.xml, which contains nothing but <id>42</id>, and an id() function which exclusive-locks the file, yanks 42 out, increments it, replaces the text node with 43, and unlocks the file. My environment is read-heavy, write-light, so although write operations which require a unique id would touch this file, I don't think it would be an awful bottleneck. This guarantees unique ids without ever having to worry about collisions.
Of course, the counter-argument is that since it's a write-light environment, the chances of using random() and lightning striking twice, as Michael put it, are infinitesimally small. I don't truly have a problem with using random ids, I'm just saying it's worth noting that it is *impossible* for lightning to strike twice with sequential ids.
Eric
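
The locked-counter idea Eric describes can be sketched as follows. This is only an illustration in Python, not MarkLogic code: the class name CounterDocument, the in-memory value standing in for /id.xml, and the use of threading.Lock in place of an exclusive document lock are all assumptions made for the example. It returns the value that was read (42) and stores the next one (43); the thread does not say which of the two the id() function hands back.

```python
import threading


class CounterDocument:
    """In-memory stand-in (hypothetical) for a document like /id.xml holding <id>N</id>."""

    def __init__(self, start=42):
        self._value = start
        # Plays the role of the exclusive lock on the counter document.
        self._lock = threading.Lock()

    def next_id(self):
        # Lock, read the current value, store the incremented value, unlock.
        with self._lock:
            current = self._value
            self._value = current + 1
            return current


def allocate_concurrently(counter, workers=8, per_worker=1000):
    """Have several threads draw ids at once and return everything they received."""
    results = []
    results_lock = threading.Lock()

    def work():
        local = [counter.next_id() for _ in range(per_worker)]
        with results_lock:
            results.extend(local)

    threads = [threading.Thread(target=work) for _ in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results
```

Every caller passes through the same lock, which is why collisions cannot happen and also why the counter serializes all writers that need an id.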

Wayne Feick wrote:
Hi Eric,
A disadvantage of sequential ids is that you can end up read locking all of your documents in order to find the current max id. You can address this partially by moving the next id into a separate document, but that document can still become a bottleneck if you have a high insertion rate. You could also address this by creating a range index on the id and using cts:element-values() or cts:element-attribute-values() to find the max.
By switching to random ids, you get better parallelism since our indexes can quickly determine if the id is already in use and will lock at most one document (or 0 if your existing id search is unfiltered). There is still a vanishingly small probability that two competing threads would allocate the same random id at the same moment in time, but that is improbable enough to be ignored.
Wayne.
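
As a rough illustration of the random-id approach Wayne outlines, here is a minimal Python sketch, not MarkLogic code. The function names, the 64-bit id width, and the Python set standing in for an index lookup on the id are all assumptions for the example. The second function uses the standard birthday approximation, 1 - exp(-n(n-1)/(2N)), to estimate how likely any collision is among n uniformly random ids drawn from N possible values if no in-use check were done at all.

```python
import math
import random


def new_random_id(in_use, bits=64, rng=random):
    """Draw random ids until one is not already in `in_use`, then reserve it."""
    while True:
        candidate = rng.getrandbits(bits)
        # Stand-in for asking an index whether the id is already taken.
        if candidate not in in_use:
            in_use.add(candidate)
            return candidate


def collision_probability(n, bits=64):
    """Birthday-approximation estimate of any collision among n random ids."""
    space = 2 ** bits
    # -expm1(-x) equals 1 - exp(-x) but stays accurate for very small x.
    return -math.expm1(-n * (n - 1) / (2 * space))
```

Note that the check and the reservation in new_random_id are not atomic across threads, which corresponds to the small remaining race between two competing writers that Wayne mentions.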
On Tue, 2008-10-14 at 13:07 -0400, Eric Palmitesta wrote:
Wow, thanks for the reply, Michael. I'll probably be using some variation of one of your examples.
Michael Blakeley wrote:
> Many people ask about sequential ids. It is possible to model an id
> sequence as a database document. But as with RDBMS sequences, there are
> serialization penalties. I don't see the advantage of sequential ids, so
> I rarely, if ever, use this approach.
Assuming the recursive check isn't feasible (it doesn't scale well), the advantage of sequential ids is being able to sleep at night knowing collisions are simply impossible, and are not reliant on a 'good-enough' random() function. I'm nit-picking of course, I'm sure random() is fine. :)
Cheers,

Eric
_______________________________________________
General mailing list
General@developer.marklogic.com <mailto:General@developer.marklogic.com>
http://xqzone.com/mailman/listinfo/general
