HI. I had a similiar query recently.
That is the only way to get the exact number of triples as of ML7.2... Part of the overhead is that the cts:triples removes duplicates. My solution was to *add an index on sem:triples* and then use regular cts related features (including xdmp:estimate), but that required an index and did not remove duplicates - BUT, it was fast. John Snelson form MarkLogic shared this as a solution: *If you turn on the diagnostic flag called "SPARQL Value Frequencies" (Admin UI: Configure->Groups->Default->Diagnostics), then when you run a new SPARQL query it will log something like this: * *2014-02-25 10:23:36.246 Info: [Event:id=SPARQL Value Frequencies] sessionKey=9770609635549801812 values= * *2014-02-25 10:23:36.246 Info: () = count:1469913, sUniq:69712, pUniq:1622, oUniq:196241 * *2014-02-25 10:23:36.246 Info: <blah> = freq:0, s: (count:0, pUniq: 0, oUniq: 0), p: (count:0, sUniq: 0, oUniq: 0), o: (count:0, sUniq: 0, pUniq: 0) * *2014-02-25 10:23:36.246 Info: <foo> = freq:0, s: (count:0, pUniq: 0, oUniq: 0), p: (count:0, sUniq: 0, oUniq: 0), o: (count:0, sUniq: 0, pUniq: 0) * *If you take the figure after "() = count:" (in this case 1469913) and divide by 3 that will me the numbers of triples in the system, including duplicates and triples in deleted but not yet merged out documents. * This gives you the rough number without the need for a new index, but is not available via programming (unless, of course you parsed the log file) Kind Regards, David Ennis -- David Ennis Content Engineer [image: HintTech Mastering the value of content] <http://www.hinttech.com> Mastering the value of content creative | technology | content Delftechpark 37i 2628 XJ Delft The Netherlands T: +31 88 268 25 00 M: +31 6 000 000 00 [image: Website] <http://www.hinttech.com> [image: Twitter]<https://twitter.com/HintTech> [image: Facebook] <http://www.facebook.com/HintTech> [image: LinkedIn]<http://www.linkedin.com/company/HintTech> [image: HintTech Mastering the value of content] <http://www.dayon.nl> On 24 March 2014 19:12, Joe Bryan <[email protected]> wrote: > Hi Clark, > > This can be very slow, but I'm not aware of any alternatives: > > fn:count(cts:triples()) > > Thanks. > > -jb > > Joseph Bryan > Sr. Field Engineer > MarkLogic Corporation > > From: Clark Richey <[email protected]> > Reply-To: MarkLogic Developer Discussion <[email protected]> > Date: Monday, March 24, 2014 at 2:10 PM > To: MarkLogic Developer Discussion <[email protected]> > Subject: [MarkLogic Dev General] number of triples > > Is there an easy way to determine the number of total triples in the > DB? I have triples insides of other documents. > > -------------------- > Clark Richey, CTO > FactGem > 240-252-7507 > [email protected] > > > > > Need immediate assistance? Please try: > > Cate Downing, Assistant Beth Price, Assistant > [email protected] [email protected] > 614.325.2404 614.365.0740 > > This message and any included attachments are property of FactGem and > its affiliates, and are intended only for the addressee(s). The information > contained herein may include trade secrets or privileged or otherwise > confidential information. Unauthorized review, forwarding, printing, > copying, distributing, or using such information is strictly prohibited and > may be unlawful. If you received this message in error, or have reason to > believe you are not authorized to receive it, please promptly delete this > message and notify the sender by e-mail. Thank you. > > > > > _______________________________________________ > General mailing list > [email protected] > http://developer.marklogic.com/mailman/listinfo/general > >
<<inline: 8F9B7491-1F5C-43A4-8779-D11F38F15E68.png>>
_______________________________________________ General mailing list [email protected] http://developer.marklogic.com/mailman/listinfo/general
