I think your call to element-words is missing the second parameter;  $options 
is the 3rd parameter.  So I think it should be:

cts:element-words(fn:QName("http://grtjn.nl/twitter/utils";, "text"), (), 
"collation=http://marklogic.com/collation/nl/S1/AS/T00BB";)

It looks like the example in the doc is missing that second arg too--I'll see 
if I can get that fixed ;)

-Danny

________________________________________
From: [email protected] 
[[email protected]] On Behalf Of Geert Josten 
[[email protected]]
Sent: Saturday, May 12, 2012 8:52 AM
To: MarkLogic Developer Discussion
Subject: [MarkLogic Dev General] Bug in cts:element-words? (was: Term with      
same stem)

Curious how well the idea of Danny would perform, I thought to apply it to one 
of my test databases with a fair number of tweets (roughly 400K last time I 
checked). I had to rewrite cts:words to cts:element-words since I have no words 
lexicon. But it breaks with me. Did I hit a bug?

let $map := map:map()
let $all :=
  for $x in cts:element-words(fn:QName("http://grtjn.nl/twitter/utils";, 
"text"), "collation=http://marklogic.com/collation/nl/S1/AS/T00BB";)
  return map:put($map, cts:stem($x), $x)
return (
fn:concat(xs:string(fn:count(map:keys($map))), " unique stems in the database"),
fn:concat(fn:count(cts:words()), " unique words in the database
"),
map:keys($map) )

Note that I specify a specific collation, but that seems to get ignored. Can 
anyone confirm this behavior?

Kind regards,
Geert

Van: 
[email protected]<mailto:[email protected]>
 
[mailto:[email protected]<mailto:[email protected]>]
 Namens Danny Sokolsky
Verzonden: zaterdag 12 mei 2012 0:13
Aan: MarkLogic Developer Discussion
Onderwerp: Re: [MarkLogic Dev General] Term with same stem

If you have a word lexicon you can do something like this to get information 
about your words and stems:

let $map := map:map()
let $all :=
  for $x in cts:words()
  return map:put($map, cts:stem($x), $x)
return (
fn:concat(xs:string(fn:count(map:keys($map))), " unique stems in the database"),
fn:concat(fn:count(cts:words()), " unique words in the database
"),
map:keys($map) )

-Danny

From: 
[email protected]<mailto:[email protected]>
 
[mailto:[email protected]]<mailto:[mailto:[email protected]]>
 On Behalf Of Michael Blakeley
Sent: Friday, May 11, 2012 2:02 PM
To: MarkLogic Developer Discussion
Cc: MarkLogic Developer Discussion
Subject: Re: [MarkLogic Dev General] Term with same stem

If stemming=advanced I think cts:stem will do that. With basic the best you can 
do is to pass terms to cts:stem and see if they have the same stem.
-- Mike

On May 11, 2012, at 13:39, Abhishek53 S 
<[email protected]<mailto:[email protected]>> wrote:
Hi Folks,

Is it possible to get the all terms that have same stem from Marklogic 
database? I want to get all terms that belongs to the same stem.

Thanks & Regards
Abhishek Srivastav
Systems Engineer
Tata Consultancy Services
Cell:- +91-9883389968
Mailto: [email protected]<mailto:[email protected]>
Website: http://www.tcs.com<http://www.tcs.com/>
____________________________________________
Experience certainty. IT Services
Business Solutions
Outsourcing

=====-----=====-----=====
Notice: The information contained in this e-mail
message and/or attachments to it may contain
confidential or privileged information. If you are
not the intended recipient, any dissemination, use,
review, distribution, printing or copying of the
information contained in this e-mail message
and/or attachments to it are strictly prohibited. If
you have received this communication in error,
please notify us by reply e-mail or telephone and
immediately and permanently delete the message
and any attachments. Thank you
_______________________________________________
General mailing list
[email protected]<mailto:[email protected]>
http://developer.marklogic.com/mailman/listinfo/general
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to