I think your call to element-words is missing the second parameter; $options
is the 3rd parameter. So I think it should be:
cts:element-words(fn:QName("http://grtjn.nl/twitter/utils", "text"), (),
"collation=http://marklogic.com/collation/nl/S1/AS/T00BB")
It looks like the example in the doc is missing that second arg too--I'll see
if I can get that fixed ;)
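
For reference, here is a minimal sketch of the full query with the extra argument in place. It assumes an element word lexicon exists on the "text" element with that collation. The variable names ($qname, $options, $words, $ignored) are only illustrative, and unlike the original it counts the element-words result instead of calling cts:words(), since there is no word lexicon in that database. It also loops over cts:stem, because that can return more than one stem per word.

```xquery
xquery version "1.0-ml";

(: Assumption: an element word lexicon is configured on
   {http://grtjn.nl/twitter/utils}text with the collation below. :)
let $qname := fn:QName("http://grtjn.nl/twitter/utils", "text")
let $options := "collation=http://marklogic.com/collation/nl/S1/AS/T00BB"
(: 2nd argument is $start; the empty sequence means "start at the first word".
   $options is the 3rd argument. :)
let $words := cts:element-words($qname, (), $options)
let $map := map:map()
let $ignored :=
  for $x in $words
  (: cts:stem can return several stems, so store the word under each one :)
  for $stem in cts:stem($x)
  return map:put($map, $stem, $x)
return (
  fn:concat(fn:count(map:keys($map)), " unique stems in the lexicon"),
  fn:concat(fn:count($words), " unique words in the lexicon"),
  map:keys($map)
)
```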
-Danny
________________________________________
From: [email protected]
[[email protected]] On Behalf Of Geert Josten
[[email protected]]
Sent: Saturday, May 12, 2012 8:52 AM
To: MarkLogic Developer Discussion
Subject: [MarkLogic Dev General] Bug in cts:element-words? (was: Term with
same stem)
Curious how well Danny's idea would perform, I tried it on one of my test
databases with a fair number of tweets (roughly 400K last time I checked). I
had to change cts:words to cts:element-words since I have no word lexicon.
But it fails for me. Did I hit a bug?
let $map := map:map()
let $all :=
for $x in cts:element-words(fn:QName("http://grtjn.nl/twitter/utils",
"text"), "collation=http://marklogic.com/collation/nl/S1/AS/T00BB")
return map:put($map, cts:stem($x), $x)
return (
fn:concat(xs:string(fn:count(map:keys($map))), " unique stems in the database"),
fn:concat(fn:count(cts:words()), " unique words in the database
"),
map:keys($map) )
Note that I specify a specific collation, but that seems to get ignored. Can
anyone confirm this behavior?
Kind regards,
Geert
From:
[email protected]
[mailto:[email protected]]
On Behalf Of Danny Sokolsky
Sent: Saturday, May 12, 2012 12:13 AM
To: MarkLogic Developer Discussion
Subject: Re: [MarkLogic Dev General] Term with same stem
If you have a word lexicon you can do something like this to get information
about your words and stems:
let $map := map:map()
let $all :=
for $x in cts:words()
return map:put($map, cts:stem($x), $x)
return (
fn:concat(xs:string(fn:count(map:keys($map))), " unique stems in the database"),
fn:concat(fn:count(cts:words()), " unique words in the database
"),
map:keys($map) )
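
Note that map:put replaces any earlier value, so the map above ends up with only the last word seen for each stem. To list all terms that share a stem, a variation along these lines might work. It is a rough sketch, again assuming a word lexicon, and the names $word, $stem and $ignored are just for illustration:

```xquery
xquery version "1.0-ml";

(: Assumption: a database-wide word lexicon exists, which cts:words() requires. :)
let $map := map:map()
let $ignored :=
  for $word in cts:words()
  for $stem in cts:stem($word)
  (: append the word to whatever is already stored under this stem :)
  return map:put($map, $stem, (map:get($map, $stem), $word))
for $stem in map:keys($map)
return fn:concat($stem, ": ", fn:string-join(map:get($map, $stem), ", "))
```

On a large lexicon this holds every word in memory, so it is probably best tried on a test database first.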
-Danny
From:
[email protected]
[mailto:[email protected]]
On Behalf Of Michael Blakeley
Sent: Friday, May 11, 2012 2:02 PM
To: MarkLogic Developer Discussion
Cc: MarkLogic Developer Discussion
Subject: Re: [MarkLogic Dev General] Term with same stem
If stemming=advanced I think cts:stem will do that. With basic the best you can
do is to pass terms to cts:stem and see if they have the same stem.
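
As a rough illustration of the second approach, something like the following compares two terms by their stems. The function name local:share-stem and the sample words are made up for the example. It relies only on cts:stem and on the XQuery general comparison, which is true when any value on the left equals any value on the right.

```xquery
xquery version "1.0-ml";

(: Hypothetical helper: true if the two terms have at least one stem in common. :)
declare function local:share-stem(
  $a as xs:string,
  $b as xs:string
) as xs:boolean
{
  (: general "=" compares every stem of $a against every stem of $b :)
  cts:stem($a) = cts:stem($b)
};

local:share-stem("running", "runs")
```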
-- Mike
On May 11, 2012, at 13:39, Abhishek53 S
<[email protected]> wrote:
Hi Folks,
Is it possible to get all the terms that have the same stem from a MarkLogic
database? I want to get all terms that belong to the same stem.
Thanks & Regards
Abhishek Srivastav
Systems Engineer
Tata Consultancy Services
Cell:- +91-9883389968
Mailto: [email protected]
Website: http://www.tcs.com
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general