I am on vacation until and including 25 April 2014. Your mail will not be forwarded automatically. If you have any questions, please contact our office.
Kind regards,
Lothar Bubel
- Managing Director -
CAIGOS GmbH
Am Neunkircher Weg 3
66459 Kirkel
main +49 6849 - 6004-0
direct +49 6849 - 6004-20
mobile +49 151-15680750
mail [email protected]
www.caigos.de
www.geoportal-saar.de
CAIGOS GmbH
Registered at Amtsgericht Homburg, HRB 3696
Managing Director: Lothar Bubel
VAT ID: DE 1380 16 658

Send General mailing list submissions to
    [email protected]

To subscribe or unsubscribe via the World Wide Web, visit
    http://developer.marklogic.com/mailman/listinfo/general
or, via email, send a message with subject or body 'help' to
    [email protected]

You can reach the person managing the list at
    [email protected]

When replying, please edit your Subject line so it is more specific than "Re: Contents of General digest..."

Today's Topics:

   1. Re: One character wild card search without using filter search (Amit Kumar Dev)

----------------------------------------------------------------------

Message: 1
Date: Wed, 9 Apr 2014 10:51:53 +0000
From: Amit Kumar Dev <[email protected]>
Subject: Re: [MarkLogic Dev General] One character wild card search without using filter search
To: "[email protected]" <[email protected]>
Message-ID: <6A488A0741DC554EAF57B6F92D70AA8F12007668@chnshlmbx11>
Content-Type: text/plain; charset="utf-8"

Hi,

Kindly suggest whether I have to enable the one character search index. With three character search plus a word lexicon with codepoint collation (the recommended index setting for wildcard search), xdmp:plan reports "Lexicon fallback prefix expansion exceeded limits: 32768 raw or 512 compressed to prefixes".

Thanks and regards
Amit Kumar Dev

-----Original Message-----
From: Amit Kumar Dev
Sent: Tuesday, April 01, 2014 12:19 PM
To: 'Mary Holstege'; [email protected]
Subject: RE: [MarkLogic Dev General] One character wild card search without using filter search

Hi Mary,

xdmp:plan gives "Lexicon fallback prefix expansion exceeded limits: 32768 raw or 512 compressed to prefixes".
Does this mean that I have to enable the one character search index, since three character search plus the word lexicon is not returning accurate results? Kindly suggest.

Below is the xdmp:plan response:

<qry:query-plan xmlns:qry="http://marklogic.com/cts/query">
  <qry:info-trace>xdmp:eval("xquery version &quot;1.0-ml&quot;;&#13;&#10;&#13;&#10;(: Copyrig...", (), <options xmlns="xdmp:eval"><database>4854283089665795422</database><modules>867829563796094...</options>)</qry:info-trace>
  <qry:info-trace>Analyzing path for search: fn:doc()</qry:info-trace>
  <qry:info-trace>Step 1 is searchable: fn:doc()</qry:info-trace>
  <qry:info-trace>Path is fully searchable.</qry:info-trace>
  <qry:info-trace>Gathering constraints.</qry:info-trace>
  <qry:info-trace>Lexicon fallback prefix expansion exceeded limits: 32768 raw or 512 compressed to prefixes</qry:info-trace>
  <qry:info-trace>Search query contributed 1 constraint: cts:element-word-query(xs:QName("","title"), "A*", ("wildcarded","lang=en"), 1)</qry:info-trace>
  <qry:partial-plan>
    <qry:term-query weight="0">
      <qry:key>12705188407099764864</qry:key>
    </qry:term-query>
  </qry:partial-plan>
  <qry:info-trace>Executing search.</qry:info-trace>
  <qry:final-plan>
    <qry:and-query>
      <qry:term-query weight="0">
        <qry:key>12705188407099764864</qry:key>
      </qry:term-query>
    </qry:and-query>
  </qry:final-plan>
  <qry:info-trace>Selected 3681 fragments</qry:info-trace>
  <qry:result estimate="3681"/>
</qry:query-plan>

Thanks and Regards
Amit Dev

-----Original Message-----
From: Mary Holstege [mailto:[email protected]]
Sent: Tuesday, April 01, 2014 12:17 AM
To: [email protected]; Amit Kumar Dev
Subject: Re: [MarkLogic Dev General] One character wild card search without using filter search

What does the query plan show you?

xdmp:plan(cts:search(doc(), ..."unfiltered"))

Could be lexicon expansion is too large or is being defeated in some way. The one character index will give you accurate results, but you will pay for it in disk space.
//Mary

On Mon, 31 Mar 2014 00:53:32 -0700, Amit Kumar Dev <[email protected]> wrote:

> Thanks! Mary
>
> As suggested I have enabled three character search and word lexicon
> with codepoint collation.
> I have additionally enabled the trailing wildcard search index.
> But the search is still not returning accurate responses for single
> character and two character wild card searches.
>
> MarkLogic version is 6.0-2.3
>
> Below is the sample query:
> xquery version "1.0-ml";
>
> let $searchQuery := cts:query(
>   <cts:element-word-query>
>     <cts:element>title</cts:element>
>     <cts:text>A*</cts:text>
>     <cts:option>wildcarded</cts:option>
>   </cts:element-word-query>)
> return cts:search(fn:doc(), $searchQuery, "unfiltered")[1 to 10]
>
> This query also returns documents which do not have any word starting
> with "A" in the "title" element.
>
> I also tried creating an "element word lexicon" for the title element,
> but got the same result.
>
> When I enabled "one character search" I got accurate responses, but
> this index is not recommended by MarkLogic (as three character search
> and word lexicon with codepoint collation are already enabled).
>
> Kindly suggest whether I need to enable the single character index or
> whether there is any other alternative to achieve the same.
>
> Thanks and regards
> Amit Dev
>
> -----Original Message-----
> From: Mary Holstege [mailto:[email protected]]
> Sent: Friday, March 14, 2014 3:56 AM
> To: [email protected]; Amit Kumar Dev
> Subject: Re: [MarkLogic Dev General] One character wild card search
> without using filter search
>
>
>
> On Tue, 11 Mar 2014 20:53:52 -0700, Amit Kumar Dev
> <[email protected]> wrote:
>
>> Hi All,
>>
>>
>> * I am performing one character wild card search on title
>> element.
>> i.e.
>> <cts:element-query>
>>   <cts:element>Title</cts:element>
>>   <cts:word-query weight="2">
>>     <cts:text>D*</cts:text>
>>     <cts:option>wildcarded</cts:option>
>>     <cts:option>case-insensitive</cts:option>
>>     <cts:option>stemmed</cts:option>
>>   </cts:word-query>
>> </cts:element-query>
>>
>>
>> * I am performing unfiltered search.
>>
>> * Only the three and two character wild card indexes are enabled;
>> an element word lexicon for Title is not created.
>>
>> After performing the search I get documents which have a Title with
>> Diamond, Delegate etc., but I am also getting documents which do not
>> have any words starting with D in the Title element.
>>
>> My queries are:
>>
>> * Is this behavior because of unfiltered search?
>>
>> * What is the best possible way to achieve one character wild
>> card search without using filter search?
>
> Look at what xdmp:plan shows you for this query. Since you don't have
> any index capable of resolving the D* part of it, you will just be
> asking the index for all documents containing Title elements.
> Unfiltered search is search using only the indexes.
>
> In general we would recommend using 3 character plus a codepoint word
> lexicon if you want to do wildcard queries. It may help to add the
> trailing wildcard index as well. If you have the word lexicon, the
> two-character index doesn't give you a lot.
>
> Also note that an element-query containing a word-query is not the
> same as an element-word-query, and to accurately resolve the former
> you will need element and word positions.
> (Difference being: <book><Title><BREAK>Dog</BREAK></Title></book>
> matches the element-query, but not the element-word-query unless BREAK
> is declared as an element-word-query-through.)
>
> //Mary
>
>
> **************** CAUTION - Disclaimer *****************
> This e-mail contains PRIVILEGED AND CONFIDENTIAL INFORMATION intended
> solely for the use of the addressee(s).
> If you are not the intended recipient, please notify the sender by
> e-mail and delete the original message. Further, you are not to copy,
> disclose, or distribute this e-mail or its contents to any other
> person and any such actions are unlawful. This e-mail may contain
> viruses. Infosys has taken every reasonable precaution to minimize
> this risk, but is not liable for any damage you may sustain as a
> result of any virus in this e-mail. You should carry out your own
> virus checks before opening the e-mail or attachment. Infosys reserves
> the right to monitor and review the content of all messages sent to or
> from this e-mail address. Messages sent to or from this e-mail address
> may be stored on the Infosys e-mail system.
> ***INFOSYS******** End of Disclaimer ********INFOSYS***

--
Using Opera's revolutionary email client: http://www.opera.com/mail/

------------------------------

_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general

End of General Digest, Vol 118, Issue 10
****************************************
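
For anyone following this thread, one way to see how far the indexes fall short for a wildcard such as "A*" is to compare the index-only estimate with a filtered count. The following is only a minimal sketch, assuming the same "title" element and "A*" pattern as in the posted query; the wrapper element names (wildcard-check and its children) are made up for illustration and are not part of any MarkLogic API.

```xquery
xquery version "1.0-ml";

(: Same shape as the query posted in the thread: element "title", pattern "A*". :)
let $query := cts:element-word-query(xs:QName("title"), "A*", ("wildcarded"))

(: Index-only number: what an unfiltered search would select from the indexes. :)
let $unfiltered := xdmp:estimate(cts:search(fn:doc(), $query))

(: Filtered number: each candidate is re-checked against the document content.
   This reads the candidate fragments, so it can be slow on a large database. :)
let $filtered := fn:count(cts:search(fn:doc(), $query, "filtered"))

return
  (: Hypothetical wrapper elements, used only to label the two numbers. :)
  <wildcard-check>
    <unfiltered-estimate>{$unfiltered}</unfiltered-estimate>
    <filtered-count>{$filtered}</filtered-count>
  </wildcard-check>
```

If the two numbers differ, the indexes alone are not resolving the pattern, which would be consistent with the "Lexicon fallback prefix expansion exceeded limits" trace above. If they agree after an index change, the unfiltered search is likely accurate for that pattern.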
