Ich bin bis einschlielich dem 25.4.2014 in Urlaub.
Ihre Mail wird nicht automatisch weitergeleitet.
Bitte wenden Sie sich bei Fragen an unser Sekretariat. 





mit freundlichen Gren
 
   Lothar Bubel
- Geschftsfhrer - 
 
CAIGOS GmbH
Am Neunkircher Weg 3
66459 Kirkel
 
zentral  +49 6849 - 6004-0 
direkt    +49 6849 - 6004-20
mobil    +49 151-15680750 
 
mail      [email protected]
 
www.caigos.de 
www.geoportal-saar.de
 
CAIGOS GmbH
Eingetragen beim Amtsgericht Homburg HRB 3696
Geschftsfhrer: Lothar Bubel
USt-IdNr.: DE 1380 16 658
Send General mailing list submissions to
        [email protected]

To subscribe or unsubscribe via the World Wide Web, visit
        http://developer.marklogic.com/mailman/listinfo/general
or, via email, send a message with subject or body 'help' to
        [email protected]

You can reach the person managing the list at
        [email protected]

When replying, please edit your Subject line so it is more specific
than "Re: Contents of General digest..."


Today's Topics:

   1. Re: One character wild card search without using filter
      search (Amit Kumar Dev)


----------------------------------------------------------------------

Message: 1
Date: Wed, 9 Apr 2014 10:51:53 +0000
From: Amit Kumar Dev <[email protected]>
Subject: Re: [MarkLogic Dev General] One character wild card search
        without using filter search
To: "[email protected]"
        <[email protected]>
Message-ID: <6A488A0741DC554EAF57B6F92D70AA8F12007668@chnshlmbx11>
Content-Type: text/plain; charset="utf-8"

Hi,

Kindly suggest do I have to go for One character search Index as xdmp:plan for 
Three character search plus word lexicon with codepoint collation (recommended 
Index setting for wild card search) is giving "Lexicon fallback prefix 
expansion exceeded limits: 32768 raw or 512 compressed to prefixes".

Thanks and regards
Amit Kumar Dev

-----Original Message-----
From: Amit Kumar Dev 
Sent: Tuesday, April 01, 2014 12:19 PM
To: 'Mary Holstege'; [email protected]
Subject: RE: [MarkLogic Dev General] One character wild card search without 
using filter search

Hi Mary,

xdmp:plan gives "Lexicon fallback prefix expansion exceeded limits: 32768 raw 
or 512 compressed to prefixes".

Does this mean that I have to go for One character search index since three 
character search plus word lexicon is not returning accurate result.

Kindly suggest.

Below is the xdmp:plan response:

<qry:query-plan xmlns:qry="http://marklogic.com/cts/query";>
        <qry:info-trace>xdmp:eval("xquery version 
&amp;quot;1.0-ml&amp;quot;;&amp;#13;&amp;#10;&amp;#13;&amp;#10;(: Copyrig...", 
(), &lt;options 
xmlns="xdmp:eval"&gt;&lt;database&gt;4854283089665795422&lt;/database&gt;&lt;modules&gt;867829563796094...&lt;/options&gt;)</qry:info-trace>
        <qry:info-trace>Analyzing path for search: fn:doc()</qry:info-trace>
        <qry:info-trace>Step 1 is searchable: fn:doc()</qry:info-trace>
        <qry:info-trace>Path is fully searchable.</qry:info-trace>
        <qry:info-trace>Gathering constraints.</qry:info-trace>
        <qry:info-trace>Lexicon fallback prefix expansion exceeded limits: 
32768 raw or 512 compressed to prefixes</qry:info-trace>
        <qry:info-trace>Search query contributed 1 constraint: 
cts:element-word-query(xs:QName("","title"), "A*", ("wildcarded","lang=en"), 
1)</qry:info-trace>
        <qry:partial-plan>
                <qry:term-query weight="0">
                        <qry:key>12705188407099764864</qry:key>
                </qry:term-query>
        </qry:partial-plan>
        <qry:info-trace>Executing search.</qry:info-trace>
        <qry:final-plan>
                <qry:and-query>
                        <qry:term-query weight="0">
                                <qry:key>12705188407099764864</qry:key>
                        </qry:term-query>
                </qry:and-query>
        </qry:final-plan>
        <qry:info-trace>Selected 3681 fragments</qry:info-trace>
        <qry:result estimate="3681"/>
</qry:query-plan>

Thanks and Regards
Amit Dev

-----Original Message-----
From: Mary Holstege [mailto:[email protected]] 
Sent: Tuesday, April 01, 2014 12:17 AM
To: [email protected]; Amit Kumar Dev
Subject: Re: [MarkLogic Dev General] One character wild card search without 
using filter search


What does the query plan show you?

xdmp:plan(cts:search(doc(), ..."unfiltered"))

Could be lexicon expansion is too large or is being defeated in some way.  The 
one character index will give you accurate results, but you will pay for it in 
disk space.

//Mary

On Mon, 31 Mar 2014 00:53:32 -0700, Amit Kumar Dev <[email protected]>
wrote:

> Thanks! Mary
>
> As suggested I have enabled three character search and word lexicon 
> with codepoint collation.
> Also additionally enabled trailing wildcard search index.
> But still search is not returning accurate response for single 
> character and two character wild card search.
>
> MarkLogic version is 6.0-2.3
>
> Below is the sample query:
> xquery version "1.0-ml";
>
> let $searchQuery := cts:query(
>                       <cts:element-word-query>
>                               <cts:element>title</cts:element>
>                               <cts:text>A*</cts:text>
>                                 <cts:option>wildcarded</cts:option>
>                       </cts:element-word-query>)
> return cts:search(fn:doc(), $searchQuery, "unfiltered")[1 to 10]
>
> This query also returns documents which do not have any word starts 
> with "A" in "title" element.
>
> I also tried by creating "element word lexicon" for title element, but 
> still same result.
>
> When I enabled "one character search" I got accurate response, since 
> this index is not recommended by MarkLogic (as three character search 
> and word lexicon with codepoint collation already enabled).
>
> Kindly suggest do I need to enable single character index or is there 
> any other alternative to achieve the same.
>
> Thanks and regards
> Amit Dev
>
> -----Original Message-----
> From: Mary Holstege [mailto:[email protected]]
> Sent: Friday, March 14, 2014 3:56 AM
> To: [email protected]; Amit Kumar Dev
> Subject: Re: [MarkLogic Dev General] One character wild card search 
> without using filter search
>
>
>
> On Tue, 11 Mar 2014 20:53:52 -0700, Amit Kumar Dev 
> <[email protected]> wrote:
>
>> Hi All,
>>
>>
>> *         I am performing one character wild card search on title  
>> element.
>> i.e.
>> <cts:element-query>
>>                 <cts:element>Title</cts:element>
>>                                 <cts:word-query weight="2">
>>                                 <cts:text>D*</cts:text>
>>                                 <cts:option>wildcarded</cts:option>
>>                                 <cts:option>case-insensitive</cts:option>
>>                                 <cts:option>stemmed</cts:option>
>>                                 </cts:word-query> 
>> </cts:element-query>
>>
>>
>> *         I am performing unfiltered search.
>>
>> *         Only Three & Two character wild card index is enable, but  
>> element word lexicon for Title is not created.
>>
>> After performing search I get documents which has Title with Diamond, 
>> Delegate etc.., but I am also getting documents which does not have 
>> any words starting with D in Title element.
>>
>> My queries are:
>>
>> *         Is this behavior because of unfiltered search?
>>
>> *         What is the best possible way to achieve one character wild  
>> card search without using filter search?
>
> Look at what xdmp:plan shows you for this query. Since you don't have 
> any index capable of resolving the D* part of it, you will just be 
> asking the index for all documents containing Title elements. 
> Unfiltered search is search using only the indexes.
>
> In general we would recommend using 3 character plus a codepoint word 
> lexicon if you want to do wildcard queries. It may help to add the 
> trailing wildcard index as well. If you have the word lexicon, the 
> two-character index doesn't give you a lot.
>
> Also note that and element-query containing a word-query is not the 
> same as an element-word-query, and to accurately resolve the former 
> you will need element and word positions.
> (Difference being: <book><Title><BREAK>Dog</BREAK></Title></book>
> matches the element-query, but not the element-word-query unless BREAK 
> is declared as an element-word-query-through.)
>
> //Mary
>
>
> **************** CAUTION - Disclaimer ***************** This e-mail 
> contains PRIVILEGED AND CONFIDENTIAL INFORMATION intended solely for 
> the use of the addressee(s). If you are not the intended recipient, 
> please notify the sender by e-mail and delete the original message. 
> Further, you are not to copy, disclose, or distribute this e-mail or 
> its contents to any other person and any such actions are unlawful. 
> This e-mail may contain viruses. Infosys has taken every reasonable 
> precaution to minimize this risk, but is not liable for any damage you 
> may sustain as a result of any virus in this e-mail. You should carry 
> out your own virus checks before opening the e-mail or attachment. 
> Infosys reserves the right to monitor and review the content of all 
> messages sent to or from this e-mail address. Messages sent to or from 
> this e-mail address may be stored on the Infosys e-mail system.
> ***INFOSYS******** End of Disclaimer ********INFOSYS***


--
Using Opera's revolutionary email client: http://www.opera.com/mail/

------------------------------

_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general


End of General Digest, Vol 118, Issue 10
****************************************
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to