[ 
https://issues.apache.org/jira/browse/DIRSERVER-1965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064947#comment-14064947
 ] 

Emmanuel Lecharny commented on DIRSERVER-1965:
----------------------------------------------

Some other LDAP servers are indexing triplets. For instance, a sentence like :
"Hello World!" will be indexed using the following triplets :
'hel', 'ell', 'llo', 'lo ', 'o w'...

If we have a N letters sentence, we will create N-3 entries in the index. This 
is very expensive. OTOH, it allows all the kind of searches for substring.

We might want to evaluate the pros and cons of such an approach.

> An Index should speed up searches starting with '*'
> ---------------------------------------------------
>
>                 Key: DIRSERVER-1965
>                 URL: https://issues.apache.org/jira/browse/DIRSERVER-1965
>             Project: Directory ApacheDS
>          Issue Type: Improvement
>          Components: ldap
>    Affects Versions: 2.0.0-M16
>         Environment: NA
>            Reporter: Ernst Bech
>            Assignee: Kiran Ayyagari
>            Priority: Blocker
>
> As of now an index will speed up searches for an exact match string and a 
> substring ending with a "*".
> It does NOT currently speed up searches starting with a "*", which is what we 
> need to be implemented.
> Example: in our unified messaging application - when we receive an incoming 
> call - we have to find a user whose telephone number ends with the digits 
> signaled to us by the telephone network.
> Let's say a user has a telephone number +49(777)12345678. The telephone 
> network only signals its extension 678. We thus search for *678 to get a list 
> of possible users and by applying some extra magic we are able to pick the 
> right one.
> The problem is: we need to do this fast, because taking the call depends on 
> it.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to