[Zope] Re: [Zope-dev] ZCatalog and 'fuzzy logic'

2001-01-10 Thread Morten W. Petersen

 I do not think that "fuzzy logic" is strongly related to "regexp-like".
 Anyway.
 
 Fuzzy searching often means "finding matches with characters omitted,
 replaced or inserted".

It seems I misunderstood the term fuzzy logic myself.  Fuzzy logic means
if I search for a word, for example 'programmer', it will return matches
to the words 'program', 'programming','programmable' etc.

I.e., it will somewhat intelligently return words that are similar in
what they mean, using grammar rules (chopping off endings of words and
making them match others).

Hmm.

Cheers,

Morten


___
Zope maillist  -  [EMAIL PROTECTED]
http://lists.zope.org/mailman/listinfo/zope
**   No cross posts or HTML encoding!  **
(Related lists - 
 http://lists.zope.org/mailman/listinfo/zope-announce
 http://lists.zope.org/mailman/listinfo/zope-dev )




[Zope] Re: [Zope-dev] ZCatalog and 'fuzzy logic'

2001-01-10 Thread Dieter Maurer

Morten W. Petersen writes:
  It seems I misunderstood the term fuzzy logic myself.  Fuzzy logic means
  if I search for a word, for example 'programmer', it will return matches
  to the words 'program', 'programming','programmable' etc.
This, usually, is called "stemming".
Though, your examples indicate quite a strong form of it.

If you have some tool, maybe LinguistX, that map from a word
to its stem and then from the stem to all words with this as
stem (or directly give the stem equivalence class of a word),
then it is quite easy to incorporate that in Zope's catalog.

However, to do that cleanly, you will need good algorithms
and/or large dictionaries. This, usually, is not free of
charge.



Dieter

___
Zope maillist  -  [EMAIL PROTECTED]
http://lists.zope.org/mailman/listinfo/zope
**   No cross posts or HTML encoding!  **
(Related lists - 
 http://lists.zope.org/mailman/listinfo/zope-announce
 http://lists.zope.org/mailman/listinfo/zope-dev )




[Zope] Re: [Zope-dev] ZCatalog and 'fuzzy logic'

2001-01-10 Thread Casey Duncan

--- "Morten W. Petersen" [EMAIL PROTECTED] wrote:
[snip]
 
 It seems I misunderstood the term fuzzy logic
 myself.  Fuzzy logic means
 if I search for a word, for example 'programmer', it
 will return matches
 to the words 'program', 'programming','programmable'
 etc.
 
 I.e., it will somewhat intelligently return words
 that are similar in
 what they mean, using grammar rules (chopping off
 endings of words and
 making them match others).
 
 Hmm.
 
 Cheers,
 
 Morten
 

ZCatalog TextIndexes support this type of "wildcard"
searching. I posted a message a couple of weeks ago
that describes the query syntax. Search the mailing
list archives for it.


=
| Casey Duncan
| Kaivo, Inc.
| [EMAIL PROTECTED]
`-

__
Do You Yahoo!?
Yahoo! Photos - Share your holiday photos online!
http://photos.yahoo.com/

___
Zope maillist  -  [EMAIL PROTECTED]
http://lists.zope.org/mailman/listinfo/zope
**   No cross posts or HTML encoding!  **
(Related lists - 
 http://lists.zope.org/mailman/listinfo/zope-announce
 http://lists.zope.org/mailman/listinfo/zope-dev )




[Zope] Re: [Zope-dev] ZCatalog and 'fuzzy logic'

2001-01-10 Thread Ken Manheimer

On Wed, 10 Jan 2001, Morten W. Petersen wrote:

  I do not think that "fuzzy logic" is strongly related to "regexp-like".
  Anyway.
  
  Fuzzy searching often means "finding matches with characters omitted,
  replaced or inserted".
 
 It seems I misunderstood the term fuzzy logic myself.  Fuzzy logic means
 if I search for a word, for example 'programmer', it will return matches
 to the words 'program', 'programming','programmable' etc.

I think your talking about something else.  Last i checked, "fuzzy logic"
was a logical algebra based on the existence of intermediate truth states,
between "true" and "false".  It has little or nothing to do with
aproximate searching, though i guess you could use it to make assertions
about the aproximations.  I think what you all are talking about is "fuzzy
matching".

 I.e., it will somewhat intelligently return words that are similar in
 what they mean, using grammar rules (chopping off endings of words and
 making them match others).

There are also matching mechanisms like soundex, that account for
misspelling by translating words to phonetic-equivalent normalized codes,
and comparing on that basis.

Ken
[EMAIL PROTECTED]


___
Zope maillist  -  [EMAIL PROTECTED]
http://lists.zope.org/mailman/listinfo/zope
**   No cross posts or HTML encoding!  **
(Related lists - 
 http://lists.zope.org/mailman/listinfo/zope-announce
 http://lists.zope.org/mailman/listinfo/zope-dev )




[Zope] Re: [Zope-dev] ZCatalog and 'fuzzy logic'

2001-01-09 Thread Dieter Maurer

Morten W. Petersen writes:
  Is there anyone who could try to give an estimate of how long it would
  take to add fuzzy logic (regexp-like) searching capability to the
  ZCatalog?
I do not think that "fuzzy logic" is strongly related to "regexp-like".
Anyway.

Fuzzy searching often means "finding matches with characters omitted,
replaced or inserted".

Zope's globbing vocabularies support wildcards '*' and '?'.
To implement wildcard based searches efficiently, they
index words under their two letter consitutents.
When you now get a pattern, you derive from the pattern
what two letter constituents the matching words must
have and retrieve them. This defines a candidate word set.
Then you check, whether the retrieved word really match
the expression.

You can extend this algorithm to get fuzzy searches.



Dieter

___
Zope maillist  -  [EMAIL PROTECTED]
http://lists.zope.org/mailman/listinfo/zope
**   No cross posts or HTML encoding!  **
(Related lists - 
 http://lists.zope.org/mailman/listinfo/zope-announce
 http://lists.zope.org/mailman/listinfo/zope-dev )