Are you talking about matching sections of text (copyright/plagerism issues)

Rough similarity (similar size, number of words)
Similar content groupings (word count, number of matches)
Whole phrase matching


On 3/5/07, Ben Doom <[EMAIL PROTECTED]> wrote:
> "Similar" is a rather vague description.  Are you looking for overlap?
> Length?  Number of characters in the same location in each string?
> Number of similar characters?
>
> Soundex is a good start if what your are looking for is similarity in a
> sounds-like or spell-check sense, but isn't very helpful for UUIDs.
>
> --Ben Doom
>
> Chris Long wrote:
> > I'm looking for an algorithm that will take two pieces of text as input and 
> > return some comparison of those texts, and how similar they are to one 
> > another.  Ideally I would like it to return a number which would indicate 
> > the level of similarity, but I c
> > an probably work with just about anything.
> >
> > Are there any good algorithms out there for doing this?  Any help would be 
> > much appreciated.

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|
Upgrade to Adobe ColdFusion MX7
Experience Flex 2 & MX7 integration & create powerful cross-platform RIAs
http://www.adobe.com/products/coldfusion/flex2/

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:271636
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: 
http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.4

Reply via email to