Perhaps this is what Jan had in mind:

Tanimoto coefficient (extended Jaccard coefficient)

"Cosine similarity is a measure of similarity between two vectors of n 
dimensions by finding the angle between them, often used to compare documents 
in 
text mining."

http://en.wikipedia.org/wiki/Jaccard_index

--
David Mitchell

On 12/20/2009 22:13, Henry Rich wrote:
> Jan,
>
> I have looked for a description of Tanimoto distance but have not found
> anything useful.  Can you describe what it is or point to a description
> of it?
>
> Henry Rich
>
> Jan Jacobs wrote:
>> Henry,
>> very good. For longer strings it is more than double so fast as the previous
>> version. In my test cases it even consumes less memory.
>> Is it possible to include this as a native function in J (e.g. overloading
>> -. or s:)?
>> Same question but now for Tanimoto distance?
>> Jan.
>>
>>
>> On 12/19/09, R.E. Boss<r.e.b...@planet.nl>  wrote:
>>> Smart analysis. Chapeau!
>>>
>>>
>>> R.E. Boss
>>>
>>>
>>> -----Oorspronkelijk bericht-----
>>> Van: programming-boun...@jsoftware.com
>>> [mailto:programming-boun...@jsoftware.com] Namens Henry Rich
>>> Verzonden: zaterdag 19 december 2009 3:00
>>> Aan: Programming forum
>>> Onderwerp: [Jprogramming] Levenshtein distance
>>>
>>> I was working with R. E.'s compact implementation of the Levenshtein
>>> distance and I found an interesting equivalence:
>>>
>>> (<.>:)/\.&.|.
>>>
>>> can be replaced by
>>>
>>> (<./\@:- + ]) i...@#
>>>
>>> which uses a little more space but is quite a bit faster for large
>>> operands.  So now I have the version:
>>>
>>> NB. Levenshtein distance between two strings
>>> levdist=: 4 : 0
>>> 'a b'=. (/: #&>)x;y
>>> z=. i.>:#b
>>> for_j. a do.
>>> z=. ((<./\@:- + ]) i...@#) ((j ~: b) + }:z) ({...@] , (<. }.))>:z
>>> end.
>>> {:z
>>> )
>>>
>>>
>>> Henry Rich
>>> ----------------------------------------------------------------------
>>> For information about J forums see http://www.jsoftware.com/forums.htm
>>>
>>> ----------------------------------------------------------------------
>>> For information about J forums see http://www.jsoftware.com/forums.htm
>>>
>>
>>
>>
> ----------------------------------------------------------------------
> For information about J forums see http://www.jsoftware.com/forums.htm
>
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Reply via email to