Hi Eric,

I had a similar problem (http://aclweb.org/anthology/W/W16/W16-2114.pdf) and 
solved it by calculating a normalized Jaccard coefficient for different 
cut-offs, such as only the first 3 or 5 words in both lists.
Regards,

Johannes
_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list
Corpora@uib.no
http://mailman.uib.no/listinfo/corpora

Reply via email to