This is a hard problem.

If you go by degrees and shades of synonymity, it can (and has been)
done manually - see Visual Thesaurus (http://
www.visualthesaurus.com/).

But for grouping based on the same semantic topics - that's pretty
difficult. You could do it based on co-location in a corpus, but
there'd be a lot of noise and results wouldn't be perfect. Maybe by
using a syntactic parser to pull out nouns and verbs you could improve
the results a bit. You could probably get something. Your best bet is
definitely a statistical, corpus-linguistics based approach..

Actual semantic parsing isn't anywhere close, yet. The consensus of
those I've talked to seems to be that we won't have "full" semantic
language processing until we have strong AI - in fact, the two are
very probably equivalent. Natural language is just too full of
ambiguity and context. For example, there's probably a great many
*people* in the world who'd have a hard time linking the words "fish"
and "reel", if they haven't fished before. And a lot of people would
probably associate "reel" with "dance" more strongly than "fish". The
task potentially would strain even the verbal prowess of a non-
linguist human, and computers are much more limited.



On Jul 28, 4:58 pm, Daniel <doubleagen...@gmail.com> wrote:
> I want to write a clojure program that searches for similarities of
> words in the english language and places them in a graph, where the
> distance between nodes indicates their similarity.  I don't mean
> syntactical similarity.  Related contextual meaning is closer to the
> mark.
>
> For instance: "fish" and "reel" don't have much similarity, but in the
> context of fishing they do, so the distance in such a graph wouldn't
> be very large.
>
> I'm sure research has been done in this area (I suspect with no small
> portion belonging to google), so can anybody point me in the right
> direction?
>
> Thanks.

-- 
You received this message because you are subscribed to the Google
Groups "Clojure" group.
To post to this group, send email to clojure@googlegroups.com
Note that posts from new members are moderated - please be patient with your 
first post.
To unsubscribe from this group, send email to
clojure+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/clojure?hl=en

Reply via email to