What you are trying to do is called record linkage. There is a fair amount of info in the Lucene archives on this, see http://www.lucidimagination.com/search/?q=record+linkage

As Erick says, you will need more info than just the name to do this. I doubt you will be able to get completely automated, but you should be able to obtain good results such that the outliers are easy for a human to handle.

-Grant

On Mar 19, 2009, at 12:47 PM, bruce wrote:

Hi...

This may/may not have anything to do with Lucene/Nutch, but I figured I'd
ask/post anyway.

I'm working on a project, dealing with courses/classes on college sites. I'm trying to figure out how to create an automated process where I can create a
process to link a given faculty member to a given class..

The issue arises where a class will have an instructor. However, there might
be multiple instructors with the same name listed in the faculty/staff
directory for the college. I'm interested in various/different approaches to
solving this issue. My hope is that someone here might have a
pointer.reference to areas that I can examine/research to figure this out..

Thanks in advance for any assistance with this issue.

Thanks




---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to