Thanks for the suggestions. I'm going to play around with this.

On the most part, I'm doing detection for scenarios with two names, so 
names like Robert De Niro will not come up.

-A

Rick Denatale wrote:
> On Fri, Mar 5, 2010 at 12:16 PM, Hassan Schroeder
> <[email protected]> wrote:
>>
>> You know this is not something you're going to solve with regular
>> expressions, though, right? �:-)
>>
>> "San Francisco's Jane Smith, quoted in Broder's Washington Post
>> �article, said ..."
>>
>> You need a lot more heuristics than a simple RegEx to reliably find
>> names in a block of text.
> 
> Some other cases to consider
> 
> John Phillip Sousa (or if you're a kid a heart John Jacob Jingelheimer
> Smith) not to mention Spanish names which can have MANY parts.
> 
> Robert De Niro
> 
> Jesus Mary and Joseph
> 
> Surnames with origins in some languages don't start with a capital
> 
> Michael Henry de Young  - Dutch
> 
> Wernher von Braun - German
> 
> 
> 
> 
> 
> --
> Rick DeNatale
> 
> Blog: http://talklikeaduck.denhaven2.com/
> Twitter: http://twitter.com/RickDeNatale
> WWR: http://www.workingwithrails.com/person/9021-rick-denatale
> LinkedIn: http://www.linkedin.com/in/rickdenatale

-- 
Posted via http://www.ruby-forum.com/.

-- 
You received this message because you are subscribed to the Google Groups "Ruby 
on Rails: Talk" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/rubyonrails-talk?hl=en.

Reply via email to