On Jan 13, 2012, at 9:39 AM, Rob Lifford wrote: > What's the state of the art on email scraper bots these days?
The topic was covered in a security class I took in 2008, so my memory is hazy and this is out of date, however... Most of the bots at the time were still fairly naive in their implementation, but there were some bots (1 or 2, iirc) that not only pretended (successfully) to be a browser, but that these more advanced bots would also imitate a browser, executing javascript and pulling information about modified content. Additionally, they would search for text that looked like email (not only mailto items), including items like "name (at) domain (dot) com" and variants. I also think there was a screen-grabbing and OCR bot, but I can't remember if that was for email scrapping or captcha breaking... I would recommend not putting any personally identifying information on a website (public or otherwise), but most people think I'm super paranoid. Hope that helps, Andrew -- Our Web site: http://www.RefreshAustin.org/ You received this message because you are subscribed to the Google Groups "Refresh Austin" group. [ Posting ] To post to this group, send email to [email protected] Job-related postings should follow http://tr.im/refreshaustinjobspolicy We do not accept job posts from recruiters. [ Unsubscribe ] To unsubscribe from this group, send email to [email protected] [ More Info ] For more options, visit this group at http://groups.google.com/group/Refresh-Austin
