I have a screen scraping library called Scraper that just needs a
little cleanup and a release; I just haven't found a chance to do it
yet. I'm not too familiar with WWW::Mechanize, but as I recall its
usage wasn't very clean.

Here's what Scraper looks like:
# a session tracks cookies so you can log in to web-based
# applications that don't use Basic auth
s = Scraper::Session.new
# use the session to grab a url
r = s.get "http://www.google.com/"
# fill out a form
form = r.form
form[:q] = "ruby scraper"
# submit the form and get the resulting page
results = form.submit
# browse the web by following a couple of links, and show the
# resulting page
puts results.links.last.follow.links.last.follow.html
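(If you're wondering what the Session buys you, here's roughly the
cookie plumbing it hides -- just a sketch using plain net/http with a
placeholder URL, not Scraper's actual internals:)

require 'net/http'
require 'uri'

uri = URI.parse("http://www.example.com/login")
http = Net::HTTP.new(uri.host, uri.port)
# first request: the server hands back a session cookie
response = http.post(uri.path, "user=me&pass=secret")
cookie = response["Set-Cookie"]
# later requests: send the cookie back by hand on every call
response = http.get("/account", "Cookie" => cookie) if cookie
puts response.body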
Anyone interested?
On Oct 6, 2005, at 9:36 AM, John Labovitz wrote:
Also, if anyone knows of alternatives to Watir for cross-platform
browsers, let me know.
It's not exactly the same thing, but WWW::Mechanize is pretty good
for scraping and controlling websites. I've had a bunch of
experience using it to implement front-ends and data-suckers for a
client.
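For flavor, a typical Mechanize session looks something like this
(from memory, so the exact method names may be off -- check the
RDoc):

require 'rubygems'
require 'mechanize'

agent = WWW::Mechanize.new
# fetch a page
page = agent.get("http://www.google.com/")
# fill in and submit the first form on the page
form = page.forms.first
form.fields.find { |f| f.name == "q" }.value = "ruby scraper"
results = agent.submit(form)
# print the links on the results page
results.links.each { |link| puts link.href }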
Unfortunately, WWW::Mechanize doesn't have very good docs or
support. It's one of those back-burner ideas of mine to write up a
how-to article.
You can get it as a gem -- "mechanize".
The homepage is currently busted, but is usually at
http://www.ntecs.de/blog/Blog/WWW-Mechanize.rdoc
--John
_______________________________________________
PDXRuby mailing list
[email protected]
IRC: #pdx.rb on irc.freenode.net
http://lists.pdxruby.org/mailman/listinfo/pdxruby