Re: [CODE4LIB] screen scraping

2011-10-03 Thread Genny Engel
Genny Engel Sonoma County Library gen...@sonoma.lib.ca.us 707 545-0831 x581 www.sonomalibrary.org -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Nate Hill Sent: Sunday, October 02, 2011 7:23 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] sc

Re: [CODE4LIB] screen scraping

2011-10-03 Thread Nate Vack
On Sun, Oct 2, 2011 at 9:35 PM, Reese, Terry wrote: > In Canada, the BC Supreme Court ruled that screen scrapping real estate > listings from one site and using them on another indeed infringed on > copyright.  Not sure if this would cover your use -- but if you are coming > from Canada, it mig

Re: [CODE4LIB] screen scraping

2011-10-03 Thread Simon Spero
On Oct 3, 2011 9:19 AM, "Ed Summers" wrote: > On Sun, Oct 2, 2011 at 10:32 PM, Ken Irwin wrote: > > 1. respect robots.txt Disclaimer: I am not a lawyer. Remember that robots.txt applies only to recursive web crawlers, and not to screen-scraping per se. In cases where it does apply, it has limi

Re: [CODE4LIB] screen scraping

2011-10-03 Thread Ed Summers
On Sun, Oct 2, 2011 at 10:32 PM, Ken Irwin wrote: > 1. respect robots.txt Respecting robots.txt is key. It's normally easy to eyeball, but when in doubt something like robotparser [1] can be helpful. A bit of reflection on the Golden Rule probably is probably more important than pondering the leg

Re: [CODE4LIB] screen scraping

2011-10-03 Thread Ed Summers
On Mon, Oct 3, 2011 at 8:55 AM, Ed Summers wrote: > Respecting robots.txt is key. It's normally easy to eyeball, but when > in doubt something like robotparser [1] can be helpful. A bit of > reflection on the Golden Rule probably is probably more important than > pondering the legality of what you

[CODE4LIB] screen scraping

2011-10-02 Thread Susan Kane
For public use, I'd Ask the Webmaster as a courtesy -- regardless of legality. After all, it is (presumably) their content and they should have some say in whether it appears on another site. If my content were used to drive traffic at another site, I'd probably want to know who and where, etc.

Re: [CODE4LIB] screen scraping

2011-10-02 Thread Tracy Seneca
rary From: Code for Libraries [CODE4LIB@LISTSERV.ND.EDU] on behalf of Nate Hill [nathanielh...@gmail.com] Sent: Sunday, October 02, 2011 7:23 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] screen scraping A question: what are the 'rules' around screen scraping? If one site doesn&

Re: [CODE4LIB] screen scraping

2011-10-02 Thread Nate Hill
I think what I'm hearing here is that it would be a good idea to ask a webmaster on the other end if it's OK. Advertising... Roberto, good point I hadn't thought of that. Thanks. On Sun, Oct 2, 2011 at 7:46 PM, Roberto Hoyle wrote: > On 10/2/2011 10:23 PM, Nate Hill wrote: > >> A question: what

Re: [CODE4LIB] screen scraping

2011-10-02 Thread Roberto Hoyle
On 10/2/2011 10:23 PM, Nate Hill wrote: A question: what are the 'rules' around screen scraping? If one site doesn't offer an RSS feed and you want to grab (for example) their weekly top ten list with a script and then redisplay it on another site, is that bad form? Or even illegal? If the sit

Re: [CODE4LIB] screen scraping

2011-10-02 Thread Erik Hetzner
At Sun, 2 Oct 2011 19:23:09 -0700, Nate Hill wrote: > > A question: what are the 'rules' around screen scraping? > If one site doesn't offer an RSS feed and you want to grab (for example) > their weekly top ten list with a script and then redisplay it on another > site, is that bad form? Or even i

Re: [CODE4LIB] screen scraping

2011-10-02 Thread Ken Irwin
I don't know that there are two many rules about this, but here's what comes to mind for me: 1. respect robots.txt 2. cache content so you don't hit their site more often than is reasonable. (i'd say that once a day is pretty reasonable) 3. also cache or mockup or something when you're writing y

Re: [CODE4LIB] screen scraping

2011-10-02 Thread Reese, Terry
: 541.737.6384 *** -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Nate Hill Sent: Sunday, October 02, 2011 7:23 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] screen scraping A question: what are the 'rules&#x

Re: [CODE4LIB] screen scraping

2011-10-02 Thread Michael J. Giarlo
It's only illegal if you respect the authority of the HTTPolice*. -Mike * OCLC On Sun, Oct 2, 2011 at 22:23, Nate Hill wrote: > A question: what are the 'rules' around screen scraping? > If one site doesn't offer an RSS feed and you want to grab (for example) > their weekly top ten list with a

[CODE4LIB] screen scraping

2011-10-02 Thread Nate Hill
A question: what are the 'rules' around screen scraping? If one site doesn't offer an RSS feed and you want to grab (for example) their weekly top ten list with a script and then redisplay it on another site, is that bad form? Or even illegal? Thanks- Nate -- Nate Hill nathanielh...@gmail.com h