Re: [CODE4LIB] opening a pdf file [diva]

2011-10-03 Thread KREYCHE, MICHAEL
I just installed EBookDroid (AFAICT the latest and greatest version of VuDroid) from the Android Market and tried it on the color version of Eric's "canary" file. It immediately loads numbered blank pages, then starts rendering the current page, which takes about 20 seconds. Previously rendered

Re: [CODE4LIB] opening a pdf file [diva]

2011-10-03 Thread Tom Keays
Another idea, if you are looking for an app-based rather than web-based reader is VuDroid, which supports both PDF and DjVu formats. http://code.google.com/p/vudroid/ I suggest it, not because I use it but because, at least in the Open Library version of the book's record, http://openlibrary.org/b

Re: [CODE4LIB] screen scraping

2011-10-03 Thread Genny Engel
Another reason to check with the webmaster, all legalities aside, is that their top ten list might actually be being built on an RSS feed, but for whatever reason they don't offer it directly as a feed (or they do, but it wasn't obvious to you where that feed was to be found). They might prefer

Re: [CODE4LIB] opening a pdf file [diva]

2011-10-03 Thread Brenner, Aaron L
> -Original Message- > From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of > Andrew Hankinson > I don't know of anything similar for PDFs, since they're not really > designed to render a portion of the document without downloading the > entire thing. The "linearized" f

Re: [CODE4LIB] opening a pdf file

2011-10-03 Thread Dave Caroline
The problem is PDF and the viewers. some/most expand ALL the compressed images and create thumbs from the images before they start display, This uses huge amounts of memory, a technology fail, they just dont fit certain work. "If" you are lucky the viewer keeps it compressed so it fits in memory an

Re: [CODE4LIB] opening a pdf file [diva]

2011-10-03 Thread Andrew Hankinson
It doesn't work with PDFs, since it needs to create a tiled TIFF image for each page. I don't know of anything similar for PDFs, since they're not really designed to render a portion of the document without downloading the entire thing. You can convert PDF pages to images, though... :) -Andrew

Re: [CODE4LIB] ny times best seller api

2011-10-03 Thread Karen Coombs
Some of OCLC's APIs do support JSONP or CORS: for example QuestionPoint API, the xIdentifier and MapFAST services. However, other services do not provide this support. This is because for these services we need to carefully ensure that the application making the request is actually owned by the ins

Re: [CODE4LIB] OpenLibrary book covers & privacy concerns

2011-10-03 Thread Chris Cormack
On 4 Oct 2011 07:27, "Erin Germ" wrote: > > Does anyone know if OpenLibrary tracks specific user information on the book > covers they provide? My concern is privacy of the patrons, not stat use? > Why not ask them? I've found them nothing but exceedingly helpful Chris > Thanks. > > ~Erin

[CODE4LIB] OpenLibrary book covers & privacy concerns

2011-10-03 Thread Erin Germ
Does anyone know if OpenLibrary tracks specific user information on the book covers they provide? My concern is privacy of the patrons, not stat use? Thanks. ~Erin

Re: [CODE4LIB] opening a pdf file [diva]

2011-10-03 Thread Tom Keays
On Mon, Oct 3, 2011 at 11:57 AM, Andrew Hankinson < andrew.hankin...@gmail.com> wrote: > I'm one of the developers of Diva. I noticed that you've been getting your > files from the Internet Archive. They also have the full high-quality JPEG > and JPEG2000 images available. > > http://ia600209.us.a

Re: [CODE4LIB] screen scraping

2011-10-03 Thread Nate Vack
On Sun, Oct 2, 2011 at 9:35 PM, Reese, Terry wrote: > In Canada, the BC Supreme Court ruled that screen scrapping real estate > listings from one site and using them on another indeed infringed on > copyright.  Not sure if this would cover your use -- but if you are coming > from Canada, it mig

Re: [CODE4LIB] screen scraping

2011-10-03 Thread Simon Spero
On Oct 3, 2011 9:19 AM, "Ed Summers" wrote: > On Sun, Oct 2, 2011 at 10:32 PM, Ken Irwin wrote: > > 1. respect robots.txt Disclaimer: I am not a lawyer. Remember that robots.txt applies only to recursive web crawlers, and not to screen-scraping per se. In cases where it does apply, it has limi

Re: [CODE4LIB] opening a pdf file [diva]

2011-10-03 Thread Parker, Anson (adp6j)
So this is awesome, does it in fact work with PDF's or not, and if not does anyone have any similar tools recommended for pdfs ap On 10/3/11 11:12 AM, "Dave Caroline" wrote: >Diva was announced here of 6th of June >https://listserv.nd.edu/cgi-bin/wa?A2=ind1106&L=CODE4LIB&T=0&F=&S=&P=27064 > >Th

Re: [CODE4LIB] opening a pdf file [diva]

2011-10-03 Thread Andrew Hankinson
On 2011-10-03, at 11:29 AM, Eric Lease Morgan wrote: >> > Very interesting, and thank you for bringing it to my attention. It seems it > relies on a technology that reads and chunks up image files. Alas, I have > PDFs. Moreover, I really want people to be able to print the entire > documents. I

Re: [CODE4LIB] opening a pdf file

2011-10-03 Thread Eric Lease Morgan
On Oct 3, 2011, at 11:32 AM, Demian Katz wrote: > Additionally, I notice that there are different versions of the PDF here: > > http://www.archive.org/details/canarybird00schm > > (one labeled PDF, another B/W PDF) > > Does one version work better on tablets than the other? At first glance, t

Re: [CODE4LIB] opening a pdf file

2011-10-03 Thread Demian Katz
I imagine the problems have to do with the multi-layered nature of the Internet Archive scans; even on my PC, the PDF doesn't render well until it is fully downloaded -- while it is in the process of loading, I see a blurry mess. Perhaps the tablet devices are doing the same thing -- attempting

Re: [CODE4LIB] opening a pdf file [diva]

2011-10-03 Thread Eric Lease Morgan
On Oct 3, 2011, at 11:12 AM, Dave Caroline wrote: > Diva was announced here of 6th of June > https://listserv.nd.edu/cgi-bin/wa?A2=ind1106&L=CODE4LIB&T=0&F=&S=&P=27064 > > The clever part is you only send the visible part at the scale they > are viewing so little excess bandwidth. > > For online

Re: [CODE4LIB] opening a pdf file

2011-10-03 Thread Eric Lease Morgan
Most people seem to have mixed results when trying to open the PDF files on their tablet-based (Android and iOS) devices. Bummer! These PDF files were harvested from the Internet Archive. They seem to be viewable just fine for desktop machines, but not tablets. The number of files I have in the

Re: [CODE4LIB] opening a pdf file

2011-10-03 Thread KREYCHE, MICHAEL
Eric, I have to answer "mostly no" to your question. I can open it with the Adobe reader on my Android tablet, but the pages are too blurred to read. It's a little (don’t take this literally) like an interlaced image when only the first half of the lines are displayed. I've never run into thi

Re: [CODE4LIB] opening a pdf file [diva]

2011-10-03 Thread Dave Caroline
Diva was announced here of 6th of June https://listserv.nd.edu/cgi-bin/wa?A2=ind1106&L=CODE4LIB&T=0&F=&S=&P=27064 The clever part is you only send the visible part at the scale they are viewing so little excess bandwidth. For online document view it takes some beating and is not too hard to set u

Re: [CODE4LIB] opening a pdf file [diva]

2011-10-03 Thread Eric Lease Morgan
On Oct 3, 2011, at 10:26 AM, Dave Caroline wrote: > It is educational to look at memory use in the pc when that pdf is loaded. > Evince here is using 600meg do you have space for such objects on > these little toys > > try something like diva so you dont suck the resources dry on the client Plea

Re: [CODE4LIB] opening a pdf file

2011-10-03 Thread Dave Caroline
On Mon, Oct 3, 2011 at 2:58 PM, Eric Lease Morgan wrote: > Are any of you able to open the following URL with an Android-based tablet > device: > >  http://dh.crc.nd.edu/sandbox/cyl/corpus/canarybird00schm.pdf It is educational to look at memory use in the pc when that pdf is loaded. Evince here

Re: [CODE4LIB] opening a pdf file

2011-10-03 Thread Chad Benjamin Nelson
Eric, Downloaded, but the default pdf viewer (QuickOffice on my android phone) just shows me pages with a big red X. I downloaded the Adobe Reader app and now i can see the content, but it does not look anything like the image I see in my pc's browser. Garbled / washed out, though each page

[CODE4LIB] opening a pdf file

2011-10-03 Thread Eric Lease Morgan
Are any of you able to open the following URL with an Android-based tablet device: http://dh.crc.nd.edu/sandbox/cyl/corpus/canarybird00schm.pdf I have harvested about 60 PDF documents from the Internet Archive, and I created a rudimentary tablet-based interface to the collection here: http:/

Re: [CODE4LIB] ny times best seller api

2011-10-03 Thread Ed Summers
On Wed, Sep 28, 2011 at 5:36 PM, Godmar Back wrote: > Closer to the code4lib community: OCLC and Serials Solutions don't support > JSONP in their webservices, either, even though doing so would allow cool > services and would likely not affect their business models adversely in a > significant way

Re: [CODE4LIB] screen scraping

2011-10-03 Thread Ed Summers
On Sun, Oct 2, 2011 at 10:32 PM, Ken Irwin wrote: > 1. respect robots.txt Respecting robots.txt is key. It's normally easy to eyeball, but when in doubt something like robotparser [1] can be helpful. A bit of reflection on the Golden Rule probably is probably more important than pondering the leg

Re: [CODE4LIB] screen scraping

2011-10-03 Thread Ed Summers
On Mon, Oct 3, 2011 at 8:55 AM, Ed Summers wrote: > Respecting robots.txt is key. It's normally easy to eyeball, but when > in doubt something like robotparser [1] can be helpful. A bit of > reflection on the Golden Rule probably is probably more important than > pondering the legality of what you