On Monday 13 July 2009 17:25:52 Brad Hards wrote:
> http://code.google.com/p/ocropus/issues/detail?id=146 discusses the
> possibility of creating searchable image PDFs. I'm not sure how well that
> is going to work out, but basic PDF support seems like it might be useful.
>
> As a first cut (not intended to be applied at this stage), I've added
> support for reading PDF files to iulib and ocropus. It relies on the
> poppler library. I recognise it isn't complete and that RGB doesn't work
> yet, but it is enough to get ocropus book2pages to run on a multiple page
> PDF file.
>
> I've based the support on the TIFF implementation - much of the patch is
> fairly mechanical (well, it was once I understood what was going on,
> anyway).
>
> From the comment in issue 146, I'm assuming that this is the sort of thing
> you'd like to see added to ocropus. However, is this the sort of
> implementation you'd expected?
>
> Also, the tests for this are fairly messy, because I can't write a PDF
> file. Instead, I rely on an existing PDF file, and a set of "known answer"
> PNG files. See test-io_pdf.cc (attached) for the test as it currently
> exists. I've tried to choose a fairly small PDF file, which gives:
> $ ls -go orientation*
> -rw-r--r-- 1 12033 2009-07-12 21:41 orientation-0.png
> -rw-r--r-- 1 13078 2009-07-13 08:30 orientation-1.png
> -rw-r--r-- 1 14030 2009-07-13 08:26 orientation-2.png
> -rw-r--r-- 1 13403 2009-07-13 08:30 orientation-3.png
> -rw-r--r-- 1 14675 2009-07-11 22:08 orientation.pdf
>
> That 67K of test files just covers the simple gray case. I'll try to make
> the other examples just one page, but the png images will still take a bit
> of space. Is that OK for iulib?
> As an alternative, we could just check that the results were the right
> size. There are a reasonable number of things that could go wrong in such a
> case though (for example, that probably won't pick up endianness problems in
> the image).
>
> Thoughts? Comments?

Ping.
I'd really appreciate feedback on this, so I know whether to keep going while
I've got very limited email / net connectivity.

Brad
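
To make the trade-off between a size-only check and the "known answer" comparison concrete, here is a minimal C++ sketch. It is not taken from test-io_pdf.cc and does not use the iulib image types: GrayImage, same_size, same_pixels and with_swapped_bytes are hypothetical names invented for illustration, and 16-bit gray samples are assumed so that a byte-order mistake is visible. It only shows why a reader that gets the byte order wrong would probably pass a size check while failing a pixel-by-pixel comparison.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a decoded page; NOT the iulib image type.
struct GrayImage {
    std::size_t width;
    std::size_t height;
    std::vector<std::uint16_t> pixels;  // row-major, 16 bits per sample (assumed)
};

// Weak check: only the dimensions are compared.
bool same_size(const GrayImage &a, const GrayImage &b) {
    return a.width == b.width && a.height == b.height;
}

// Known-answer check: the dimensions and every sample must match.
bool same_pixels(const GrayImage &a, const GrayImage &b) {
    return same_size(a, b) && a.pixels == b.pixels;
}

// Simulate a reader that gets the byte order of each sample wrong.
GrayImage with_swapped_bytes(GrayImage img) {
    for (std::uint16_t &p : img.pixels)
        p = static_cast<std::uint16_t>((p << 8) | (p >> 8));
    return img;
}

// Small demonstration of the difference between the two checks.
void check_example() {
    GrayImage expected{2, 1, {0x0102, 0x0304}};
    GrayImage broken = with_swapped_bytes(expected);
    assert(same_size(expected, broken));     // the size check is fooled
    assert(!same_pixels(expected, broken));  // the full comparison is not
}
```

The cost of the stronger check is the one raised in the quoted message: the expected pixel data has to be stored somewhere, which is what the PNG files are for.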
