Hi,
Am 25.09.2015 um 13:02 schrieb Tim Daley:
Tims-MacBook-Pro-2:Downloads tdaley$ jar -xvf
pdfbox-app-2.0.0-20150925.080424-1677-sources.jar
created: META-INF/
inflated: META-INF/MANIFEST.MF
inflated: META-INF/DEPENDENCIES
inflated: META-INF/NOTICE
inflated: META-INF/LICENSE
Tims-MacBook-Pro-2:Downloads tdaley$
pdfbox-app is some sort of a meta package which provides a pdfbox binary
including everything which is needed to run the commandline tools. It doesn't
contain any sourcecode.
BR
Andreas
On Fri, Sep 25, 2015 at 1:13 AM, Tilman Hausherr <[email protected]>
wrote:
Am 25.09.2015 um 01:50 schrieb Tim Daley:
I checked the Maven source for the package and it only contained metadata.
I would have tried to check it out.
Not sure what the question is, but the latest 2.0 app snapshot is here:
https://repository.apache.org/content/groups/snapshots/org/apache/pdfbox/pdfbox-app/2.0.0-SNAPSHOT/
just download the latest jar file.
Tilman
On Thu, Sep 24, 2015 at 2:36 AM, Tilman Hausherr <[email protected]>
wrote:
Am 24.09.2015 um 04:22 schrieb Tim Daley:
Another note. When I was using PDFDebugger, I could get it to load a URL,
but I when I tried the file selection menu, all the PDF files were
grayed
out.
I'm on OSX 10.10. It's not a biggie for me as I can just use a URL.
Weird... It worked for me (on windows)... Another possibility is to use
drag and drop.
Tilman
On Wed, Sep 23, 2015 at 10:11 PM, Tim Daley <[email protected]> wrote:
That's what I was looking for!!!
Thanks!
On Wed, Sep 23, 2015 at 4:20 PM, Tilman Hausherr <
[email protected]>
wrote:
Am 23.09.2015 um 22:16 schrieb Tim Daley:
The main gist of the program is to read in a multi-page pdf. Based on
a
control file, detect what type of document this represents:
simplex/duplex
portrait/landscape
1-up/2-up (in the case of books/booklets)
In the case of books/booklets, the pages need to be split in half and
the
individual pages reordered so that they are in page number order.
The resulting pages are then rotated as necessary and ouput either as
is
(for use on a tablet), or arranged on letter sized pages. In this
latter
case, the pages are moved to the upper right for simplex printing or
alternately upper right and upper left for duplex printing.
I assume the easiest way would be to build a single image, rotate the
image
as necessary, resize the image as necessary and write the image to a
new
page. If so, what operations are required to assemble all the source
images
into a single image. Or is there an easier way to do this with
PDFBox?
Ah, I think I understand: you look at the images in the resources,
and
based on the width / height ratio, you make a decision.
However there's no guarantee that the images will come in a certain
sequence.
Why not simply render the PDF pages?
http://stackoverflow.com/questions/23326562/apache-pdfbox-convert-pdf-to-images
Tilman
On Wed, Sep 23, 2015 at 4:01 PM, Tim Daley <[email protected]> wrote:
It's on https://www.daley.ws/Believe.pdf
I found PDFDebugger, thanks!
Now that I look at my code again. It looks like I am reading a list
of
images.
On Wed, Sep 23, 2015 at 2:44 PM, Tilman Hausherr <
[email protected]
wrote:
Am 23.09.2015 um 20:39 schrieb Tim Daley:
Whoops! I don't see PDFDebugger in PDFBox 2. Oversight? I'll get it
out
of
Version 1.
You can't attach PDF files. Upload them somewhere.
PDFDebugger is there, I even made a change earlier today! It is
part
of
the PDFBox app jar.
Tilman
On Wed, Sep 23, 2015 at 2:35 PM, Tim Daley <[email protected]>
wrote:
The PDF is at the bottom of the email. Aha! PDFDebugger!
On Wed, Sep 23, 2015 at 1:31 PM, Tilman Hausherr <
[email protected]
wrote:
The XObjects should be the same count in version 1 and 2.
If you don't want to share the PDFs, then look at them with the
new
PDFDebugger. You can see the XObject images easily.
Tilman
Am 23.09.2015 um 19:21 schrieb Tim Daley:
Here's the basic code that used to work. Granted, it probably
depends
heavily on Version 1's structure.
PDPage pdPage = CFCAPDFInputProgressBar.this.pdPages.get(i);
Map<COSName, PDXObject> images = new TreeMap<COSName,
PDXObject>();
PDResources pdResources = pdPage.getResources();
for(Entry<COSName, PDXObject>
objectImageEntry:images.entrySet())
{
PDXObject pdXObject = objectImageEntry.getValue();
if (pdXObject instanceof PDImageXObject)
{
PDImageXObject pdXObjectImage=
((PDImageXObject)pdXObject);
BufferedImage bufferedImage = null;
try{bufferedImage= pdXObjectImage.getImage();}
catch(Throwable t)
{
t.printStackTrace();
randomAccessFile.close();
throw new RuntimeException(t);
}
if (CFCAPDFInputProgressBar.this.music.getLandscape())
bufferedImage= rotate90DX(bufferedImage);
int width = bufferedImage.getWidth();
int height = bufferedImage.getHeight();
if (CFCAPDFInputProgressBar.this.music.getTwoPage())
{
width /= 2;
boolean even = i%2 == 0;
intrightPageNo= even?i+1:pageCount*2-i;
intleftPageNo= even?pageCount*2-i:i+1;
putPage(bufferedImage, rightPageNo, width, 0, width,
height);
putPage(bufferedImage, leftPageNo, 0, 0, width,
height);
}
else
{
int pageNo =
CFCAPDFInputProgressBar.this.music.getStart()
+ i;
putPage(bufferedImage, pageNo, 0, 0, width, height);
}
}
}
On Wed, Sep 23, 2015 at 1:06 PM, Tilman Hausherr <
[email protected]
<mailto:[email protected]>> wrote:
Am 23.09.2015 um 17:33 schrieb Tim Daley:
It appears that PDFBOX 2 handles scanned documents
differently
than PDFBOX
1.
I have multipage PDFs that I have scanned from a
Konica/Minolta C224e. The
PDFs in version 1 seemed to come in as a single
image.
Now in
version 2,
they seem to come in as multiple images. I assume
this
is to
reduce the
size of the resultant PDFs.
Is there a way to retrieve each page as a single
image
or is
there a method
to merge all the images on a page into a single
image?
Can't comment without having a sample PDF. And I don't
know
what
you mean with "seemed to come in as a single image".
Tilman
---------------------------------------------------------------------
To unsubscribe, e-mail:
[email protected]
<mailto:[email protected]>
For additional commands, e-mail:
[email protected]
<mailto:[email protected]>
--
*Tim Daley*
IT Specialist-Operating Systems
cru | Engagement & Services | Platform Team
o:407-826-2911 | m:407-716-0284
[email protected] <mailto:[email protected]>
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
--
*Tim Daley*
IT Specialist-Operating Systems
cru | Engagement & Services | Platform Team
o: 407-826-2911 | m: 407-716-0284
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
--
*Tim Daley*
IT Specialist-Operating Systems
cru | Engagement & Services | Platform Team
o: 407-826-2911 | m: 407-716-0284
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
--
*Tim Daley*
IT Specialist-Operating Systems
cru | Engagement & Services | Platform Team
o: 407-826-2911 | m: 407-716-0284
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]