[ 
https://issues.apache.org/jira/browse/PDFBOX-2530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290994#comment-14290994
 ] 

khyrul bashar commented on PDFBOX-2530:
---------------------------------------

I'm interested in working on this project next summer.
To get started, I've taken a first look at the source code. I'm now trying to
understand the architecture of PDFBox, and so I'm spending time learning the
structure of PDF itself.
I need some guidance: how should I prepare for the project, and which topics
should I know beforehand?
 

> Improve PDFDebugger
> -------------------
>
>                 Key: PDFBOX-2530
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2530
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Utilities
>    Affects Versions: 1.8.8, 2.0.0
>            Reporter: Tilman Hausherr
>              Labels: gsoc2015
>
> (This is an idea for [GSOC2015|https://www.google-melange.com/]. But if you 
> want to submit some improvements to the code outside of GSOC2015 now, this is 
> fine. We can always come up with other project ideas for GSOC2015)
> Our command-line utility PDFDebugger (part of the command-line pdfbox-app; 
> get it [here|https://pdfbox.apache.org/downloads.html], read the description 
> [here|https://pdfbox.apache.org/commandline/], get the source code 
> [here|https://svn.apache.org/viewvc/pdfbox/trunk/tools/src/main/java/org/apache/pdfbox/tools/PDFDebugger.java?view=markup&sortby=date])
>  needs some improvements:
>    - hex view
>    - view of non-printable characters
>    - saving streams
>    - binary copy & paste
>    - ability to search in streams (very useful for content streams and 
> metadata)
>    - show images that are streams
>    - show PDIndexed color lookup table, show the index value, the base and 
> RGB color value sets when the mouse moves
>    - show PDSeparation color
>    - show PDDeviceN colors
>    - show font encodings and characters
>    - edit attributes
>    - edit streams, while keeping or changing the compression filter
>    - save altered PDF 
>    - color marking of certain PDF operators, especially q...Q and text 
> operators (BT...ET). Ideally, it should help the user understand the 
> "bracketing" of these operators, i.e. understand where a sequence starts and 
> where it ends. (See "operator summary" in the PDF Spec.) Other "important" 
> operators I can think of are the matrix, font and color operators. A cool 
> advanced thing would be to show the current color or the font in a popup 
> when hovering over such an operator.
> To see a product with a similar purpose that is better than PDFDebugger, 
> watch [this video|https://www.youtube.com/watch?v=g-QcU9B4qMc].
> I'm not asking to implement a clone of that product (I don't use it, all I 
> know is that video), but we at PDFBox really need something that makes PDF 
> debugging easier. As an example of how the current PDFDebugger prevented me 
> from finding a bug quickly, see PDFBOX-2401 and search for "PDFDebugger".
> Prerequisites:
> - Java programming, especially the GUI components
> - the ability to understand existing source code
> Using external software components is possible (they must have the Apache 
> License or a compatible one), but should be decided on a case-by-case basis; 
> we don't want to get too big.
> Development strategy: go from the easy to the difficult. The desired features 
> are already sorted this way (mostly).
> Mentor: Tilman Hausherr (European timezone, languages: German, English, 
> French). To see the GSoC2014 project I mentored, go to PDFBOX-1915.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
