Re: HTML formatting

Raimi Rufai Sun, 08 May 2011 11:56:30 -0700

Hi LX,

PDF to HTML conversion is a fascinating  set of problems. One of the
hard knots to crack is preserving tables and columns for multi-column
documents. I've not had a look at the code for a long time.


I'll be nice to jump back in.

Regards,

Raimi

On Fri, May 6, 2011 at 3:01 PM, LynX <[email protected]> wrote:
> Hello,
>
> There is several similar issues which concerns formatting after PDF to HTML
> conversion:
>
> https://issues.apache.org/jira/browse/PDFBOX-6
> https://issues.apache.org/jira/browse/PDFBOX-271
>
> I would like to work on them (I see that some work has been done by rrufai
> already, but PDFBOX code have changed since then, so I may need to do some
> additional changes), but I see that severity of all these issues is minor
> and there are not any comments on them for a long time.
> Thats why I am not sure if it make sense to work on them or not? If not then
> may be they can be closed?
>
> Thank you,
> LX
>



-- 
«To develop software is to build a machine simply by describing it.»
(Michael A. Jackson -- not the singer)
«Développer des logiciels est de construire une machine tout
simplement en le décrivant.» (Michael A. Jackson - pas le chanteur)

Re: HTML formatting

Reply via email to