*** This bug is a duplicate of bug 33288 ***
    https://bugs.launchpad.net/bugs/33288

David - I believe this bug is the one you mentioned on bug 33288. Yes,
this is not quite a duplicate, but hopefully fixing that will help here
too.

The bug in 33288 is caused because the heuristic used to detect reading
order is poor (among other things). This bug, however, appears to be
caused by rounding issues. The current 'copy & paste' code tries to
preserve a tabular layout it infers from the original pdf, and will move
text around to do that, in particular to avoid overlapping blocks of
text. Note that it does this on all text, whether or not we'd see it as
a table - PDF only contains coordinates of chunks of text, not their
interrelationships.

In this case, it appears that there is whitespace in the text to the
right of each number, and that each number, with its whitespace, forms a
separate block of text. It seems likely to me that the right edge of the
whitespace is coincident with the left edge of the last number. If
poppler sees an overlap here, eg because of a rounding error, it would
cause the bug - but only because it then tries to preserve the layout.
So this is a peculiarity of the way rectangular selection works in
poppler, and is possibly fixable.

However, if 33288 is fixed, the selection code will work entirely
differently. It *may* fix this bug, but it may make it worse. If these
non-columns are detected as being columns (because the whitespace
between them is wide enough), then fixed poppler will select down the
column first. If the whitespace is narrow enough, though, it will treat
these as lines of text and you will get the result you expected. Numbers
will not be shifted up and down, because poppler will no longer be
trying to preserve a tabular layout.

-- 
In Evince,  copy/paste from .pdf files reorders data
https://bugs.launchpad.net/bugs/346403
You received this bug notification because you are a member of Ubuntu
Desktop Bugs, which is a bug assignee.

-- 
desktop-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/desktop-bugs

Reply via email to