> On Dec 28, 2016, at 8:18 AM, Tilman Hausherr <[email protected]> wrote:
>
> […]
> Try also https://github.com/tabulapdf/ , I
> wonder how they handle this problem.
Hi, main author of Tabula here.

We've come across that case many times. Some spreadsheet-to-PDF generators clip a cell's content to the extent of its container. We handle it by simply detecting whether a character is inside the current clipping path [1].

Cheers,

[1] https://github.com/tabulapdf/tabula-java/blob/master/src/main/java/technology/tabula/ObjectExtractor.java#L342

--
Manuel Aristarán <[email protected]> http://jazzido.com
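P.S. For anyone following the thread, here is a minimal sketch of the clipping check described above. It is not the code at [1]: `ClipFilter`, `isInsideClip` and `visibleGlyphs` are hypothetical names, and it assumes the glyph bounding boxes and the clip shape have already been brought into the same coordinate space. It also assumes a glyph counts as visible only when its whole box lies inside the clip; testing for intersection instead would be an equally reasonable choice.

```java
import java.awt.Shape;
import java.awt.geom.Rectangle2D;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not part of tabula-java or PDFBox.
final class ClipFilter {
    private ClipFilter() {}

    /** True when the glyph's bounding box lies entirely within the clip shape. */
    static boolean isInsideClip(Rectangle2D glyphBounds, Shape clip) {
        return clip.contains(glyphBounds);
    }

    /** Returns only the glyph boxes that are fully inside the given clip. */
    static List<Rectangle2D> visibleGlyphs(List<Rectangle2D> glyphs, Shape clip) {
        List<Rectangle2D> visible = new ArrayList<>();
        for (Rectangle2D glyph : glyphs) {
            if (isInsideClip(glyph, clip)) {
                visible.add(glyph);
            }
        }
        return visible;
    }
}
```

With a 50 x 10 cell as the clip, a glyph box at x = 5 is kept, while boxes starting at x = 48 (straddling the edge) or x = 52 (fully outside) are dropped. Obtaining the current clipping path from the PDF graphics state and applying the right transforms is left out here.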

