Hello everyone,
I need to extract table data from pdf.
I know there are different approaches for that, but the table has
“gridlines”, so i needed an exact solution.
My problem is that, when I parse the pdf with a page drawer, there are
some lines that actually not “seen in pdf”.
I need to discard them but I couldnt find how to.
Obviously there is a hidden information in “graphics/graphicsstate”.
(not width, background/foreground color)
Please see attachments for clarification.
Any help would be appreciated.
Thanks.
These are not “raw” lines, these are “combined” line info. Bold one I
need to get rid of (actually a set of lines, not single)
HORIZONTALLINE:Point2D.Double[31.0, 276.0] Point2D.Double[565.0, 276.0]
HORIZONTALLINE:Point2D.Double[31.0, 311.0] Point2D.Double[565.0, 311.0]
HORIZONTALLINE:Point2D.Double[31.0, 256.0] Point2D.Double[565.0, 256.0]
HORIZONTALLINE:Point2D.Double[31.0, 236.0] Point2D.Double[565.0, 236.0]
HORIZONTALLINE:Point2D.Double[31.0, 216.0] Point2D.Double[565.0, 216.0]
HORIZONTALLINE:Point2D.Double[31.0, 196.0] Point2D.Double[565.0, 196.0]
HORIZONTALLINE:Point2D.Double[31.0, 176.0] Point2D.Double[565.0, 176.0]
HORIZONTALLINE:Point2D.Double[31.0, 156.0] Point2D.Double[565.0, 156.0]
HORIZONTALLINE:Point2D.Double[31.0, 136.0] Point2D.Double[565.0, 136.0]
HORIZONTALLINE:Point2D.Double[31.0, 116.0] Point2D.Double[565.0, 116.0]
HORIZONTALLINE:Point2D.Double[31.0, 108.5] Point2D.Double[564.0, 108.5]
VERTICALLINE:Point2D.Double[565.0, 116.0] Point2D.Double[565.0, 311.0]
VERTICALLINE:Point2D.Double[31.0, 116.0] Point2D.Double[31.0, 311.0]
VERTICALLINE:Point2D.Double[51.0, 116.0] Point2D.Double[51.0, 311.0]
VERTICALLINE:Point2D.Double[95.0, 116.0] Point2D.Double[95.0, 311.0]
VERTICALLINE:Point2D.Double[222.0, 116.0] Point2D.Double[222.0, 311.0]
*VERTICALLINE:Point2D.Double[142.0, 116.0] Point2D.Double[142.0, 276.0]***
VERTICALLINE:Point2D.Double[247.0, 116.0] Point2D.Double[247.0, 311.0]
VERTICALLINE:Point2D.Double[287.0, 116.0] Point2D.Double[287.0, 311.0]
VERTICALLINE:Point2D.Double[310.0, 116.0] Point2D.Double[310.0, 311.0]
VERTICALLINE:Point2D.Double[339.0, 116.0] Point2D.Double[339.0, 311.0]
VERTICALLINE:Point2D.Double[369.0, 116.0] Point2D.Double[369.0, 311.0]
VERTICALLINE:Point2D.Double[402.0, 116.0] Point2D.Double[402.0, 311.0]
VERTICALLINE:Point2D.Double[452.0, 116.0] Point2D.Double[452.0, 311.0]
VERTICALLINE:Point2D.Double[432.0, 116.0] Point2D.Double[432.0, 311.0]
VERTICALLINE:Point2D.Double[507.0, 116.0] Point2D.Double[507.0, 311.0]
VERTICALLINE:Point2D.Double[537.0, 116.0] Point2D.Double[537.0, 311.0]
VERTICALLINE:Point2D.Double[147.0, 116.0] Point2D.Double[147.0, 311.0]
Bu e-posta'nın içerdiği bilgiler (ekleri dahil olmak üzere) gizlidir.
Onayımız olmaksızın üçüncü kişilere açiklanamaz. Bu mesajın
gönderilmek istendiği kişi değilseniz, lütfen mesajı sisteminizden
derhal siliniz. IBTech A.Ş. bu mesajın içerdiği bilgilerin doğruluğu
veya eksiksiz olduğu konusunda bir garanti vermemektedir. Bu nedenle
bilgilerin ne şekilde olursa olsun içeriğinden, iletilmesinden,
alınmasından, saklanmasından sorumlu değildir. Bu mesajın içeriği
yazarına ait olup, IBTech A.Ş.'nin görüşlerini içermeyebilir.
The information contained in this e-mail (including any attachments)is
confidential. It must not be disclosed to any person without our
authority. If you are not the intended recipient, please delete it
from your system immediately. IBTech A.S. makes no warranty as to the
accuracy or completeness of any information contained in this message
and hereby excludes any liability of any kind for the information
contained therein or for the information transmission, reception,
storage or use of such in any way whatsoever. Any opinions expressed
in this message are those of the author and may not necessarily
reflect the opinions of IBTech A.S.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]