Now that it is visible, that there is no SVG in the PDF, why not
simply render the PDF to PNG and crop the graph part?
  Unless there is some weird requirement to use SVG rather than PNG…

  If there is a hard requirement for SVG one could use the
PDFRenderer.renderPageToGraphics[1] in combination with Apache's XML
generator for SVG output.

  [1] org.apache.pdfbox.rendering.PDFRenderer.renderPageToGraphics(int,
Graphics2D)
  [2] https://xmlgraphics.apache.org/batik/using/svg-generator.html

На чт, 7.03.2019 г. в 11:39 ч. European Neuroscience Center
<[email protected]> написа:
>
> Hi Jan,
>
> The first thing I did was a Web Robot, that crawls all pages for each
> student and gets the necessary information. This significantly saves time,
> but again requires human interference and time. PDFs that are regularly
> sent automatically by email, for each student, contain all the necessary
> information, that the Web Robot collects.
>
> Do you think that through Selenium <https://www.seleniumhq.org/> these
> activities and processes can be fully automated?
>
>
> Regards,
> Miro.
>
> On Thu, Mar 7, 2019 at 11:11 AM <[email protected]> wrote:
>
> >
> > > We have access to the sources (Website), but this is time
> > > consuming. Partly, there are web services, which we can use, but not for
> > > all tasks. The PDF files are generated automatically by schedule, so
> > this
> > > way can be fully automated.
> >
> >
> >
> >
> >
> > Supposing your SVG data are available in some website and instead of
> > downloading them one by one you prefer extract them in bulk from PDF
> > snapshots of these pages, I'd recommend avoiding that PDF route and rather
> > automating that SVG downloading step.
> >
> >
> > Firstly I'd ask the app developers to provide some API to get data via web
> > service. Only if there is no other option, I would try guessing the SVG
> > image URL for any page/article. If there is some relation, automation is
> > easy. If not, you could somehow automate your manual steps via testing
> > tools, see e.g. https://www.seleniumhq.org/.
> >
> >
> >
> >
> > Jan

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to