Rahul Veeramalla created TIKA-2493:
--------------------------------------
Summary: Allow Extraction of Javascript from PDFs
Key: TIKA-2493
URL: https://issues.apache.org/jira/browse/TIKA-2493
Project: Tika
Issue Type: Improvement
Reporter: Rahul Veeramalla
Priority: Blocker
I am building a file upload feature for my application that accepts PDFs.
On our security team's recommendation, I need to scan each uploaded PDF for
embedded JavaScript, attachments, and links, and block any PDF that contains
them.
I was able to extract hyperlinks and attachments from PDFs using Tika.
However, I cannot find any way to extract JavaScript from PDFs.
I need help determining whether a PDF contains JavaScript elements or code.
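
To make the request concrete, here is a minimal sketch of the kind of check
being asked for. It is not a Tika feature and does not use the Tika API. It is
a standalone heuristic that scans the raw PDF bytes for the /JS and /JavaScript
name keys, which the PDF format uses for JavaScript actions and the JavaScript
name tree. The class name PdfJavaScriptHeuristic and the method
mentionsJavaScript are hypothetical names chosen for illustration.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper for illustration only; not part of Tika or PDFBox.
public class PdfJavaScriptHeuristic {

    // PDF name keys associated with JavaScript actions and the JavaScript name tree.
    private static final String[] KEYS = {"/JavaScript", "/JS"};

    /** Returns true if the raw bytes contain a /JS or /JavaScript name token. */
    public static boolean mentionsJavaScript(byte[] pdfBytes) {
        // ISO-8859-1 maps each byte to exactly one char, so nothing is lost or shifted.
        String raw = new String(pdfBytes, StandardCharsets.ISO_8859_1);
        for (String key : KEYS) {
            int from = 0;
            int idx;
            while ((idx = raw.indexOf(key, from)) >= 0) {
                int end = idx + key.length();
                // Accept only whole names, so that e.g. /JSON does not match /JS.
                if (end == raw.length() || endsName(raw.charAt(end))) {
                    return true;
                }
                from = idx + 1;
            }
        }
        return false;
    }

    // A PDF name ends at whitespace or at one of the PDF delimiter characters.
    private static boolean endsName(char c) {
        return c == 0 || Character.isWhitespace(c) || "()<>[]{}/%".indexOf(c) >= 0;
    }

    public static void main(String[] args) throws IOException {
        Path path = Paths.get(args[0]);
        boolean flagged = mentionsJavaScript(Files.readAllBytes(path));
        System.out.println(path + (flagged ? ": JavaScript keys found" : ": no JavaScript keys found"));
    }
}
```

A hit from this scan is a reasonable signal to block the upload. A miss proves
nothing: the keys can sit inside compressed object streams, be written with
hex escapes in the name, or be hidden by encryption. A reliable answer needs a
real PDF parser such as Apache PDFBox, which Tika's PDF parser is built on, to
walk the document catalog and its actions.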