[ 
https://issues.apache.org/jira/browse/TIKA-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17771956#comment-17771956
 ] 

Tim Allison commented on TIKA-4148:
-----------------------------------

In roughly reverse order...

bq. zip is just their way of packaging

Ok. Phew... 

So, y, that looks good for the "parsing" that we can do easily.

What would you propose for the mime types for each of the file types?

We have a POIFSContainer detector that looks through the file/directory names 
within an OLE2 and makes a mime detection if there are tell-tale components.  
If you have time to run POIFSDump or similar on the example files, it'd be 
helpful to figure out what might be distinctive for each.  If nothing, then, y, 
we can fall back to extensions.

> Support Autodesk Inventor files (.ipt) (.iam) (.ipn) (.idw)
> -----------------------------------------------------------
>
>                 Key: TIKA-4148
>                 URL: https://issues.apache.org/jira/browse/TIKA-4148
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Alexey Pismenskiy
>            Priority: Major
>
> Add support for Autodesk Inventor files in Tika. 
> Examples of the files can be downloaded from 
> [https://www.autodesk.com/support/technical/article/caas/tsarticles/ts/3gnm93P9sPAWE6vndk7fjq.html]
> It would be great to start at least at the metadata level and then add 
> content parsing later. 
> I suspect I would be something similar to 
> [DWGParser|[https://tika.apache.org/0.9/api/org/apache/tika/parser/dwg/DWGParser.html]|https://tika.apache.org/0.9/api/org/apache/tika/parser/dwg/DWGParser.html].],
>  
> any suggestions where to start looking are appreciated. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to