Re: [PR] Bump aws.version from 1.12.590 to 1.12.592 [tika]

2023-11-19 Thread via GitHub


THausherr merged PR #1462:
URL: https://github.com/apache/tika/pull/1462


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@tika.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Commented] (TIKA-4170) Tika to extract Apple Key files

2023-11-19 Thread Tika User (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-4170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17787819#comment-17787819
 ] 

Tika User commented on TIKA-4170:
-

Hi Allison,

Our observation is that Tika is not extracting embedded files from within
the *.key files.

Could you please check this? Thank you.
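One way to confirm independently of Tika which resources a .key file contains is to list its container entries. The sketch below is only a rough illustration and assumes the attached file is a ZIP-based package, which is not confirmed for this attachment. The class name KeyPackageLister and the sample entry names are made up for the example and are not part of Tika.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, not part of Tika: lists the entries of a ZIP-based package.
public class KeyPackageLister {

    /** Returns the names of all non-directory entries found in a ZIP stream. */
    public static List<String> listEntries(InputStream in) throws IOException {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (!entry.isDirectory()) {
                    names.add(entry.getName());
                }
            }
        }
        return names;
    }

    /** Builds a tiny in-memory ZIP with made-up entry names, for demonstration only. */
    public static byte[] sampleZip() throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            for (String name : new String[] {"Index/Document.iwa", "Data/image-1.png"}) {
                zip.putNextEntry(new ZipEntry(name));
                zip.write(name.getBytes(StandardCharsets.UTF_8));
                zip.closeEntry();
            }
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        // With a path argument, list that file; otherwise list the in-memory sample.
        try (InputStream in = args.length > 0
                ? Files.newInputStream(Paths.get(args[0]))
                : new ByteArrayInputStream(sampleZip())) {
            listEntries(in).forEach(System.out::println);
        }
    }
}
```

Running it with the path to an unzipped test file as the only argument prints one entry name per line, which can then be compared with the child documents Tika reports for each version.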

> Tika to extract Apple Key files
> ---
>
> Key: TIKA-4170
> URL: https://issues.apache.org/jira/browse/TIKA-4170
> Project: Tika
> Issue Type: Bug
> Reporter: Tika User
> Priority: Major
> Attachments: Apple_key_file.zip
>
>
> We are trying to use Tika to extract Apple Key files. The test data is attached.
> Could you please check why Tika can no longer extract Apple Key files as of
> Tika 2.9.0?
> The test results below are for your reference. Thank you.
>
> Tika version --> Child documents after extraction?
>        2.4.1 --> YES
>        2.6.0 --> YES
>        2.7.0 --> YES
>        2.8.0 --> YES
>        2.9.0 --> NO
>        2.9.1 --> NO



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[PR] Bump aws.version from 1.12.590 to 1.12.592 [tika]

2023-11-19 Thread via GitHub


dependabot[bot] opened a new pull request, #1462:
URL: https://github.com/apache/tika/pull/1462

   Bumps `aws.version` from 1.12.590 to 1.12.592.
   Updates `com.amazonaws:aws-java-sdk-s3` from 1.12.590 to 1.12.592
   
   Changelog
   Sourced from com.amazonaws:aws-java-sdk-s3's changelog
   (https://github.com/aws/aws-sdk-java/blob/master/CHANGELOG.md).
   
   1.12.592 2023-11-17
   AWS App Mesh
   
   
   Features
   
   Change the default value of these fields from 0 to null: MaxConnections,
MaxPendingRequests, MaxRequests, HealthCheckThreshold, PortNumber, and
HealthCheckPolicy - port. Users are not expected to perceive the change,
except that a badRequestException is thrown when required fields are not
configured.
   
   
   
   AWS Cloud9
   
   
   Features
   
   A minor doc only update related to changing the date of an API 
change.
   
   
   
   AWS CloudFormation
   
   
   Features
   
   This release adds a new flag ImportExistingResources to CreateChangeSet. 
Specify this parameter on a CREATE- or UPDATE-type change set to import 
existing resources with custom names instead of recreating them.
   
   
   
   AWS CodePipeline
   
   
   Features
   
   CodePipeline now supports overriding source revisions to achieve manual 
re-deploy of a past revision
   
   
   
   AWS CodeStar connections
   
   
   Features
   
   This release adds support for the CloudFormation Git sync feature. Git 
sync enables updating a CloudFormation stack from a template stored in a Git 
repository.
   
   
   
   AWS Elemental MediaLive
   
   
   Features
   
   MediaLive has now added support for per-output static image overlay.
   
   
   
   AWS SSO OIDC
   
   
   Features
   
   Adding support for sso-oauth:CreateTokenWithIAM.
   
   
   
   AWS Security Token Service
   
   
   Features
   
   API updates for the AWS Security Token Service
   
   
   
   AWS Single Sign-On Admin
   
   
   Features
   
   Improves support for configuring RefreshToken and TokenExchange grants 
on applications.
   
   
   
   Amazon Athena
   
   
   Features
   
   Adding ServicePreProcessing time metric
   
   
   
   Amazon CloudWatch Internet Monitor
   
   
   Features
   
   Adds new querying capabilities for running data queries on a monitor
   
   
   
   Amazon Connect Service
   
   
   Features
   
   This release adds WISDOM_QUICK_RESPONSES as new IntegrationType of 
Connect IntegrationAssociation resource and bug fixes.
   
   
   
   Amazon Connect Wisdom Service
   
   
   ... (truncated)
   
   
   Commits
   
   d0816a8 AWS SDK for Java 1.12.592
     https://github.com/aws/aws-sdk-java/commit/d0816a84ff9f8f10b58910639f88aad0aeba974b
   9157f43 Update GitHub version number to 1.12.592-SNAPSHOT
     https://github.com/aws/aws-sdk-java/commit/9157f43dea132eabc2ec79d0bf9e6501f42ee900
   809022f AWS SDK for Java 1.12.591
     https://github.com/aws/aws-sdk-java/commit/809022f250d1042ecdc6b9876c4f2701d30bd2d3
   2f6c501 Update GitHub version number to 1.12.591-SNAPSHOT
     https://github.com/aws/aws-sdk-java/commit/2f6c50150c868a9d9bc0b15028c5b6c944a659bf
   See full diff in compare view:
     https://github.com/aws/aws-sdk-java/compare/1.12.590...1.12.592
   
   
   
   
   Updates `com.amazonaws:aws-java-sdk-transcribe` from 1.12.590 to 1.12.592
   
   Changelog
   Sourced from com.amazonaws:aws-java-sdk-transcribe's changelog
   (https://github.com/aws/aws-sdk-java/blob/master/CHANGELOG.md).
   

[jira] [Commented] (TIKA-4148) Support Autodesk Inventor files (.ipt) (.iam) (.ipn) (.idw)

2023-11-19 Thread Nick Burch (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17787608#comment-17787608
 ] 

Nick Burch commented on TIKA-4148:
--

For detection of the OLE2-based files, we don't need to find unique byte
combinations; we only need to find unique OLE2 entry names or sets of names.

See
[https://github.com/apache/tika/blob/main/tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-microsoft-module/src/main/java/org/apache/tika/detect/microsoft/POIFSContainerDetector.java#L362]
for an example of "must have this, then one of those".

If you can run POIFSLister (and/or POIFSDumper) on a bunch of files and spot
the entry names that are common (and ideally not already used in
POIFSContainerDetector for other formats), that's what we need.
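As a rough illustration of the "must have this, then one of those" idea, here is a minimal sketch of such a rule over a set of top-level entry names. It is not the code in POIFSContainerDetector. The class EntryNameRule and the entry names used are hypothetical placeholders, not names taken from real Inventor files.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Hypothetical rule class for illustration; not part of Tika or POI.
public class EntryNameRule {
    private final String required;
    private final Set<String> anyOf;

    public EntryNameRule(String required, String... anyOf) {
        this.required = required;
        this.anyOf = new HashSet<>(Arrays.asList(anyOf));
    }

    /** True if the required entry is present and at least one alternative is present too. */
    public boolean matches(Set<String> entryNames) {
        if (!entryNames.contains(required)) {
            return false;
        }
        for (String candidate : anyOf) {
            if (entryNames.contains(candidate)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Placeholder names only; real names would come from POIFSLister output.
        EntryNameRule rule = new EntryNameRule("ExampleRoot", "ExampleParts", "ExampleAssembly");
        Set<String> names = new HashSet<>(Arrays.asList("ExampleRoot", "ExampleAssembly", "Other"));
        System.out.println(rule.matches(names));
    }
}
```

The entry names collected from a set of sample files would replace the placeholders, with one rule per file type if the types share a common required entry but differ in the alternatives.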

> Support Autodesk Inventor files (.ipt) (.iam) (.ipn) (.idw)
> ---
>
> Key: TIKA-4148
> URL: https://issues.apache.org/jira/browse/TIKA-4148
> Project: Tika
> Issue Type: Improvement
> Reporter: Alexey Pismenskiy
> Priority: Major
>
> Add support for Autodesk Inventor files in Tika. 
> Examples of the files can be downloaded from 
> [https://www.autodesk.com/support/technical/article/caas/tsarticles/ts/3gnm93P9sPAWE6vndk7fjq.html]
> It would be great to start at least at the metadata level and then add 
> content parsing later. 
> I suspect it would be something similar to DWGParser
> [https://tika.apache.org/0.9/api/org/apache/tika/parser/dwg/DWGParser.html].
> Any suggestions on where to start looking are appreciated.


