Nicholas DiPiazza created TIKA-4605:
---------------------------------------
Summary: Add Google Drive fetcher plugin
Key: TIKA-4605
URL: https://issues.apache.org/jira/browse/TIKA-4605
Project: Tika
Issue Type: New Feature
Reporter: Nicholas DiPiazza
h2. Overview
Port the Google Drive fetcher from the external tika-pipes repository as a new
Tika plugin. This fetcher enables fetching content from Google Drive using
OAuth2 authentication.
h2. Implementation Details
* Port code from:
https://github.com/nddipiazza/tika-pipes/tree/main/tika-pipes-fetchers/tika-fetcher-google-drive
* Create new plugin module:
*tika-pipes/tika-pipes-plugins/tika-pipes-google-drive*
* Implement as a standard Tika pipes plugin (following plugin architecture)
* Support OAuth2 authentication for Google Drive
* Include appropriate dependencies and configuration
h2. Features
* Fetch files from Google Drive
* OAuth2 token-based authentication
* Support for Google Drive API
* Configurable service account credentials
* Error handling and retry logic
h2. Acceptance Criteria
* Google Drive fetcher integrated as a Tika plugin
* Plugin follows standard Tika plugin architecture (like TIKA-4604)
* Configuration supports OAuth2/service account authentication
* Code follows Apache Tika patterns (extends AbstractTikaExtension, uses
Metadata)
* All existing tests pass
* forbiddenapis check passes
* Plugin can be loaded dynamically by tika-grpc
h2. Reference
* External implementation:
https://github.com/nddipiazza/tika-pipes/tree/main/tika-pipes-fetchers/tika-fetcher-google-drive
* Similar implementation: TIKA-4604 (Atlassian JWT fetcher)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)