Nicholas DiPiazza created TIKA-4605:
---------------------------------------

             Summary: Add Google Drive fetcher plugin
                 Key: TIKA-4605
                 URL: https://issues.apache.org/jira/browse/TIKA-4605
             Project: Tika
          Issue Type: New Feature
            Reporter: Nicholas DiPiazza


h2. Overview
Port the Google Drive fetcher from the external tika-pipes repository as a new 
Tika plugin. This fetcher enables fetching content from Google Drive using 
OAuth2 authentication.

h2. Implementation Details
* Port code from: 
https://github.com/nddipiazza/tika-pipes/tree/main/tika-pipes-fetchers/tika-fetcher-google-drive
* Create new plugin module: 
*tika-pipes/tika-pipes-plugins/tika-pipes-google-drive*
* Implement as a standard Tika pipes plugin (following plugin architecture)
* Support OAuth2 authentication for Google Drive
* Include appropriate dependencies and configuration

h2. Features
* Fetch files from Google Drive
* OAuth2 token-based authentication
* Support for Google Drive API
* Configurable service account credentials
* Error handling and retry logic

h2. Acceptance Criteria
* Google Drive fetcher integrated as a Tika plugin
* Plugin follows standard Tika plugin architecture (like TIKA-4604)
* Configuration supports OAuth2/service account authentication
* Code follows Apache Tika patterns (extends AbstractTikaExtension, uses 
Metadata)
* All existing tests pass
* forbiddenapis check passes
* Plugin can be loaded dynamically by tika-grpc

h2. Reference
* External implementation: 
https://github.com/nddipiazza/tika-pipes/tree/main/tika-pipes-fetchers/tika-fetcher-google-drive
* Similar implementation: TIKA-4604 (Atlassian JWT fetcher)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to