zhedoubushishi opened a new pull request #1869:
URL: https://github.com/apache/hudi/pull/1869


   ## What is the purpose of the pull request
   
   Support bootstrap-related functions through the Hudi CLI, including running a bootstrap and displaying index mapping information.
   
   Some use cases:
   
   List all index mappings for a Hudi table
   ```
   hudi-cli> bootstrap show index mapping
   
   ==================================================================================
   | Partition | FileId | Hudi File Path | Source File Partition | Source File Path |
   ==================================================================================
   ...
   ```
   
   Get all mappings for a partition
   ```
   hudi-cli> bootstrap show index mapping --partition "2018/01/01"
   
   ==================================================================================
   | Partition | FileId | Hudi File Path | Source File Partition | Source File Path |
   ==================================================================================
   ...
   ```
   
   Get the mapping for a fileId
   ```
   hudi-cli> bootstrap show index mapping --fileId "12d36820-8a6d-402b-a387-21852af24af0"
   
   ==================================================================================
   | Partition | FileId | Hudi File Path | Source File Partition | Source File Path |
   ==================================================================================
   ...
   ```
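
The three `bootstrap show index mapping` variants above differ only in which optional filter is applied before the rows are printed. As a rough illustration of that behavior, here is a minimal, self-contained sketch. It is not the code in `BootstrapCommand.java`: the class `BootstrapMappingSketch`, the `Mapping` type, the `filter` and `render` helpers, and all sample values are hypothetical and exist only for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch: none of these names come from the Hudi code base.
public class BootstrapMappingSketch {

    // One row of the mapping table, holding the five columns shown in the CLI output.
    static final class Mapping {
        final String partition;
        final String fileId;
        final String hudiFilePath;
        final String sourcePartition;
        final String sourceFilePath;

        Mapping(String partition, String fileId, String hudiFilePath,
                String sourcePartition, String sourceFilePath) {
            this.partition = partition;
            this.fileId = fileId;
            this.hudiFilePath = hudiFilePath;
            this.sourcePartition = sourcePartition;
            this.sourceFilePath = sourceFilePath;
        }
    }

    // Returns the rows that match both filters; a null filter matches every row.
    static List<Mapping> filter(List<Mapping> all, String partition, String fileId) {
        List<Mapping> out = new ArrayList<>();
        for (Mapping m : all) {
            boolean partitionOk = partition == null || partition.equals(m.partition);
            boolean fileIdOk = fileId == null || fileId.equals(m.fileId);
            if (partitionOk && fileIdOk) {
                out.add(m);
            }
        }
        return out;
    }

    // Renders one row in the pipe-delimited layout used by the table header.
    static String render(Mapping m) {
        return "| " + String.join(" | ", m.partition, m.fileId, m.hudiFilePath,
                m.sourcePartition, m.sourceFilePath) + " |";
    }

    public static void main(String[] args) {
        // Made-up sample rows; real values would come from the table's bootstrap index.
        List<Mapping> all = Arrays.asList(
            new Mapping("2018/01/01", "file-a", "hudi/2018/01/01/file-a",
                        "2018/01/01", "src/2018/01/01/a.parquet"),
            new Mapping("2018/01/02", "file-b", "hudi/2018/01/02/file-b",
                        "2018/01/02", "src/2018/01/02/b.parquet"));

        // Equivalent in spirit to passing only --partition "2018/01/01".
        for (Mapping m : filter(all, "2018/01/01", null)) {
            System.out.println(render(m));
        }
    }
}
```

Passing `null` for both filters corresponds to the unfiltered listing, and passing only a file ID corresponds to the `--fileId` form.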
   
   Run a bootstrap from the CLI
   ```
   hudi-cli> bootstrap run -sourcePath "s3://xxx" -recordKeyColumns "id"
   ```
   
   ## Brief change log
   
   Added `BootstrapCommand.java` for bootstrap-related CLI commands.
   Added `ITTestBootstrapCommand.java` for end-to-end testing of the bootstrap CLI commands.
   
   ## Verify this pull request
   
   This change adds tests and can be verified as follows:
   
     - *Added end-to-end integration tests.*
   
   ## Committer checklist
   
    - [x] Has a corresponding JIRA in PR title & commit
    
    - [x] Commit message is descriptive of the change
    
    - [ ] CI is green
   
    - [ ] Necessary doc changes done or have another open PR
          
    - [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

