YuriyGavrilov opened a new issue, #7848:
URL: https://github.com/apache/seatunnel/issues/7848

   ### Search before asking
   
   - [X] I had searched in the 
[feature](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22Feature%22)
 and found no similar feature requirement.
   
   
   ### Description
   
   There is an option in excel which named as Transpose. It is Rotate data in 
90 degree. 
   so: 1 2 3 4 will be 
   1
   2
   3
   4
   
   and 
   5
   6
   7
   8
    
   could be as: 5 6 7 8
   
   It is nice option to have for LLM for categorizing data and making some data 
about data like knowledge base.
   
   ### Usage Scenario
   
   1. We have 10 rows for each tables. it could be 100 or 10000 or etc. 
   2. We parameterize job to loop all the tables and take for 10 row for each 
of it. 
   3. Transpose it to: Catalog, Schema, Table, Column , Row1, Row2, Row3, 
...etc.
   4. Next we will ask LLM about finding personal or sensitive data inside 
names or values (fixed row number) 
   5. We will collect all the sinked data to one table or whatever to Catalog, 
Schema, Table, Column, Tables Category, Attribute Category, Sensitive (true or 
false), Description. 
   6. In addition there could be option to retrieve and sent database comments 
to the prompt like: Catalog, Catalog Description (comments), Schema, Schema 
Description ( comments ), Table, Table description ( comments ), Column name, 
Column description ( comments ), Row1, Row2, Row3 ( as values ).
   7. All those sink output could be send someway to Datahub and add additional 
attributes to tables metadata for defining security level for accessing users 
to the tables. 
   8. There are also could be different cases to use transpose if we make some 
grouping by on some dimensions with fixed row size and collection output on 
different ways. 
   
   ### Related issues
   
   Need to have additional steps or stages to prepare data for LLMs prompt. 
   
   ### Are you willing to submit a PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to