Hisoka-X commented on PR #9760: URL: https://github.com/apache/seatunnel/pull/9760#issuecomment-3222408289
> I should add it to the FileFormat enum and implement a corresponding ReadStrategy? Yes. > And based on the chunkSize and overlap values, I can create a new class similar to MultipleTableFileSourceReader to convert it into structured data. Additionally, I should implement a new SourceSplit, for example, by creating something like RagFileSourceSplit. No, the MultipleTableFileSourceReader and SourceSplit used by all FileFormat. MultipleTableFileSourceReader used to read file path, ReadStrategy used to parse data in file. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
