0AyanamiRei opened a new pull request, #64711:
URL: https://github.com/apache/doris/pull/64711

   ### What problem does this PR solve?
   
   Issue Number: None
   
   Related PR: None
   
   Problem Summary: Stream load and HTTP stream load parse compressed CSV data 
through a legacy format enum list when estimating async group commit content 
length, while newer compress_type-based inputs such as ZSTD remain 
FORMAT_CSV_PLAIN or FORMAT_JSON. This makes ZSTD stream loads unsupported by 
the shared load-format parser and misses compressed-load handling in group 
commit paths. This change teaches LoadUtil to parse ZSTD, centralizes 
compressed load detection by compress_type plus legacy format types, applies it 
to stream load and HTTP stream group commit planning, and adds zstd regression 
coverage for CSV, JSON, group commit stream load, and group commit HTTP stream 
paths.
   
   ### Release note
   
   Support ZSTD compressed stream load and HTTP stream load inputs.
   
   ### Check List (For Author)
   
   - Test:
       - Unit Test: ./run-be-ut.sh --run --filter=LoadUtilTest.*
       - Regression test: ./run-regression-test.sh --run -d load_p0/stream_load 
-s test_stream_load_compress_type -c 
'jdbc:mysql://127.0.0.1:33520/?useLocalSessionState=true&allowLoadLocalInfile=true&zeroDateTimeBehavior=round'
 -ha 127.0.0.1:33500
       - Regression test: ./run-regression-test.sh --run -d load_p0/stream_load 
-s test_group_commit_stream_load -c 
'jdbc:mysql://127.0.0.1:33520/?useLocalSessionState=true&allowLoadLocalInfile=true&zeroDateTimeBehavior=round'
 -ha 127.0.0.1:33500
       - Regression test: ./run-regression-test.sh --run -d load_p0/http_stream 
-s test_group_commit_http_stream -c 
'jdbc:mysql://127.0.0.1:33520/?useLocalSessionState=true&allowLoadLocalInfile=true&zeroDateTimeBehavior=round'
 -ha 127.0.0.1:33500
       - Static check: ./build-support/clang-format.sh && 
./build-support/check-format.sh
       - Static check attempted: 
CLANG_TIDY_BINARY=/data/data3/huangruixin/include/src-master/ldb_toolchain/bin/clang-tidy
 ./build-support/run-clang-tidy.sh (blocked by existing/environment 
diagnostics: existing HTTP action and LoadUtilTest lint findings plus stddef.h 
not found; new zstd helper warning fixed and load_util.cpp/load_util.h reported 
clean)
   - Behavior changed: Yes. Stream load and HTTP stream load now accept ZSTD as 
compress_type and treat it as compressed input for async group commit content 
length estimation.
   - Does this need documentation: No
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to