yiguolei commented on code in PR #52676: URL: https://github.com/apache/doris/pull/52676#discussion_r2182085617
########## be/src/vec/data_types/serde/data_type_number_serde.cpp: ########## @@ -424,16 +424,38 @@ Status DataTypeNumberSerDe<T>::write_column_to_orc(const std::string& timezone, if constexpr (T == TYPE_LARGEINT) { // largeint auto* cur_batch = dynamic_cast<orc::StringVectorBatch*>(orc_col_batch); - - INIT_MEMORY_FOR_ORC_WRITER() - + // First pass: calculate total memory needed and collect serialized values + size_t total_size = 0; for (size_t row_id = start; row_id < end; row_id++) { if (cur_batch->notNull[row_id] == 1) { std::string value_str = fmt::format("{}", col_data[row_id]); Review Comment: 这个也可以提前保存 string vector,避免451 行重复计算的 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For additional commands, e-mail: commits-h...@doris.apache.org