baibaichen opened a new issue, #9517:
URL: https://github.com/apache/incubator-gluten/issues/9517

   ### Backend
   
   CH (ClickHouse)
   
   ### Bug description
   
   when not set disk cache, create merge tree table on HDFS or Minio will 
failed:
   
   ```bash
    <Error> default.customer: Part 
/tmp/gluten/3-5/hdfs/metadata/3-5/default/customer/f92148ef-8792-4c75-8f3d-5e71b32ba959_0_001/
 is broken and needs manual correction. Reason: Code: 27. DB::Exception: Cannot 
parse input: expected 'eof' before: 
'0���<7.����\n�(\t8g��\0\0\0\0\0(�/�`�\0�\0\0 \0F�\0 � "M �� �< A� 
�> W�\n��2ꣂa��=�i���ľ< ���1���l\n�wF�\0H|�3\t�\f����� 
@�8^x��a�Y�Dc�-�A����0f�'. (CANNOT_PARSE_INPUT_ASSERTION_FAILED), Stack 
trace (when copying this message, always include the lines below):
   
   0. ./contrib/llvm-project/libcxx/include/__exception/exception.h:113: 
Poco::Exception::Exception(String const&, int) @ 0x0000000016cda212
   1. ./cmake-build-relwithdebinfo-clang19/./src/Common/Exception.cpp:108: 
DB::Exception::Exception(DB::Exception::MessageMasked&&, int, bool) @ 
0x000000000db39bf9
   2. ./src/Common/Exception.h:112: 
DB::Exception::Exception(PreformattedMessage&&, int) @ 0x0000000006e8c02c
   3. DB::Exception::Exception<String&>(int, 
FormatStringHelperImpl<std::type_identity<String&>::type>, String&) @ 
0x0000000006f7de4b
   4. ./cmake-build-relwithdebinfo-clang19/./src/IO/ReadHelpers.cpp:95: 
DB::throwAtAssertionFailed(char const*, DB::ReadBuffer&) @ 0x000000000dc0c4fc
   5. ./cmake-build-relwithdebinfo-clang19/./src/IO/ReadHelpers.cpp:138: ? @ 
0x000000000dc0c96e
   6. ./cmake-build-relwithdebinfo-clang19/./src/Core/NamesAndTypes.cpp:95: 
DB::NamesAndTypesList::readText(DB::ReadBuffer&, bool) @ 0x000000001073b721
   7. 
./cmake-build-relwithdebinfo-clang19/./src/Storages/MergeTree/IMergeTreeDataPart.cpp:1623:
 DB::IMergeTreeDataPart::loadColumns(bool) @ 0x0000000012e1209a
   8. 
./cmake-build-relwithdebinfo-clang19/./src/Storages/MergeTree/IMergeTreeDataPart.cpp:841:
 DB::IMergeTreeDataPart::loadColumnsChecksumsIndexes(bool, bool) @ 
0x0000000012e10efb
   9. 
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Storages/MergeTree/SparkStorageMergeTree.cpp:247:
 local_engine::SparkStorageMergeTree::loadDataPart(DB::MergeTreePartInfo 
const&, String const&, std::shared_ptr<DB::IDisk> const&, 
DB::MergeTreeDataPartState) @ 0x000000000e05e97e
   10. 
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Storages/MergeTree/SparkStorageMergeTree.cpp:206:
 
local_engine::SparkStorageMergeTree::loadDataPartsWithNames(std::unordered_set<String,
 std::hash<String>, std::equal_to<String>, std::allocator<String>> const&) @ 
0x000000000e05d682
   11. 
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Storages/MergeTree/SparkMergeTreeSink.cpp:151:
 local_engine::SinkHelper::saveMetadata(std::shared_ptr<DB::Context const> 
const&) @ 0x000000000e04b30b
   12. 
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Storages/MergeTree/SparkMergeTreeSink.cpp:286:
 local_engine::SinkHelper::finish(std::shared_ptr<DB::Context const> const&) @ 
0x000000000e049dc6
   13. 
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Storages/MergeTree/SparkMergeTreeSink.cpp:93:
 local_engine::SparkMergeTreeSink::onFinish() @ 0x000000000e049be3
   14. ./contrib/llvm-project/libcxx/include/__functional/function.h:716: ? @ 
0x0000000013772306
   15. 
./cmake-build-relwithdebinfo-clang19/./src/Processors/Transforms/ExceptionKeepingTransform.cpp:187:
 DB::ExceptionKeepingTransform::work() @ 0x0000000013771eb3
   16. 
./cmake-build-relwithdebinfo-clang19/./src/Processors/Executors/ExecutionThreadContext.cpp:53:
 DB::ExecutionThreadContext::executeTask() @ 0x00000000134db001
   17. 
./cmake-build-relwithdebinfo-clang19/./src/Processors/Executors/PipelineExecutor.cpp:305:
 DB::PipelineExecutor::executeStepImpl(unsigned long, std::atomic<bool>*) @ 
0x00000000134cfc70
   18. 
./cmake-build-relwithdebinfo-clang19/./src/Processors/Executors/PipelineExecutor.cpp:174:
 DB::PipelineExecutor::executeStep(std::atomic<bool>*) @ 0x00000000134cf4fa
   19. 
./cmake-build-relwithdebinfo-clang19/./src/Processors/Executors/PullingPipelineExecutor.cpp:54:
 DB::PullingPipelineExecutor::pull(DB::Chunk&) @ 0x00000000134e1894
   20. 
./cmake-build-relwithdebinfo-clang19/./src/Processors/Executors/PullingPipelineExecutor.cpp:65:
 DB::PullingPipelineExecutor::pull(DB::Block&) @ 0x00000000134e19f9
   21. 
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Parser/LocalExecutor.cpp:69:
 local_engine::LocalExecutor::hasNext() @ 0x000000000df73211
   22. 
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/local_engine_jni.cpp:282:
 Java_org_apache_gluten_vectorized_BatchIterator_nativeHasNext @ 
0x0000000006e6f457
   ````
   
   ### Gluten version
   
   main branch
   
   ### Spark version
   
   None
   
   ### Spark configurations
   
   _No response_
   
   ### System information
   
   _No response_
   
   ### Relevant logs
   
   ```bash
   
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to