baibaichen opened a new issue, #9517:
URL: https://github.com/apache/incubator-gluten/issues/9517
### Backend
CH (ClickHouse)
### Bug description
when not set disk cache, create merge tree table on HDFS or Minio will
failed:
```bash
<Error> default.customer: Part
/tmp/gluten/3-5/hdfs/metadata/3-5/default/customer/f92148ef-8792-4c75-8f3d-5e71b32ba959_0_001/
is broken and needs manual correction. Reason: Code: 27. DB::Exception: Cannot
parse input: expected 'eof' before:
'0���<7.����\n�(\t8g��\0\0\0\0\0(�/�`�\0�\0\0 \0F�\0 � "M �� �< A�
�> W�\n��2ꣂa��=�i���ľ< ���1���l\n�wF�\0H|�3\t�\f�����
@�8^x��a�Y�Dc�-�A����0f�'. (CANNOT_PARSE_INPUT_ASSERTION_FAILED), Stack
trace (when copying this message, always include the lines below):
0. ./contrib/llvm-project/libcxx/include/__exception/exception.h:113:
Poco::Exception::Exception(String const&, int) @ 0x0000000016cda212
1. ./cmake-build-relwithdebinfo-clang19/./src/Common/Exception.cpp:108:
DB::Exception::Exception(DB::Exception::MessageMasked&&, int, bool) @
0x000000000db39bf9
2. ./src/Common/Exception.h:112:
DB::Exception::Exception(PreformattedMessage&&, int) @ 0x0000000006e8c02c
3. DB::Exception::Exception<String&>(int,
FormatStringHelperImpl<std::type_identity<String&>::type>, String&) @
0x0000000006f7de4b
4. ./cmake-build-relwithdebinfo-clang19/./src/IO/ReadHelpers.cpp:95:
DB::throwAtAssertionFailed(char const*, DB::ReadBuffer&) @ 0x000000000dc0c4fc
5. ./cmake-build-relwithdebinfo-clang19/./src/IO/ReadHelpers.cpp:138: ? @
0x000000000dc0c96e
6. ./cmake-build-relwithdebinfo-clang19/./src/Core/NamesAndTypes.cpp:95:
DB::NamesAndTypesList::readText(DB::ReadBuffer&, bool) @ 0x000000001073b721
7.
./cmake-build-relwithdebinfo-clang19/./src/Storages/MergeTree/IMergeTreeDataPart.cpp:1623:
DB::IMergeTreeDataPart::loadColumns(bool) @ 0x0000000012e1209a
8.
./cmake-build-relwithdebinfo-clang19/./src/Storages/MergeTree/IMergeTreeDataPart.cpp:841:
DB::IMergeTreeDataPart::loadColumnsChecksumsIndexes(bool, bool) @
0x0000000012e10efb
9.
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Storages/MergeTree/SparkStorageMergeTree.cpp:247:
local_engine::SparkStorageMergeTree::loadDataPart(DB::MergeTreePartInfo
const&, String const&, std::shared_ptr<DB::IDisk> const&,
DB::MergeTreeDataPartState) @ 0x000000000e05e97e
10.
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Storages/MergeTree/SparkStorageMergeTree.cpp:206:
local_engine::SparkStorageMergeTree::loadDataPartsWithNames(std::unordered_set<String,
std::hash<String>, std::equal_to<String>, std::allocator<String>> const&) @
0x000000000e05d682
11.
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Storages/MergeTree/SparkMergeTreeSink.cpp:151:
local_engine::SinkHelper::saveMetadata(std::shared_ptr<DB::Context const>
const&) @ 0x000000000e04b30b
12.
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Storages/MergeTree/SparkMergeTreeSink.cpp:286:
local_engine::SinkHelper::finish(std::shared_ptr<DB::Context const> const&) @
0x000000000e049dc6
13.
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Storages/MergeTree/SparkMergeTreeSink.cpp:93:
local_engine::SparkMergeTreeSink::onFinish() @ 0x000000000e049be3
14. ./contrib/llvm-project/libcxx/include/__functional/function.h:716: ? @
0x0000000013772306
15.
./cmake-build-relwithdebinfo-clang19/./src/Processors/Transforms/ExceptionKeepingTransform.cpp:187:
DB::ExceptionKeepingTransform::work() @ 0x0000000013771eb3
16.
./cmake-build-relwithdebinfo-clang19/./src/Processors/Executors/ExecutionThreadContext.cpp:53:
DB::ExecutionThreadContext::executeTask() @ 0x00000000134db001
17.
./cmake-build-relwithdebinfo-clang19/./src/Processors/Executors/PipelineExecutor.cpp:305:
DB::PipelineExecutor::executeStepImpl(unsigned long, std::atomic<bool>*) @
0x00000000134cfc70
18.
./cmake-build-relwithdebinfo-clang19/./src/Processors/Executors/PipelineExecutor.cpp:174:
DB::PipelineExecutor::executeStep(std::atomic<bool>*) @ 0x00000000134cf4fa
19.
./cmake-build-relwithdebinfo-clang19/./src/Processors/Executors/PullingPipelineExecutor.cpp:54:
DB::PullingPipelineExecutor::pull(DB::Chunk&) @ 0x00000000134e1894
20.
./cmake-build-relwithdebinfo-clang19/./src/Processors/Executors/PullingPipelineExecutor.cpp:65:
DB::PullingPipelineExecutor::pull(DB::Block&) @ 0x00000000134e19f9
21.
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/Parser/LocalExecutor.cpp:69:
local_engine::LocalExecutor::hasNext() @ 0x000000000df73211
22.
./cmake-build-relwithdebinfo-clang19/./utils/extern-local-engine/local_engine_jni.cpp:282:
Java_org_apache_gluten_vectorized_BatchIterator_nativeHasNext @
0x0000000006e6f457
````
### Gluten version
main branch
### Spark version
None
### Spark configurations
_No response_
### System information
_No response_
### Relevant logs
```bash
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]