felixwluo opened a new pull request, #34589:
URL: https://github.com/apache/doris/pull/34589
## Proposed changes
Issue Number: close #xxx
<!--Describe your changes.-->
fixed an issue where the hive catalog field delimiter is an empty string,
causing it to be core
1、hive build statement
```
CREATE TABLE `hive_q1`(
`id` int,
`name` string)
ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
WITH SERDEPROPERTIES (
'field.delim'='',
'serialization.format'='')
STORED AS INPUTFORMAT
'org.apache.hadoop.mapred.TextInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION
'hdfs://HDFSxxxxx/usr/hive/warehouse/hive_q1'
TBLPROPERTIES (
'transient_lastDdlTime'='1715175063')
```
2、problem
`When the field.delim attribute value is an empty string, the be core dumps
because of the separator problem`
3、be stack
```
0# doris::signal::(anonymous namespace)::FailureSignalHandler(int,
siginfo_t*, void*) in /usr/local/service/doris/lib/be/doris_be
1# os::Linux::chained_handler(int, siginfo*, void*) in
/usr/local/jdk/jre/lib/amd64/server/libjvm.so
2# JVM_handle_linux_signal in /usr/local/jdk/jre/lib/amd64/server/libjvm.so
3# signalHandler(int, siginfo*, void*) in
/usr/local/jdk/jre/lib/amd64/server/libjvm.so
4# 0x00007F75D264E400 in /lib64/libc.so.6
5# __memset_sse2 in /lib64/libc.so.6
6#
doris::vectorized::PlainCsvTextFieldSplitter::_split_field_multi_char(doris::Slice
const&, std::vector<doris::Slice, std::allocator<doris::Slice> >*) in
/usr/local/service/doris/lib/be/doris_be
7# doris::vectorized::CsvReader::_line_split_to_values(doris::Slice const&,
bool*) in /usr/local/service/doris/lib/be/doris_be
8# doris::vectorized::CsvReader::_fill_dest_columns(doris::Slice const&,
doris::vectorized::Block*,
std::vector<COW<doris::vectorized::IColumn>::mutable_ptr<doris::vectorized::IColumn>,
std::allocator<COW<doris::vectorized::IColumn>::mutable_ptr<doris::vectorized::IColumn>
> >&, unsigned long*) in /usr/local/service/doris/lib/be/doris_be
9# doris::vectorized::CsvReader::get_next_block(doris::vectorized::Block*,
unsigned long*, bool*) in /usr/local/service/doris/lib/be/doris_be
10# doris::vectorized::VFileScanner::_get_block_impl(doris::RuntimeState*,
doris::vectorized::Block*, bool*) in /usr/local/service/doris/lib/be/doris_be
11# doris::vectorized::VScanner::get_block(doris::RuntimeState*,
doris::vectorized::Block*, bool*) in /usr/local/service/doris/lib/be/doris_be
12#
doris::vectorized::ScannerScheduler::_scanner_scan(doris::vectorized::ScannerScheduler*,
doris::vectorized::ScannerContext*,
std::shared_ptr<doris::vectorized::VScanner>) in
/usr/local/service/doris/lib/be/doris_be
13# std::_Function_handler<void (),
doris::vectorized::ScannerScheduler::_schedule_scanners(doris::vectorized::ScannerContext*)::$_1::operator()()
const::{lambda()#4}>::_M_invoke(std::_Any_data const&) in
/usr/local/service/doris/lib/be/doris_be
14# doris::WorkThreadPool<true>::work_thread(int) in
/usr/local/service/doris/lib/be/doris_be
15# execute_native_thread_routine in /usr/local/service/doris/lib/be/doris_be
16# start_thread in /lib64/libpthread.so.0
17# __clone in /lib64/libc.so.6
```
## Further comments
If this is a relatively large or complex change, kick off the discussion at
[[email protected]](mailto:[email protected]) by explaining why you
chose the solution you did and what alternatives you considered, etc...
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]