[
https://issues.apache.org/jira/browse/ARROW-16386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17529480#comment-17529480
]
David Li commented on ARROW-16386:
----------------------------------
It looks like it's because ORC depends on tzdata? Not sure if Arrow can do much
about that. Possibly thrown from here:
[https://github.com/apache/orc/blob/0c4dbaaad036989e34d84d547bca9bc342d4883e/c%2B%2B/src/Timezone.cc#L689-L707]
> Simple example arrow script fails on ubuntu:latest docker container
> -------------------------------------------------------------------
>
> Key: ARROW-16386
> URL: https://issues.apache.org/jira/browse/ARROW-16386
> Project: Apache Arrow
> Issue Type: Bug
> Environment: Active environment is described by Docker file, but I'm
> running docker on windows 10.
> Reporter: Larry Dawson
> Priority: Major
>
> This docker file using ubuntu:latest, which at this time equates to
> ubuntu:jammy:
> {code:java}
> from ubuntu:latest
> run apt update -y
> run apt upgrade -y
> run apt install -y vim libssl-dev libpq-dev python3 python3-venv
> build-essential
> run cd /opt && mkdir python_environments && cd python_environments && python3
> -m venv venv && . venv/bin/activate && python -m pip install --upgrade pip &&
> pip install pyarrow pandas
> run cd /opt/python_environments && . venv/bin/activate && python -c "import
> pandas as pd; import pyarrow as pa; from pyarrow import orc;
> orc.write_table(pa.table({'col1': [1,2,3]}), 'test.orc'); pdtbl =
> orc.read_table('test.orc')" {code}
> Fails on the orc.read_table command with this error:
> {code:java}
> #9 0.939 terminate called after throwing an instance of 'orc::TimezoneError'
> #9 0.939 what(): Can't open /etc/localtime
> #9 0.944 Aborted {code}
> The error report is accurate, there is no /etc/localtime file in the
> ubuntu:latest docker image
>
--
This message was sent by Atlassian Jira
(v8.20.7#820007)