[
https://issues.apache.org/jira/browse/DRILL-7405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16955023#comment-16955023
]
ASF GitHub Bot commented on DRILL-7405:
---------------------------------------
sohami commented on issue #1874: DRILL-7405: Avoiding download of TPC-H data
URL: https://github.com/apache/drill/pull/1874#issuecomment-544024689
Looks like these files were packaged as jar in Drill class path as an
example data for users to run some exploratory queries. I think putting these
files as part of source repo should be fine.
@vvysotskyi : I think your main concern is related to the unit tests data
files which are merged with the source files. I guess that was done to keep the
test execution time lower otherwise ideally unit tests should use the in-memory
data generator for it's use. May be we should come up with some policies which
can dictate when is it fine to check in the test data file and when one should
use in-memory data generator.
Also how does moving data files to a separate git repo will help here ?
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
> Build fails due to inaccessible apache-drill on S3 storage
> ----------------------------------------------------------
>
> Key: DRILL-7405
> URL: https://issues.apache.org/jira/browse/DRILL-7405
> Project: Apache Drill
> Issue Type: Task
> Components: Tools, Build & Test
> Affects Versions: 1.16.0
> Reporter: Boaz Ben-Zvi
> Assignee: Abhishek Girish
> Priority: Critical
> Fix For: 1.17.0
>
>
> A new clean build (e.g. after deleting the ~/.m2 local repository) would
> fail now due to:
> Access denied to:
> [http://apache-drill.s3.amazonaws.com|https://urldefense.proofpoint.com/v2/url?u=http-3A__apache-2Ddrill.s3.amazonaws.com_files_sf-2D0.01-5Ftpc-2Dh-5Fparquet-5Ftyped.tgz&d=DwMGaQ&c=C5b8zRQO1miGmBeVZ2LFWg&r=KLC1nKJ8dIOnUay2kR6CAw&m=08mf7Xfn1orlbAA60GKLIuj_PTtfaSAijrKDLOucMPU&s=CX97We3sm3ZZ_aVJIrsUdXVJ3CNMYg7p3IsxbJpuXWk&e=]
>
> (e.g., for the test data sf-0.01_tpc-h_parquet_typed.tgz )
> A new publicly available storage place is needed, plus appropriate changes in
> Drill to get to these resources.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)