[ 
https://issues.apache.org/jira/browse/HAWQ-186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15046200#comment-15046200
 ] 

Lin Wen commented on HAWQ-186:
------------------------------

>From my quick investigation there are:

1. The stupid stuff that can simply be deleted

depends/libyarn/releng/bin/templates/.core-site-secure-ha.xml.swp
tools/bin/ext/yaml/__init__.pyc
tools/bin/ext/yaml/composer.pyc
tools/bin/ext/yaml/constructor.pyc
tools/bin/ext/yaml/cyaml.pyc
tools/bin/ext/yaml/dumper.pyc
tools/bin/ext/yaml/emitter.pyc
tools/bin/ext/yaml/error.pyc
tools/bin/ext/yaml/events.pyc
tools/bin/ext/yaml/loader.pyc
tools/bin/ext/yaml/nodes.pyc
tools/bin/ext/yaml/parser.pyc
tools/bin/ext/yaml/reader.pyc
tools/bin/ext/yaml/representer.pyc
tools/bin/ext/yaml/resolver.pyc
tools/bin/ext/yaml/scanner.pyc
tools/bin/ext/yaml/serializer.pyc
tools/bin/ext/yaml/tokens.pyc


2. Data files used in testing

  The most substantial list being the ones under
  contrib/hawq-hadoop/hawq-mapreduce-tool/test-data/hawq-data/gpseg1


3. Jar files and tar files of various dependencies, including:

contrib/hawq-hadoop/hawq-mapreduce-tool/lib/parquet-column-1.1.0.jar
contrib/hawq-hadoop/hawq-mapreduce-tool/lib/parquet-common-1.1.0.jar
contrib/hawq-hadoop/hawq-mapreduce-tool/lib/parquet-encoding-1.1.0.jar
contrib/hawq-hadoop/hawq-mapreduce-tool/lib/parquet-format-1.0.0.jar
contrib/hawq-hadoop/hawq-mapreduce-tool/lib/parquet-hadoop-1.1.0.jar
contrib/hawq-hadoop/hawq-mapreduce-tool/lib/postgresql-9.2-1003-jdbc4.jar
contrib/hawq-hadoop/hawq-mapreduce-tool/lib/snakeyaml-1.12.jar
contrib/hawq-hadoop/hawq-mapreduce-tool/lib/snappy-java-1.1.0.jar
pxf/gradle/wrapper/gradle-wrapper.jar
tools/bin/pythonSrc/epydoc-3.0.1.tar.gz
tools/bin/pythonSrc/lockfile-0.9.1.tar.gz
tools/bin/pythonSrc/logilab-astng-0.20.1.tar.gz
tools/bin/pythonSrc/logilab-common-0.50.1.tar.gz
tools/bin/pythonSrc/paramiko-1.7.6-9.tar.gz
tools/bin/pythonSrc/PSI-0.3b2_gp.tar.gz
tools/bin/pythonSrc/pychecker-0.8.18.tar.gz
tools/bin/pythonSrc/pycrypto-2.0.1.tar.gz
tools/bin/pythonSrc/pylint-0.21.0.tar.gz
tools/bin/pythonSrc/unittest2-0.5.1.tar.gz
tools/bin/src/netperf-2.4.3.tar.gz

4. a cacerts file

depends/libyarn/releng/make/dependencies/cacerts

5. a couple image files

src/port/win32.ico
src/tools/backend/flow.gif


The above was identified from the following

    find * -type f -exec file '{}' + | grep -v text | grep -v empty >
/tmp/binfiles

There were additionally some false positives such as
   src/test/regress/data/mpp12839_1.data

Which are basically text but include some unusual characters.

Regards,
  Caleb


Hi, Lirong, 
Above is from Caleb's email. Please remove these useless files. 

> clean code
> ----------
>
>                 Key: HAWQ-186
>                 URL: https://issues.apache.org/jira/browse/HAWQ-186
>             Project: Apache HAWQ
>          Issue Type: Bug
>          Components: Build
>            Reporter: Lei Chang
>            Assignee: Lirong Jian
>             Fix For: 2.0.0-beta-incubating
>
>
>    1. We need to add the DISCLAIMER file
>    2.  There are a few things checked into the source tree that
>    definitely shouldn't be in the release tarball:
>    * anythings that's *.tar.gz
>    * anything that's a dependency jar
>    * ./depends/libyarn/releng/bin/templates/.core-site-secure-ha.xml.swp
>    * ./contrib/hawq-hadoop/hawq-mapreduce-tool/lib/*.jar
>    * ./tools/bin/ext/yaml/*.pyc
>    3. There are a few things that I'd like to understand
>    why they are there:
>    * ./depends/libyarn/releng/make/dependencies/cacerts



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to