Hello all
I'm looking to the Maven configuration. I noticed the "apache-release"
profile, which run the assembly plugin. This plugin creates two files in
the root "target" directory:
sis-0.3-incubating-SNAPSHOT-source-release.zip
sis-0.3-incubating-SNAPSHOT-src.zip
I didn't performed an extensive comparison, but at a first look the
content of those two files seem identical. Does anyone know why this
apparent duplication?
If I'm understanding right,
"sis-0.3-incubating-SNAPSHOT-source-release.zip" would be the official
file to be downloaded by users wanting SIS source code. This file is 5
Mb big. This large size is caused by the
"sis-data/California_Restaurants.csv" file, which is 19 Mb before
compression. At first I though that this was a file used for test
suites, but a search for "California_Restaurants" from the project root
directory gave me no hit. A search for "sis-data" neither; this module
doesn't seem to be used.
I wonder what is the purpose of "sis-data"? Is it a place-holder for a
future module providing data for test suites? If so, then I would
propose to limit ourselves to small files since those data may not be
the primary interest of peoples downloading the source code, and in
order to make debugging easier since it is harder to step through a
program performing million of iterations. We will nevertheless need a
few big test files, but I propose to put them in a separated optional
download. The JUnit tests using those big files would be automatically
skipped if those big data are not present (the exact location of those
optional files would be a future discussion if the principle is okay).
Martin