Hello all

I'm looking to the Maven configuration. I noticed the "apache-release" profile, which run the assembly plugin. This plugin creates two files in the root "target" directory:

    sis-0.3-incubating-SNAPSHOT-source-release.zip
    sis-0.3-incubating-SNAPSHOT-src.zip

I didn't performed an extensive comparison, but at a first look the content of those two files seem identical. Does anyone know why this apparent duplication?

If I'm understanding right, "sis-0.3-incubating-SNAPSHOT-source-release.zip" would be the official file to be downloaded by users wanting SIS source code. This file is 5 Mb big. This large size is caused by the "sis-data/California_Restaurants.csv" file, which is 19 Mb before compression. At first I though that this was a file used for test suites, but a search for "California_Restaurants" from the project root directory gave me no hit. A search for "sis-data" neither; this module doesn't seem to be used.

I wonder what is the purpose of "sis-data"? Is it a place-holder for a future module providing data for test suites? If so, then I would propose to limit ourselves to small files since those data may not be the primary interest of peoples downloading the source code, and in order to make debugging easier since it is harder to step through a program performing million of iterations. We will nevertheless need a few big test files, but I propose to put them in a separated optional download. The JUnit tests using those big files would be automatically skipped if those big data are not present (the exact location of those optional files would be a future discussion if the principle is okay).

    Martin

Reply via email to