I'm wondering about the size of the builds, which are surprisingly big to me. 
The src is 12M/13M and the bin is 17M/26M (tar.gz/zip) for 2.9.1, similar for 
3.0.0.

In looking at the binary artifact I see the following:
* Every contrib jar has a corresponding javadoc jar, but there is no 
core-javadoc.jar, however, there is a doc folder that is not jarred and it has 
all the contrib documents in it.

Is it really needed to include it in the bin at all? In my working environment, 
Eclipse, they are entirely unnecessary when one has the src.zip. I imagine that 
other IDEs are similar. And they are trivial to generate. I'd rather see a 
separate JavaDoc tar.gz/zip.

And if it is needed in the bin artifact, is it necessary to have it 
uncompressed and partially duplicated?

The contrib javadoc.jars total 4.3M and the docs/api when zipped or jarred has 
a size of 13.7M.

For whatever reason, gzip is much better at compressing javadoc than zip or jar 
is. While not duplicating the contrib javadocs would be better, not jarring the 
contrib javadocs would improve the gzip compressibility for bin artifact and 
not adversely affect the size of the zip.

* There is a src folder that has a few things in it.

Aren't these in another jar? And shouldn't they folder not be there? After all 
it is not a src artifact.

* The lib folder has the servlet-api-2.4.jar in it, but the junit-3.8.2.jar is 
not.

Should either of these be there?


Regarding the src artifact I see the following:
* It is far more than the src for the bin artifact.
** It includes site files.
** It includes tests.
** It includes 3-rd party jars for contrib. 6.3M of them.

I get that it is merely an export of SVN, but should it be?
Could it be broken out into separate parts? Would that make sense?
E.g.
lucene-src -- Contains the parts for the bin jar.
lucene-test -- Contains the tests.
lucene-dependency -- Contains the 3-rd party jars.
lucene-misc -- Contains the site files and miscellaneous other stuff.

Regarding the 3-rd party jars, there are 2 jars that are not in svn that 
lucene/contrib requires and to get them one has to bootstrap by running ant. 
These are the bdb libs.

I also see that lucene has a patched Xerces (is that still necessary?) and a 
custom build of ICU4J (are there instructions for creating that? I didn't find 
them.).

Every release of Lucene, I find my self repackaging the bin and src to skinny 
it down to what we need for our development environment. That's my motivation 
for raising these questions.

If we can figure out if or what should change, I'd be glad to do the ant work. 
I know enough of ant to be dangerous ;)

-- DM





---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to