I've made the following changes to the ARC case publishing process:
1. The trigger that results in a case being redacted is now visible in
the REDACTED.txt file, for example
http://arc.opensolaris.org/caselog/PSARC/2009/543/REDACTED.txt:
ercheck.html - Sun Proprietary
2. .ms files are now published. To avoid false triggering of the
redaction rules, any comment lines (lines beginning with .\") are
ignored by the scan.
3. The redacted_cases.html file has been renamed to
http://arc.opensolaris.org/caselog/unpublished_cases.html and now
includes information on empty cases, of which there are currently 189.
In addition, the status column in the yearly summary files also shows
'Empty' for such cases, for example
http://arc.opensolaris.org/caselog/PSARC/2005/
Empty cases are cases where the case has not been redacted but there are
no files that can be published. Usually this is because the case has
been marked as 'manual' without any of the case files actually being
opened - see
http://hub.opensolaris.org/bin/view/Community+Group+arc/arc-publish-historical-checklist
for the details of the process.
There have been NO changes in the way these cases are processed, the
only change is that they are now explicitly identified and reported.
4. The ignored files and directory rules have been slightly updated:
Ignored directories (case insensitive)
--------------------------------------
Any directories with 'closed' or 'old' in the name.
Any '.' hidden subdirectories
Any subdirectories called javadoc, sc, SCCS or uirb
Ignored files (case insensitive)
--------------------------------
Any files starting with ',', '.', 's.' or 'p.;
Any files ending with '~'
Any files starting with '#' and ending with '#'
Any files ending with .closed -closed .old -old .bak -bak .orig -orig
Any files with the suffixes .class .mp3 .o .wav
Other limits that trigger non-publication
-----------------------------------------
Maximum case size: 50Mb
Maximum number of files in a case: 1000
Maximum text file size: 1Mb
Maximum binary file size: 10Mb
Binary files are defined as anything that has a suffix that matches:
7z|bz2|eps|gif|gz|jpeg|jpg|pdf|png|ps|tar|tgz|Z|zip|od[bfgmpst]|ot[ghpst]|st[ciw]|svw|sx[cdgim]
--
Alan Burlison
--