Re: [HACKYSTAT-DEV-L] Size Summit Results

Aaron Kagawa Sat, 27 Aug 2005 01:44:44 -0700

Sounds great. Although, I think the sensors should know if they are counting lines of code from a "master build".

The current problem with our FileMetric data and other SDTs that can be sent via a build (e.g., UnitTest, Coverage, Dependency) is that it is hard to determine which snapshot is from the "master build". Currently, our mechanism is to find the last batch of sensor data for a specific day and use that set in the DailyProjectData. In CSDL's case, it just so happens that no one other than hackystat-l runs the loccAll target. So, it works out pretty well in our environment. In other organizations, multiple developers could be running loccAll on different configurations of the system.

Anyway, this problem would be easily solved if the "snapshot" sensors send a masterBuild=true property in the propertyList. So, the DailyProjectData would check for the last batch of sensor data with 'masterBuild=true'. If there are no batches with masterBuild=true, then revert to our old mechanism.
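
As a rough sketch of that fallback rule (in Python for brevity; the Batch class, its fields, and select_snapshot are hypothetical names for illustration, not the actual Hackystat API):

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Batch:
    """Hypothetical stand-in for one batch of snapshot sensor data."""
    tstamp: int                                   # when the batch was sent
    properties: Dict[str, str] = field(default_factory=dict)  # the propertyList


def select_snapshot(batches: List[Batch]) -> Optional[Batch]:
    """Pick the batch that DailyProjectData would use for one day."""
    if not batches:
        return None
    ordered = sorted(batches, key=lambda b: b.tstamp)
    master = [b for b in ordered if b.properties.get("masterBuild") == "true"]
    # Prefer the last master-build batch; if none was sent that day,
    # revert to the old mechanism (the last batch of the day).
    return master[-1] if master else ordered[-1]
```

With this rule, a developer running loccAll late in the day would no longer displace an earlier master-build snapshot.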


good idea? or not?

thanks, aaron

At 04:40 PM 8/26/2005, you wrote:

Mike, Cedric, Cam, and I met for the Hackystat Size summit today. Here's a summary of our results:
* When thinking about size metrics in Hackystat, there are a variety of levels upon which to ponder, including:
  - Size tools. Currently or soon-to-be supported tools include:
      LOCC (Mike). Java grammar-based parsing, provides 'sophisticated' size counts and OO metrics. C++ size support, minimal size data (comment/noncomment lines).
      SCLC (Cedric). Over a dozen languages, minimal size data (comment/noncomment lines).
      Cam's Parser. C++, does minimal size plus number of functions in the file.
  - Size sensors. We currently have a sensor for LOCC only. Goal: a generic FileMetric sensor.
  - FileMetric sensor data type. The current FileMetric Sensor data needs a redesign. See below.
  - Server analyses, including DailyAnalysis, DailyProjectData, and Reduction functions. These tend to be too Java-specific. See below.

The delegates to the summit came to the following conclusions:
1. With the advent of evolutionary sensor data types, we will be able to redesign the FileMetric SDT. Our proposed new structure is:

 Required attributes        Selected optional (plist) attributes
                            (used by public analyses)
 -------------------        ------------------------------------
 tstamp                     sourceLines
 tool                       commentLines
 fileName                   functionCount
 fileType                   functionSizeList
 nonblankLines              lastMod
 totalLines                 className

This new structure satisfies the following requirements:
- The required attributes provide a minimally useful 'default' size metric for any kind of file-based source data.
- nonblankLines can default to totalLines if the counting tool cannot determine it.
- fileType is determined by the sensor (so that it can, say, use the first line to detect shell script types).
- Optional attributes support the basic programming language units and metrics.
- lastMod is useful on the client side to eliminate redundant FileMetric entries.
- className is useful in Java for tying Unit Test data without a file but with a class name to the fileName.
- Other optional attributes can be provided but probably won't be supported by public Hackystat analyses.
2. We will publish the list of fileType identifiers used by our sensors as part of the FileMetric SDT documentation so that other sensor writers can employ them if they desire.

3. Analyses will attempt to avoid hard-coding fileType-specific analyses.
4. Rather than provide a custom sensor for each size tool considered in this analysis, we instead agreed upon a common XML format that all tools will produce. Then, a single generic FileMetric sensor can be used to read in this common XML and send it to the server. The format is basically:
<filemetrics>
  <filemetric tstamp="" tool="" fileName="" fileType="" nonblankLines="" totalLines="" sourceLines="" .../>
  <filemetric .../>
</filemetrics>
All of the required attributes should appear in each <filemetric> entry, and zero or more of the optional ones can appear. Sensors are, of course, free to implement their own optional attributes, but they will require additional analysis capabilities to be written.
5. The steps in carrying out this redesign are:
  a. Philip finishes evolutionary sensor data types.
  b. FileMetric data is evolved to this new format.
  c. FileMetric sensor supporting common XML format is implemented.
  d. Size tools are modified to produce the common XML format.
  e. Analyses are modified to operate on new FileMetric format.
Volunteers for steps (b) through (e) will be solicited following the conclusion of step (a).
6. As a side note, one delegate to the summit (Cam) complained vociferously about the bogosity of current UnitTest analyses, which are totally Java and CSDL specific, and effectively require each unit test to have an associated FileMetric entry identifying the fileName associated with the test. The other delegates responded with thundering applause to this denouncement. Be it resolved: As soon as we clean up size, we are going to move on to unit testing!
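
For item 4, here is a minimal sketch of how a generic FileMetric sensor might read the common XML, check the required attributes, and apply the nonblankLines default from item 1. It is written in Python purely for illustration; read_filemetrics and the error handling are assumptions, not the actual sensor design, and treating a missing or empty nonblankLines as "could not be determined" is also an assumption:

```python
import xml.etree.ElementTree as ET

# Required attributes as listed in the proposed FileMetric structure.
REQUIRED = ("tstamp", "tool", "fileName", "fileType",
            "nonblankLines", "totalLines")


def read_filemetrics(xml_text):
    """Parse the common XML format into a list of attribute dicts."""
    entries = []
    for elem in ET.fromstring(xml_text).iter("filemetric"):
        attrs = dict(elem.attrib)
        # Assumption: a tool that cannot count nonblank lines omits the
        # attribute or leaves it empty, so default it to totalLines.
        if not attrs.get("nonblankLines"):
            attrs["nonblankLines"] = attrs.get("totalLines", "")
        missing = [name for name in REQUIRED if not attrs.get(name)]
        if missing:
            raise ValueError(
                f"filemetric entry missing required attributes: {missing}")
        # Optional attributes (sourceLines, className, ...) pass through as-is.
        entries.append(attrs)
    return entries
```

Because optional attributes are passed through untouched, a tool could add its own attributes without any change to a sensor written this way; only the analyses would need to learn about them.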
Submitted for your approval.

Philip
Secretary, Hackystat Size Summit
August 2005
