Re: Bytecode generation, Source code mappings, JCov, Future (Patch)

Alex Rau Wed, 30 Apr 2008 10:17:41 -0700

Hi Jim,

thanks for the detailed info. Unfortunately I've not had much timethis week to investigate deeply on your proposal (compiler API /debugger API). Here are some things I came up so far - please correctme in case I got something wrong:

1) The debugger API is based on a design with two virtual machinesinvolved ( the debugger vm and the vm which gets debugged). Whilethis fits perfectly a debugging or profiling scenario where twovirtual machines are always involved this does not properly line upwith my scenario where only one instance of a virtual machine exists.Our software is based on top of a readily available (compiled) build.It performs modifications on the byte code of the build, runs allunit tests and generates xml reports (all done in the mentionedsingle vm in one shot). That's all. A second vm is just not existingand would mean much more overhead to our design just for gettingcolumn information.

2) I could not yet find my way through the compiler and debugger APIfrom a technical point f view to really have the column informationin the end. I've already had a look on the netbeans sources and(probably) found the right code location but I have to investigate onthat in more detail. However this indicates somehow that it's gettingmuch more tricky compared to the variant where the compiler itselfoutputs the column information into the byte code via additionalattributes. A question here: is is necessary to recompile on the flyduring debugging to get the line/column information ? If yes thenthis would make it even more difficult and would mean that we have tosupport an additional compilation process while up to now we strictlyrely on already performed compilations. We work on byte codeexclusively and the sources are only required for the report generation.

3) I think that line numbers and column information are actually"attributes" of the compiler ( result ) in a broader sense. It alwaysdepends on the compiler what values these attributes will have.Compared to for example a duration of a method invocation (profiling)or a certain value of a variable (debugging) the latter are *always*runtime-dependent values. What I'd like to say is: there are static( runtime-independent, "compiler only"-dependent ) attributes (lineand column info) and dynamic attributes ( runtime and executiondependent ) attributes (invocation duration, variable value). I see a"natural" separation between those where static attributes should bestored statically (e.g. in the byte code) and dynamic attributesshould be accessible dynamically (like the debugger API allows). Thisdoes imply as well that while we are interested in static attributesof the compiler it's really not necessary to reread these attributeswith every modification on bytecode level. Having these informationat a single point of time (after the compilation is finished) istotally sufficient compared to getting the information during runtimeevery time.

It looks to me that what I want to achieve belongs more to thecompiler than somewhere else. Any comments ?



Best regards,

Alex


On 24.04.2008, at 04:53, Jim Holmlund wrote:

Just to summarize:
- jcov is an internal to Sun tool.
- to support jcov, a .class file attribute called theCharacterRangeTable attribute wasdefined and javac was changed to output it in response to the -Xjcov(I think) command line option:
CharacterRangeTable_attribute {
u2 attribute_name_index;
u4 attribute_length;
u2 character_range_table_length;
{ u2 start_pc;
u2 end_pc;
u4 character_range_start;
u4 character_range_end;
u2 flags;
} character_range_table[character_range_table_length];
}
The 'flags' field item describes the kind of range, eg statement,block, assignment,
flow_controller ..

- the CharacterRangeTable was never added to the VM Spec.
- jcov used the old JVMPI. Robert rewrote it to do byte codeinstrumentation
via java.lang.instrument. It still uses the CharacterRangeTable.
As Robert mentioned, we have had requests from debuggers to includethis kind of info in the .class file, for example to allow steppingthru terms of an expression, multiple statements on one line, etc.We planned to do something for this in JDK 6, eg, formalize theCharacterRangeTable attribute by adding it to the definition of theclass file in the VM spec, and add functionality to JVM TI, JDWP,and JDI to allow debuggers to access this information.
When Peter von der Ahé heard about this, he suggested that we notdo this and instead proposed a solution that required no changes tobe made to the JDK. His idea was that an IDE has the source codefor a file in which fine grained stepping is desired, and the IDEcan get the bytecodes from the debuggee VM via JDI (Method.bytecodes()). The IDE can then use the compiler APIs introduced in JDK 6
http://www.artima.com/lejava/articles/compiler_api.html
to match the source code to the bytecodes to find the bytecodesthat correspond to source constructs of interest. This idea wasinvestigated by the NetBeans debugger team and found to beeffective, so it was implemented as the 'expression stepping'feature in NetBeans 6.0:
http://www.netbeans.org/features/java/debugger.html
So, we ended up not needing character offset information in JPDAand so we didn't add the CharacterRangeTable attribute to the VMspec. Adding thisinformation to JPDA would be very low on our listof things to do, unless
some needs arise that can't be handled by Peter's technique.
I wonder if Alex could also use Peter's idea. Alex did mention thatthe tools he is interested
in normally have the source code available so maybe he could.

- jjh

Jonathan Gibbons wrote:
Hi Serviceability folk,
The Subject line is from a thread on the compiler-dev list. Youmight be interested to check it out here:http://mail.openjdk.java.net/pipermail/compiler-dev/2008-April/thread.html#300
The thread concerns an interest in improving the information aboutsource location generated by the compiler, javac, and morespecifically, increasing the resolution of the info from line-based coordinates to source-based coordinates. The submitter isalso talking about using side files for the info, which (if Irecall correctly) I have heard folk such as Jim discuss before now.
What would be the interest from the serviceability group about anysuch work? Is it "on your radar", "sometime eventually", or "it'llnever happen"? :-)
-- Jon
P.S. Warning: the submitter has provided a patch on the compiler-dev thread but has not yet signed the SCA.

Re: Bytecode generation, Source code mappings, JCov, Future (Patch)

Reply via email to