Re: TCK and all that

Stuart Monteith Thu, 29 Oct 2009 11:02:48 -0700

Hi Konstantin,

Thanks for your reply. The existing TCK works by running the configuration
methods on JUnit test cases and then creating a dump. The JUnit test classes
are run again, only within the JUnit framework such that the test methods are
executed this time. The advantage of this system is that the setup and the
checking both exist within the same class file and so consistency is easily
maintained. Now, one of the problems with that approach is that both the
setup and the test cases have to be compiled and run using at least Java 5.0,
and yet we wish to support Java 1.4.2. The other problem is that what we look
for is what we expect from our program's configuration of the JVM, and no
more, so your suggestion for the JVM to introspect itself while running does
potentially produce a more complete test of the API's abilities. The only
frustration is with 1.4.2, which doesn't provide support for
Thread.getAllStackTraces(), JVMTI, and no doubt other things. I think that is
something that should be up for discussion.

Using JVMTI to store a file with the JVM state is an interesting suggestion.
If we take the JVM state and write it out to a file with a JVMTI agent, what
we end up producing might end up being the CJVMTI implementation of Kato that
we have just now, which does raise the question of how we go about testing
that. It is interesting that we could use the JVMTI agent to verify
alternative implementations of the API.

Introspecting within Java could do:
   o Thread.getAllStackTraces()
      as you suggested. Provided the threads we are interested in are
      parked, there should be some consistency, although the time between
      calling getAllStackTraces() and producing a dump could produce
      inconsistencies, which might be just the thread calling
      getAllStackTraces(). Cross-referencing class names in the stack traces
      with their class instances may be very difficult, unless uniqueness
      can be guaranteed. I think JVMTI would be better suited to this.

   o Reflection
      We could retrieve parts of the heap and class hierarchy using
      reflection. However, private fields would be problematic.

   o java.lang.ClassLoader
      The problem with this class is that it doesn't tell you what it
      loaded, and it can only tell you about the parent loader or the
      system classloader. So it doesn't really allow complete knowledge of
      the system.
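
As a rough sketch of the Thread.getAllStackTraces() option, something like the following could record the Java stacks of the threads a test created. ThreadSnapshot, startParked and capture are hypothetical names, not part of Kato, and the sketch assumes the test threads share a name prefix and have unique names. It needs Java 5 or later, so it does nothing for the 1.4.2 case.

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.locks.LockSupport;

/** Hypothetical helper, not part of Kato: records Java stacks of named test threads. */
final class ThreadSnapshot {

    /** Starts a daemon thread that stays parked until interrupted, and waits until it is parked. */
    static Thread startParked(String name) {
        Thread t = new Thread(() -> {
            // park() may return spuriously, so loop until the thread is interrupted.
            while (!Thread.currentThread().isInterrupted()) {
                LockSupport.park();
            }
        }, name);
        t.setDaemon(true);
        t.start();
        // Wait until the thread has actually reached park().
        while (t.getState() != Thread.State.WAITING) {
            Thread.yield();
        }
        return t;
    }

    /** Maps thread name to stack frames for live threads whose name starts with the prefix. */
    static Map<String, StackTraceElement[]> capture(String prefix) {
        // Assumption: test thread names are unique, otherwise later entries overwrite earlier ones.
        Map<String, StackTraceElement[]> byName = new TreeMap<>();
        for (Map.Entry<Thread, StackTraceElement[]> e : Thread.getAllStackTraces().entrySet()) {
            String name = e.getKey().getName();
            if (name.startsWith(prefix)) {
                byName.put(name, e.getValue());
            }
        }
        return byName;
    }

    public static void main(String[] args) {
        Thread parked = startParked("tck-parked-1");
        for (Map.Entry<String, StackTraceElement[]> e : capture("tck-").entrySet()) {
            System.out.println(e.getKey());
            for (StackTraceElement frame : e.getValue()) {
                System.out.println("    at " + frame);
            }
        }
        parked.interrupt();
    }
}
```

Filtering by name keeps the implementation-specific JVM threads out of the record. The frames only carry class names as strings, which is the cross-referencing problem mentioned above.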

There are certain things we can do within Java for the Image API:
   o Image.getHostName(), Image.getIPAddresses()
      could be easily fulfilled with the Java API.
   o Image.getSystemType(), Image.getSystemSubType()
      could be fulfilled using the System properties (e.g. os.name,
      os.arch, os.version). In addition, the files on the system could be
      read to work out what system it is.
   o Image.getInstalledMemory()
      With knowledge of the OS, this might be available through shell
      commands or available in a special file (e.g. /proc/meminfo).
   o Image.getProcessorCount(), getProcessorType(), getProcessorSubType()
      Like the previous entry, from Java we could call getconf, or look
      under /proc/cpuinfo on linux. I assume there are other
   o ImageProcess.getEnvironment()
      Can be comfortably implemented using System.getenv()
   o ImageAddressSpace.getSections()
      All I can think of is the /proc/$$/maps file. Otherwise
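
A minimal sketch of the pure-Java items in the list above, gathering values a test could later compare with what the Image API reports from a dump. HostFacts and the key names are hypothetical, and which value corresponds to which Image method is an assumption here rather than something the specification states. Note that Runtime.availableProcessors() reports the processors available to the JVM, which may differ from the number installed, and that the no-argument System.getenv() was only added in Java 5.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper, not part of Kato: collects host details using only the Java API. */
final class HostFacts {

    static Map<String, String> collect() {
        Map<String, String> facts = new LinkedHashMap<>();
        try {
            facts.put("hostName", InetAddress.getLocalHost().getHostName());
        } catch (UnknownHostException e) {
            // Record the failure instead of aborting the whole collection.
            facts.put("hostName", "<unavailable>");
        }
        facts.put("os.name", System.getProperty("os.name"));
        facts.put("os.arch", System.getProperty("os.arch"));
        facts.put("os.version", System.getProperty("os.version"));
        // Processors available to this JVM, not necessarily all installed processors.
        facts.put("processorCount", Integer.toString(Runtime.getRuntime().availableProcessors()));
        // Number of environment variables visible to this process.
        facts.put("environmentSize", Integer.toString(System.getenv().size()));
        return facts;
    }

    public static void main(String[] args) {
        for (Map.Entry<String, String> e : collect().entrySet()) {
            System.out.println(e.getKey() + "=" + e.getValue());
        }
    }
}
```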

To introspect the system more thoroughly JNI could be used to query the
OS directly:

   o ImageProcess.getID()
      A JNI function to call "getpid()" on UNIX or Linux platforms and
      equivalent elsewhere.


The particularly painful points will be:
   o ImageSection
      In general, I don't see how these can be implemented without
      knowledge from the JVM.
   o ImageThread
      Retrieving these and correlating them with JavaThread instances
      is going to be difficult. A Java thread executing a JNI function
      could correlate a native thread and Java thread, but all of the
      other threads will have difficulty.

The difficulties above will all be shared by the CJVMTI implementation,
of course.

I won't go into more detail, but I think that introspecting from within
Java is not going to get us too far. Using JVMTI should produce more useful
information, and supplementing that with other functions to introspect on
the system itself would also be used by the CJVMTI implementation of the RI.
Of course, we might have two codebases that do much the same thing to check
against one another, so I think that having explicit tests like we do just
now (if only they were working...) would still be prudent to eliminate the
possibility of consistent errors and ensure compliance with the
specification.



Regards,
   Stuart



Bobrovsky, Konstantin S wrote:

Hi Stuart,

one of the approaches to writing TCK tests for the "1.0" mode I can think of
is making the test application retrieve and log the information to be checked
(using standard Java API means) to some "golden" file, then expecting the
TCK test to obtain the same (or "similar") information from a "snapshot" using
the RI API.

In more detail:
There are several agents:
 - KATO implementation to be tested
 - Java Runtime which runs the tests and is coupled with the KATO
   implementation
 - test application (a set of classes run as a sample payload application)
 - TCK test, which can work in 2 modes:
   (1) run the test application, retrieve and log all the information
       to be checked in mode (2) to a "golden file" (should be stable
       between invocations), and have the snapshot generated by some
       means.
   (2) use RI to read the snapshot and retrieve necessary information, then
       compare what's retrieved with the golden file
Each such test is run 2 times: in mode (1) to generate the golden file and
the dump, and in mode (2) when actual API implementation verification
happens.
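
The two modes could be sketched roughly as below. GoldenFile, record and matches are hypothetical names, the one-fact-per-line file format is an assumption, and the comparison takes the strict "same" reading; a "similar" comparison would need looser rules. Reading the snapshot through the RI is left out, so the second argument of matches stands in for whatever the RI returned.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;
import java.util.Collection;
import java.util.TreeSet;

/** Hypothetical helper, not part of Kato: writes and checks a golden file of facts. */
final class GoldenFile {

    /** Mode (1): write the facts, one per line, sorted so the file is stable between runs. */
    static void record(Path golden, Collection<String> facts) {
        TreeSet<String> sorted = new TreeSet<>(facts);
        try {
            Files.write(golden, sorted, StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Mode (2): true if the facts read from the snapshot equal the recorded ones, ignoring order. */
    static boolean matches(Path golden, Collection<String> fromSnapshot) {
        try {
            TreeSet<String> expected = new TreeSet<>(Files.readAllLines(golden, StandardCharsets.UTF_8));
            return expected.equals(new TreeSet<>(fromSnapshot));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Path golden = Paths.get(System.getProperty("java.io.tmpdir"), "tck-golden.txt");
        record(golden, Arrays.asList("thread:tck-parked-2", "thread:tck-parked-1"));
        System.out.println(matches(golden, Arrays.asList("thread:tck-parked-1", "thread:tck-parked-2")));
        golden.toFile().delete();
    }
}
```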

Of course, not the entire API can be tested this way, but a fair part can. The main
question is what kind of information can be obtained in mode (1), which can
later be retrieved from the snapshot. What comes to mind is:
  - all live threads running in the test application and their Java stacks
    using java.lang.Thread.getAllStackTraces. For threads with pre-defined
    names which the test recognizes as "special" ones (and which are
    created by the test application), some field values or other details
    can be queried
  - a subset of live objects created by the test application
  - ... many more if JVMTI or JMX is used as the information provider
    in mode (1)

Thanks,
Konst

Intel Novosibirsk

Closed Joint Stock Company Intel A/O

Registered legal address: Krylatsky Hills Business Park,17 Krylatskaya Str., Bldg 4, Moscow 121614,Russian Federation

-----Original Message-----
From: Stuart Monteith [mailto:stuk...@stoo.me.uk]
Sent: Thursday, October 22, 2009 9:51 PM
To: kato-dev@incubator.apache.org; kato-s...@incubator.apache.org
Subject: TCK and all that

Hi everybody,

   Apologies for the cross posting - but the implementation of the TCK
is relevant to both camps.
One of the things we need to do before making a release available is to
produce a TCK. By "TCK" I mean
a test harness that will:

1. Set up a JVM in a known configuration and generate a dump.
2. Open a dump with the Kato API and run unit tests against it.

Whether or not these are just functional tests or technology compliance
is something I will just gloss over for now
as just a classification issue.

As we all know, there is a problem with optionality in the API. There is
information that will be lost when using
certain dump types.

Steve has suggested two modes for running the TCK.

Legacy
========

This mode runs through the API and confirms that the basic API behaves
as intended.
It checks that the rules of the API are adhered to, but doesn't expect
any method will return any
particular piece of information.

For example:
   List<ImageProcess> ImageAddressSpace.getProcesses();

The returned list would be checked to ensure that it is not null. If it
had contents, they would be queried and their methods executed.

For example:
   String ImageProcess.getID();

Would be expected to return a String or throw a DataUnavailable or
CorruptDataException exception.
The actual contents of the string wouldn't be checked - just so long as
it isn't null.
(incidentally, actually checking the value in this example for
correctness is difficult).
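
A minimal sketch of what a "Legacy" check of getID() might look like. The exception classes and the ImageProcess interface below are simplified stand-ins declared only so that the example compiles on its own; the real Kato types are richer, and LegacyCheck itself is a hypothetical name.

```java
/** Hypothetical sketch of a "Legacy" mode check, using stand-ins for the Kato types. */
final class LegacyCheck {

    /** Stand-in for the API's checked exception of the same name. */
    static class DataUnavailable extends Exception { }

    /** Stand-in for the API's checked exception of the same name. */
    static class CorruptDataException extends Exception { }

    /** Stand-in exposing only the method under test. */
    interface ImageProcess {
        String getID() throws DataUnavailable, CorruptDataException;
    }

    /** True if getID() returns a non-null String or throws one of the two permitted exceptions. */
    static boolean idObeysRules(ImageProcess process) {
        try {
            // The contents are deliberately not inspected, only that something came back.
            return process.getID() != null;
        } catch (DataUnavailable | CorruptDataException e) {
            // Both are acceptable outcomes in this mode.
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println(idObeysRules(() -> "1234"));
        System.out.println(idObeysRules(() -> null));
        System.out.println(idObeysRules(() -> { throw new DataUnavailable(); }));
    }
}
```

Any other exception propagates out of idObeysRules, which a test harness would report as a failure.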

1.0
=====

This is a test for exact compliance with the API. All of the information
that can be retrieved from the dump
must be retrieved and it must all be correct.

With some caveats.

Like with ImageProcess.getID(), there is some information that is poorly
specified or difficult to know in
advance.

The poorly specified aspects of the API are:
   Platform specific items.
      For example:

   Implementation specific items:
      toString() methods - what do they print?

Things that are difficult to know in advance:
   Native stack frames - ordering and naming.
   JavaThreads - there will be threads that are implementation specific.

These problems aren't intractable. A certain amount of flexibility will
be necessary to accommodate variations.
We should expect there to be more than what we expect, so we should just
look for what we expect to be there,
and ignore the rest.

The poorly specified aspects of the API should just be more precisely
specified, although that means the TCK will have to be aware of the
different platforms and their behaviour.

I'm not a great fan of the "Legacy" mode. While it may have a basic use for
testing basic behaviour, I would like there to be a stronger statement
about what we expect the API to do.

For example, if we call JavaRuntime.getThreads(), I would expect a
List<JavaThread> to be returned. However, if it wasn't empty, I would want
the threads we put into the testcase to be there and identifiable by their
(correct) names.

i.e. if there is something to test we should test it.

I call this mode "Permissive" (alternatives: lenient, acquiescent, lax,
liberal, tolerant)

There are, of course, problems. But I would expect that if this mode wasn't
possible or advisable, it would be improbable that a program could be
written to call the API.

The end result is that if you get information out of the API, it must be
correct. Where you don't, it is ignored.
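
The "Permissive" rule for JavaRuntime.getThreads() might be sketched as follows, working on thread names only. PermissiveCheck and missingThreads are hypothetical names, and reducing a JavaThread to its name is a simplification made for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.Collections;
import java.util.List;

/** Hypothetical sketch of a "Permissive" check on thread names. */
final class PermissiveCheck {

    /**
     * Returns the expected names that are absent from the reported ones.
     * An empty report means the information is unavailable, so nothing is flagged.
     * Reported names beyond the expected ones are ignored.
     */
    static List<String> missingThreads(Collection<String> reported, Collection<String> expected) {
        if (reported.isEmpty()) {
            return Collections.emptyList();
        }
        List<String> missing = new ArrayList<>(expected);
        missing.removeAll(reported);
        return missing;
    }

    public static void main(String[] args) {
        List<String> reported = Arrays.asList("main", "Finalizer", "tck-parked-1");
        List<String> expected = Arrays.asList("tck-parked-1", "tck-parked-2");
        System.out.println(missingThreads(reported, expected)); // prints [tck-parked-2]
    }
}
```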

Thoughts?

Regards,
        Stuart

--
Stuart Monteith
http://blog.stoo.me.uk/

