Re: [isabelle-dev] Datatypes Isatest failures

2014-09-29 Thread Makarius

On Thu, 25 Sep 2014, Makarius wrote:

HOL-Proofs is important in various ways: theoretically it opens the 
possibility for independent checking of proofs by a different system, 
and thus raising the level of confidence in Isabelle results; 
practically it indicates how close we are to the next concrete wall of 
resource limits.


Moreover, Poly/ML 5.3.0 is still important, since it has more thorough 
exception trace, which is required in hard cases of ML diagnostics.



We've had such unclear situations occasionally in the past, and usually 
managed to get things under control again, after looking 2 or 3 times 
more closely.  Softening concrete walls has always been a routine trick 
in long-term Isabelle maintenance.


After some experiments this morning, my impression is that there is 
nothing special, just a very old test machine.  I have changed that in 
ab4b94892c4c, so lets hope for the best for tomorrow's isatest run.



Makarius
___
isabelle-dev mailing list
isabelle-...@in.tum.de
https://mailmanbroy.informatik.tu-muenchen.de/mailman/listinfo/isabelle-dev


Re: [isabelle-dev] Datatypes Isatest failures

2014-09-25 Thread Florian Haftmann
 It doesn't seem like subst_tac had anything to do with it. I still have no 
 idea about what made HOL-Proofs slower on at-poly-e beween September 13 
 and 14. The log reveals nothing special, except a truncation at the end. 
 (while processing Predicate.thy). My personal inclination would be to 
 disable this test for this platform -- HOL-Proofs isn't exactly used by 
 many people, and I'm not sure there's much value in running CPUs for 1 hour 
 each night to test it on some slow hardware and old version of Poly/ML (5.3).
 
 I see the following options as relatively painless ways of solving this:
 
 1. Increase the timeout from 3600 s to, e.g. 4800.

Could be.

 2. Make HOL-Proofs and dependencies ISABELLE_FULL_TEST.

Definitely not.  HOL-Proofs is am important indicator whether something
got utterly wrong in handling of thm values in Pure.  (Or would you like
JinjaThreads depend on ISABELLE_FULL_TEST also ;-)?)

 3. Introduce another condition to control whether HOL-Proofs should be 
 built.

The pain of introducting another condition seems bigger to me than to
raise the timeout.

 Does anybody have an opinion?

So, (1) is my opinion.

Florian

-- 

PGP available:
http://home.informatik.tu-muenchen.de/haftmann/pgp/florian_haftmann_at_informatik_tu_muenchen_de



signature.asc
Description: OpenPGP digital signature
___
isabelle-dev mailing list
isabelle-...@in.tum.de
https://mailmanbroy.informatik.tu-muenchen.de/mailman/listinfo/isabelle-dev


Re: [isabelle-dev] Datatypes Isatest failures

2014-09-19 Thread Dmitriy Traytel

On 17.09.2014 11:40, Jasmin Christian Blanchette wrote:

There are only a handful of old_datatypes left in the AFP, and they will go 
away as soon as Dmitriy gets a chance to fix a bug in his code (presumably once he's back 
from vacation).
It is now (isabelle/7b92932ffea5) fixed, and the old_datatypes are 
gone from the AFP (AFP/5bda5332b484).


Dmitriy
___
isabelle-dev mailing list
isabelle-...@in.tum.de
https://mailmanbroy.informatik.tu-muenchen.de/mailman/listinfo/isabelle-dev


Re: [isabelle-dev] Datatypes Isatest failures

2014-09-18 Thread Jasmin Christian Blanchette
Hi again,

Am 17.09.2014 um 11:40 schrieb Jasmin Christian Blanchette 
jasmin.blanche...@gmail.com:

 1. HOL-Proofs times out since September 14 on at-poly-e. On September 13, 
 we had
 
Timing HOL-Proofs (2 threads, 3161.882s elapsed time, 2889.329s cpu time, 
 840.167s GC time, factor 0.91)
Finished HOL-Proofs (0:54:37 elapsed time, 0:48:53 cpu time, factor 0.89)
 
 which is close to the timeout (1 hour, I believe) but not *that* close. 
 Still, the failures on September 14, 15, 16, and 17 have been consistent 
 enough to suggest that change be0f5d8d511b (the only relevant one between 
 Sept. 13 and 14) is the cause, in which case I have a strong suspicion about 
 subst_tac.

It doesn't seem like subst_tac had anything to do with it. I still have no 
idea about what made HOL-Proofs slower on at-poly-e beween September 13 and 
14. The log reveals nothing special, except a truncation at the end. (while 
processing Predicate.thy). My personal inclination would be to disable this 
test for this platform -- HOL-Proofs isn't exactly used by many people, and 
I'm not sure there's much value in running CPUs for 1 hour each night to test 
it on some slow hardware and old version of Poly/ML (5.3).

I see the following options as relatively painless ways of solving this:

1. Increase the timeout from 3600 s to, e.g. 4800.

2. Make HOL-Proofs and dependencies ISABELLE_FULL_TEST.

3. Introduce another condition to control whether HOL-Proofs should be built.

Does anybody have an opinion?

Jasmin

___
isabelle-dev mailing list
isabelle-...@in.tum.de
https://mailmanbroy.informatik.tu-muenchen.de/mailman/listinfo/isabelle-dev


[isabelle-dev] Datatypes Isatest failures

2014-09-17 Thread Jasmin Christian Blanchette
Hi all,

The good news is that the port to the new datatypes have been overall a 
success. There are only a handful of old_datatypes left in the AFP, and they 
will go away as soon as Dmitriy gets a chance to fix a bug in his code 
(presumably once he's back from vacation). Also good news: Timing information 
as I could see it show no big impact on the AFP, e.g. JinjaThread. Although the 
new package is much more efficient in its handling of nested recursion, it is 
generally heavier due to its reliance on local theories and on the BNF 
composition pipeline -- thankfully not too much heavier, as far as I could tell 
so far.

The bad news is that we've been experiencing Isatest failures since then. Let 
me classify them:

1. HOL-Proofs times out since September 14 on at-poly-e. On September 13, 
we had

Timing HOL-Proofs (2 threads, 3161.882s elapsed time, 2889.329s cpu time, 
840.167s GC time, factor 0.91)
Finished HOL-Proofs (0:54:37 elapsed time, 0:48:53 cpu time, factor 0.89)

which is close to the timeout (1 hour, I believe) but not *that* close. Still, 
the failures on September 14, 15, 16, and 17 have been consistent enough to 
suggest that change be0f5d8d511b (the only relevant one between Sept. 13 and 
14) is the cause, in which case I have a strong suspicion about subst_tac. I 
will investigate. Yesterday I tried something, but it had no effect. Due to the 
liveness flavor of the bug, it may still take some rounds to nail it down. 
(Building HOL-Proofs takes 4.5 minutes on my laptop, so it's not as if I 
could easily reproduce the problem locally.)

2. HOL-Datatype_Examples causes random problems on random platforms. Today 
there was a timeout on at-poly-e, which I cannot really explain.

3. Freak errors, e.g. crash in HOL-Decision_Procs. These appear unrelated to 
my changes, and we had those before already.

Bottom line: I'm working on it. If you have ideas/suggestions on how to help 
this converge faster, please let me know.

Jasmin

___
isabelle-dev mailing list
isabelle-...@in.tum.de
https://mailmanbroy.informatik.tu-muenchen.de/mailman/listinfo/isabelle-dev