Re: z/Architecture design errors

2006-07-21 Thread Edward Jaffe

Joel C. Ewing wrote:
I think you mis-interpreted John's remarks.  He didn't say anything 
about mainframe developers extensively writing and testing microcode 
instructions (which is not a documented user interface), but instead 
extensively writing and testing z-Architecture assembler code (with 
the High Level Assembler, which uses instructions documented in the 
z/OS Principles of Operation, which IS a documented user interface).  
Outside of IBM, microcode assemblers might be used in a limited 
educational setting in conjunction with emulated execution to 
demonstrate principles of microcode development - they would not be 
used to generate and modify microcode on an actual IBM mainframe 
processor.


Microcode???!! I suggest you read this article: 
http://researchweb.watson.ibm.com/journal/rd/483/heller.html


Then, with your enhanced understanding of how modern mainframes work, go 
back and re-read the posts in this thread.


--
Edward E Jaffe
Phoenix Software International, Inc
5200 W Century Blvd, Suite 800
Los Angeles, CA 90045
310-338-0400 x318
[EMAIL PROTECTED]
http://www.phoenixsoftware.com/

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: z/Architecture design errors

2006-07-21 Thread Shmuel Metz (Seymour J.)
In [EMAIL PROTECTED], on 07/20/2006
   at 03:39 PM, john gilmore [EMAIL PROTECTED] said:

There are two ways to deal with errors in this business.  

Three.

One is to try to  keep them secret, fixing them under the covers.  
The other is to call a  shovel a spade as quickly as it has been 
identified in order to turn one's  back upon it as quickly as 
possible.

The third way is to deny that there is an error for as long as
possible so as to avoid the expense of fixing it. Intel may not have
invented the third way, but they sure got a lot fo publicity for
implementing it :-(
 
-- 
 Shmuel (Seymour J.) Metz, SysProg and JOAT
 ISO position; see http://patriot.net/~shmuel/resume/brief.html 
We don't care. We don't have to care, we're Congress.
(S877: The Shut up and Eat Your spam act of 2003)

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: z/Architecture design errors

2006-07-21 Thread Vernooy, C.P. - SPLXM
Edward Jaffe [EMAIL PROTECTED] wrote in message
news:[EMAIL PROTECTED]...
  demonstrate principles of microcode development - they would not be 
  used to generate and modify microcode on an actual IBM mainframe 
  processor.
 
 Microcode???!! I suggest you read this article: 
 http://researchweb.watson.ibm.com/journal/rd/483/heller.html
 
 Then, with your enhanced understanding of how modern mainframes work,
go 
 back and re-read the posts in this thread.
 
 -- 
 Edward E Jaffe

Gee, the time that a processor existed of a large number of transistors,
efficiently bundled into a chip, is really ages ago. Interesting
article!!!

Kees.


**
For information, services and offers, please visit our web site: 
http://www.klm.com. This e-mail and any attachment may contain confidential and 
privileged material intended for the addressee only. If you are not the 
addressee, you are notified that no part of the e-mail or any attachment may be 
disclosed, copied or distributed, and that any other action related to this 
e-mail or attachment is strictly prohibited, and may be unlawful. If you have 
received this e-mail by error, please notify the sender immediately by return 
e-mail, and delete this message. Koninklijke Luchtvaart Maatschappij NV (KLM), 
its subsidiaries and/or its employees shall not be liable for the incorrect or 
incomplete transmission of this e-mail or any attachments, nor responsible for 
any delay in receipt.
**


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: z/Architecture design errors

2006-07-21 Thread Edward Jaffe

Vernooy, C.P. - SPLXM wrote:

Gee, the time that a processor [consi]sted of a large number of transistors,
efficiently bundled into a chip, is really ages ago. Interesting
article!!!
  


You're obviously an incredibly fast reader!

--
Edward E Jaffe
Phoenix Software International, Inc
5200 W Century Blvd, Suite 800
Los Angeles, CA 90045
310-338-0400 x318
[EMAIL PROTECTED]
http://www.phoenixsoftware.com/

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: z/Architecture design errors

2006-07-21 Thread Shmuel Metz (Seymour J.)
In [EMAIL PROTECTED], on 07/20/2006
   at 10:59 PM, Joel C. Ewing [EMAIL PROTECTED] said:

In the very unlikely event that some instructions documented in the
PoOp  are found not to perform as advertised, IBM would be compelled
to fix  the broken microcode so that they do.

No. Current implementations have 3 levels, not just two, and IBM can
replace broken microcode with millicode.
 
-- 
 Shmuel (Seymour J.) Metz, SysProg and JOAT
 ISO position; see http://patriot.net/~shmuel/resume/brief.html 
We don't care. We don't have to care, we're Congress.
(S877: The Shut up and Eat Your spam act of 2003)

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: z/Architecture design errors

2006-07-21 Thread Shmuel Metz (Seymour J.)
In
[EMAIL PROTECTED],
on 07/21/2006
   at 04:33 PM, Vernooy, C.P. - SPLXM [EMAIL PROTECTED] said:

Gee, the time that a processor existed of a large number of
transistors, efficiently bundled into a chip, is really ages ago.

Chip? That was new. Before that they were discrete components
connected with wires and traces on PC boards. There was a time when
they didn't even have the PC boards.
 
-- 
 Shmuel (Seymour J.) Metz, SysProg and JOAT
 ISO position; see http://patriot.net/~shmuel/resume/brief.html 
We don't care. We don't have to care, we're Congress.
(S877: The Shut up and Eat Your spam act of 2003)

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


z/Architecture design errors

2006-07-20 Thread john gilmore

Michael Kuredjian writes

| How do we know the number of hardware design errors? With IA32, it's 
easier to

| discover these problems because the CPU is used by many people under many
| operating systems. IBM designs the OS and CPU, making it much easier to 
cover up

| any problems that do exist.

IBM does design both, but many others write assembly-language code that 
exercises these instructions vigorously.  Microprocessor assemblers are 
toys, designed (to quote myself) to discourage their use.  The IBM HLASM is 
a very different, heavily used animal.


Mr. Kuredjian's sophomore-cynical comment is, however, wide of the mark in 
another way.


There are two ways to deal with errors in this business.  One is to try to 
keep them secret, fixing them under the covers.  The other is to call a 
shovel a spade as quickly as it has been identified in order to turn one's 
back upon it as quickly as possible.


IBM does the second.  The trouble with the first approach is that when, 
inevitably, dissimulation is detected, it becomes a cause celebre.  Even 
Microsoft learned, after a time, that candor about the security deficiencies 
of Windows was the only feasible approach.  It now has its hand held in the 
fire much more briefly than used to be the case.


It is perhaps also worth noting that, while software errors are expensive, 
hardware errors are even more expensive and much more embarassing.  It is 
much cheaper to find them before they get into silicon.


John Gilmore
Ashland, MA 01721-1817
USA

_
Don’t just search. Find. Check out the new MSN Search! 
http://search.msn.click-url.com/go/onm00200636ave/direct/01/


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: z/Architecture design errors

2006-07-20 Thread Kuredjian, Michael
I didn't see much cynicism in my comment, although that may be the result of 
being jaded by my experience with PC manufacturers and their reluctance to 
admit and correct problems. I'm very used to both hardware and software 
manufacturers ignoring obvious problems in their products. 

I may have mis-used the term cover-up. What I meant was that they[IBM] could 
release software patches that specifically avoid making use of broken circuits 
in silicon. However, I wasn't aware that mainframe developers routinely make 
use of micro-assembly instructions, thereby revealing hardware bugs quickly. 

-Original Message-
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED]
Behalf Of john gilmore
Sent: Thursday, July 20, 2006 11:40 AM
To: IBM-MAIN@BAMA.UA.EDU
Subject: z/Architecture design errors


Michael Kuredjian writes

| How do we know the number of hardware design errors? With IA32, it's 
easier to
| discover these problems because the CPU is used by many people under many
| operating systems. IBM designs the OS and CPU, making it much easier to 
cover up
| any problems that do exist.

IBM does design both, but many others write assembly-language code that 
exercises these instructions vigorously.  Microprocessor assemblers are 
toys, designed (to quote myself) to discourage their use.  The IBM HLASM is 
a very different, heavily used animal.

Mr. Kuredjian's sophomore-cynical comment is, however, wide of the mark in 
another way.

There are two ways to deal with errors in this business.  One is to try to 
keep them secret, fixing them under the covers.  The other is to call a 
shovel a spade as quickly as it has been identified in order to turn one's 
back upon it as quickly as possible.

IBM does the second.  The trouble with the first approach is that when, 
inevitably, dissimulation is detected, it becomes a cause celebre.  Even 
Microsoft learned, after a time, that candor about the security deficiencies 
of Windows was the only feasible approach.  It now has its hand held in the 
fire much more briefly than used to be the case.

It is perhaps also worth noting that, while software errors are expensive, 
hardware errors are even more expensive and much more embarassing.  It is 
much cheaper to find them before they get into silicon.

John Gilmore
Ashland, MA 01721-1817
USA

_
Don't just search. Find. Check out the new MSN Search! 
http://search.msn.click-url.com/go/onm00200636ave/direct/01/

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: z/Architecture design errors

2006-07-20 Thread Edward Jaffe

Kuredjian, Michael wrote:

I may have mis-used the term cover-up. What I meant was that they[IBM] could 
release software patches that specifically avoid making use of broken circuits in 
silicon. However, I wasn't aware that mainframe developers routinely make use of 
micro-assembly instructions, thereby revealing hardware bugs quickly.
  


Fixing broken instructions is one potential use of millicode. If the 
broken instruction is already implemented in millicode, the millicode 
is simply corrected. If it's implemented in hardware, millicode can 
intercept its execution and make changes as appropriate. A millicode 
intercept will usually slow the instruction way down. Better it runs 
slowly and correctly than quickly and incorrectly.


--
Edward E Jaffe
Phoenix Software International, Inc
5200 W Century Blvd, Suite 800
Los Angeles, CA 90045
310-338-0400 x318
[EMAIL PROTECTED]
http://www.phoenixsoftware.com/

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: z/Architecture design errors

2006-07-20 Thread Tom Marchant
The mainframe is very different from the PC in this regard.  You might find 
the chapter on Machine Check Handling in the POO to be interesting.  
Mainframes are designed with extensive error detection and recovery 
capabilities.  z/OS records error information to LOGREC to assist in fixing 
problems.

For several years, every CPU chip has been designed with two processors.  
They both process the same instruction stream against the same data, and 
the results are compared.  If there is a discrepency anywhere along the 
way, the instruction is aborted and retried.  If it fails again, the CPU is 
switched out and a spare is switched in to restart the instruction stream.

In a way, this could be considered a cover-up, but I think not the way 
you meant it.  The program is protected from most hardware failures.

Tom Marchant

On Thu, 20 Jul 2006 13:11:38 -0400, Kuredjian, Michael 
[EMAIL PROTECTED] wrote:

I didn't see much cynicism in my comment, although that may be the
result of being jaded by my experience with PC manufacturers and
their reluctance to admit and correct problems. I'm very used to
both hardware and software manufacturers ignoring obvious problems
in their products.

I may have mis-used the term cover-up. What I meant was that
they[IBM] could release software patches that specifically avoid
making use of broken circuits in silicon. However, I wasn't aware
that mainframe developers routinely make use of micro-assembly
instructions, thereby revealing hardware bugs quickly.

-Original Message-
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED]
Behalf Of john gilmore
Sent: Thursday, July 20, 2006 11:40 AM
To: IBM-MAIN@BAMA.UA.EDU
Subject: z/Architecture design errors


Michael Kuredjian writes

| How do we know the number of hardware design errors? With IA32, it's
easier to
| discover these problems because the CPU is used by many people under many
| operating systems. IBM designs the OS and CPU, making it much easier to
cover up
| any problems that do exist.

IBM does design both, but many others write assembly-language code that
exercises these instructions vigorously.  Microprocessor assemblers are
toys, designed (to quote myself) to discourage their use.  The IBM HLASM is
a very different, heavily used animal.

Mr. Kuredjian's sophomore-cynical comment is, however, wide of the mark in
another way.

There are two ways to deal with errors in this business.  One is to try to
keep them secret, fixing them under the covers.  The other is to call a
shovel a spade as quickly as it has been identified in order to turn one's
back upon it as quickly as possible.

IBM does the second.  The trouble with the first approach is that when,
inevitably, dissimulation is detected, it becomes a cause celebre.  Even
Microsoft learned, after a time, that candor about the security 
deficiencies
of Windows was the only feasible approach.  It now has its hand held in the
fire much more briefly than used to be the case.

It is perhaps also worth noting that, while software errors are expensive,
hardware errors are even more expensive and much more embarassing.  It is
much cheaper to find them before they get into silicon.


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: z/Architecture design errors

2006-07-20 Thread Joel C. Ewing
I think you mis-interpreted John's remarks.  He didn't say anything 
about mainframe developers extensively writing and testing microcode 
instructions (which is not a documented user interface), but instead 
extensively writing and testing z-Architecture assembler code (with the 
High Level Assembler, which uses instructions documented in the z/OS 
Principles of Operation, which IS a documented user interface).  Outside 
of IBM, microcode assemblers might be used in a limited educational 
setting in conjunction with emulated execution to demonstrate principles 
of microcode development - they would not be used to generate and modify 
microcode on an actual IBM mainframe processor.


As for bypassing broken silicon, IBM over the years has put much 
effort into making their mainframe hardware able to detect and isolate 
failing hardware components.  At least since introduction of 9672 CMOS 
mainframes, there are even spare processors that can be used to 
replace a failed processor.  This diagnostic and sparing capability is 
built into the existing microcode.


In the very unlikely event that some instructions documented in the PoOp 
are found not to perform as advertised, IBM would be compelled to fix 
the broken microcode so that they do.  Eliminating a failing 
z-architecture instruction from software is not an option, because 
non-IBM software outside IBM's control could be dependent on any 
behavior that is documented in the PoOp.


Users do not have access to IBM's architecture-defining microcode, and
leasing/licensing agreements precludes user access to that microcode;
but, IBM can download microcode fixes fairly easily for installation by
a local IBM Customer Engineer.  Mainframe processors load the defining
microcode into the processors at every Power-on Reset, so updating the
microcode for fixes or enhancements is not as drastic as on a PC, where 
the microcode is typically frozen into a physical chip.


Software patches and enhancements to IBM's mainframe Operating System 
and support programs, which utilize the z-architecture machine 
instructions that are documented in the Principles of Operations, are 
installed by the installation's own System Programmers.  These patches 
have nothing to do with bugs at the microcode or physical hardware 
level, which are fixed by IBM Customer Engineers.


Over the last 25 years, many of the mainframe microcode patches I've 
seen have been for enhancements, and most of the remaining bug fixes 
were for failures that could only occur under an unlikely sequence of 
events.  Over that same 25 years the number of mainframe system failures 
I've seen that directly relate to mainframe microcode bugs is less than 
five, and in such cases it was not uncommon that the bug exposure had 
been present for many months and kazillions of instruction executions 
before we just happened to hit a triggering sequence of events.


I would agree with John about IBM's responsiveness to such issues.  IBM 
regards any failure that takes down  mainframe hardware or mainframe 
operating systems as a serious event and has effective procedures in 
place for reporting, diagnosing, and resolving such issues. This is one 
of those areas in which mainframe support is orders of magnitude better 
than what one finds in the PC world.  To be fair to PCs, they do have a 
much more difficult problem - there is only marginal centralized control 
for hardware architecture; or, considering requirements for OEM device 
drivers, for even the core parts of the PC Operating Systems.  With the 
relative chaos that reins in both of these areas, its amazing PCs run as 
well as they do.


Kuredjian, Michael wrote:
I didn't see much cynicism in my comment, although that may be the result of being jaded by my experience with PC manufacturers and their reluctance to admit and correct problems. I'm very used to both hardware and software manufacturers ignoring obvious problems in their products. 

I may have mis-used the term cover-up. What I meant was that they[IBM] could release software patches that specifically avoid making use of broken circuits in silicon. However, I wasn't aware that mainframe developers routinely make use of micro-assembly instructions, thereby revealing hardware bugs quickly. 


-Original Message-
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED]
Behalf Of john gilmore
Sent: Thursday, July 20, 2006 11:40 AM
To: IBM-MAIN@BAMA.UA.EDU
Subject: z/Architecture design errors


Michael Kuredjian writes

| How do we know the number of hardware design errors? With IA32, it's 
easier to

| discover these problems because the CPU is used by many people under many
| operating systems. IBM designs the OS and CPU, making it much easier to 
cover up

| any problems that do exist.

IBM does design both, but many others write assembly-language code that 
exercises these instructions vigorously.  Microprocessor assemblers are 
toys, designed (to quote myself) to discourage their use