From: Steve O'Neill <smoneill@sanders.lockheed.com>
Subject: Re: Ariane 5 failure
Date: 1996/10/09
Date: 1996-10-09T00:00:00+00:00 [thread overview]
Message-ID: <325BE79B.7610@sanders.lockheed.com> (raw)
In-Reply-To: 53fhsg$45$1@goanna.cs.rmit.edu.au
@@ robin wrote:
> ---Definitely not. No floating-point overflow occurred. In
> Ariane 5, the overflow occurred on converting a double-precision
> (some 56 bits?) floating-point to a 16-bit integer (15
> significant bits).
>
> That's why it was so important to have a check that the
> conversion couldn't overflow!
> Agreed. Yes, the basic reason for the destruction of a billion dollar
vehicle was for want of a couple of lines of code. But it relects a
systemic problem much more damaging than what language was used.
I would have expected that in a mission/safety critical application
the proper checks would have been implemented, no matter what. And in a
'belts-and-suspenders' mode I would also expect an exception handler to
take care of unforeseen possibilities at the lowest possible level and
raise things to a higher level only when absolutely necessary. Had these
precautions been taken there would probably be lots of entries in an
error log but the satellites would now be orbiting.
As outsiders we can only second guess as to why this approach was not
taken but the review board implies that 1) the SRI software developers
had an 80% max utilization requirement and 2) careful consideration
(including faulty assumptions) was used in deciding what to protect and
not protect.
>It was designed to shut down if any interrupt occurred. It wasn't ^^^^^^^^^ exception, actually
>intended to be shut down for a routine thing as a conversion of
>floating-point to integer.
This was based on the (faulty) system-wide assumption that any exception
was the result of a random hardware failure. This is related to the
other faulty assumption that "software should be considered correct until
is proven to be at fault". But that's what the specification said.
> ---No, the backup SRI experienced the programming error (UNCHECKED
> CONVERSION from floating-point to integer) first, and shut itself
> down, then the active SRI computer experienced the same programming
> error, then it shut itself down.
Yes, according to the report the backup died first (by 0.05 seconds).
Probably not as a result of an unchecked_conversion though - the source
and target are of different sizes which would not be allowed. Most
likely just a conversion of a float to an sixteen-bit integer. This
would have raised a Constraint_Error (or Operand_Error in this
environment). This error could have been handled within the context of
this procedure (and the mission continued) but obviously was not.
Instead it appears to have been propagated to a global exception handler
which performed the specified actions admirably. Unfortunately these
included committing suicide and, in doing so, dooming the mission.
--
Steve O'Neill | "No,no,no, don't tug on that!
Sanders, A Lockheed Martin Company | You never know what it might
smoneill@sanders.lockheed.com | be attached to."
(603) 885-8774 fax: (603) 885-4071| Buckaroo Banzai
next prev parent reply other threads:[~1996-10-09 0:00 UTC|newest]
Thread overview: 105+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <agrapsDy4oJH.29G@netcom.com>
1996-09-25 0:00 ` Ariane 5 failure @@ robin
1996-09-25 0:00 ` Bob Kitzberger
1996-09-26 0:00 ` Ronald Kunne
1996-09-26 0:00 ` Matthew Heaney
1996-09-27 0:00 ` Ronald Kunne
1996-09-27 0:00 ` Lawrence Foard
1996-10-04 0:00 ` @@ robin
1996-09-28 0:00 ` Ken Garlington
1996-09-28 0:00 ` Ken Garlington
1996-09-29 0:00 ` Alan Brain
1996-09-29 0:00 ` Robert A Duff
1996-09-30 0:00 ` Wayne L. Beavers
1996-10-01 0:00 ` Ken Garlington
1996-10-01 0:00 ` Wayne L. Beavers
1996-10-01 0:00 ` Ken Garlington
1996-10-02 0:00 ` Sandy McPherson
1996-10-03 0:00 ` Richard A. O'Keefe
1996-10-01 0:00 ` Ken Garlington
1996-09-27 0:00 ` Wayne Hayes
1996-09-27 0:00 ` Richard Pattis
1996-09-29 0:00 ` Chris McKnight
1996-09-29 0:00 ` Real-world education (was: Ariane 5 failure) Michael Feldman
1996-09-29 0:00 ` Ariane 5 failure Alan Brain
1996-09-29 0:00 ` Dann Corbit
1996-10-01 0:00 ` Ken Garlington
1996-09-28 0:00 ` Ken Garlington
1996-09-27 0:00 ` Alan Brain
1996-09-28 0:00 ` Ken Garlington
1996-09-27 0:00 ` Ken Garlington
1996-09-29 0:00 ` Louis K. Scheffer
1996-09-25 0:00 ` Michel OLAGNON
1996-09-25 0:00 ` Chris Morgan
1996-09-25 0:00 ` Byron Kauffman
1996-09-25 0:00 ` A. Grant
1996-09-25 0:00 ` Ken Garlington
1996-09-26 0:00 ` Byron Kauffman
1996-09-27 0:00 ` A. Grant
1996-09-26 0:00 ` Sandy McPherson
1996-09-27 0:00 ` John McCabe
1996-10-01 0:00 ` Michael Dworetsky
1996-10-04 0:00 ` Steve Bell
1996-10-07 0:00 ` Ken Garlington
1996-10-09 0:00 ` @@ robin
1996-10-09 0:00 ` Steve O'Neill [this message]
1996-10-12 0:00 ` Alan Brain
1996-10-04 0:00 ` @@ robin
1996-10-04 0:00 ` Joseph C Williams
1996-10-06 0:00 ` Wayne Hayes
1996-10-04 0:00 ` Michel OLAGNON
1996-10-09 0:00 ` @@ robin
1996-10-17 0:00 ` Ralf Tilch
1996-10-17 0:00 ` Ravi Sundaram
1996-10-22 0:00 ` shmuel
1996-10-22 0:00 ` Jim Carr
1996-10-24 0:00 ` hayim
1996-10-25 0:00 ` Ken Garlington
1996-10-25 0:00 ` Michel OLAGNON
1996-10-01 0:00 Marin David Condic, 407.796.8997, M/S 731-93
1996-10-02 0:00 ` Alan Brain
1996-10-02 0:00 ` Ken Garlington
1996-10-02 0:00 ` Matthew Heaney
1996-10-04 0:00 ` Robert S. White
1996-10-05 0:00 ` Robert Dewar
1996-10-05 0:00 ` Alan Brain
1996-10-06 0:00 ` Robert S. White
1996-10-03 0:00 ` Alan Brain
1996-10-04 0:00 ` Ken Garlington
-- strict thread matches above, loose matches on Subject: below --
1996-10-01 0:00 Marin David Condic, 407.796.8997, M/S 731-93
1996-10-02 0:00 ` Ken Garlington
1996-10-01 0:00 Marin David Condic, 407.796.8997, M/S 731-93
1996-10-02 0:00 ` Robert I. Eachus
1996-10-02 0:00 ` Ken Garlington
1996-10-02 0:00 ` Matthew Heaney
1996-10-04 0:00 ` Ken Garlington
1996-10-05 0:00 ` Robert Dewar
1996-10-06 0:00 ` Keith Thompson
1996-10-10 0:00 ` Ken Garlington
1996-10-14 0:00 ` Matthew Heaney
1996-10-15 0:00 ` Robert Dewar
1996-10-16 0:00 ` Ken Garlington
1996-10-18 0:00 ` Keith Thompson
1996-10-18 0:00 ` Ken Garlington
1996-10-18 0:00 ` Samuel T. Harris
1996-10-21 0:00 ` Ken Garlington
1996-10-23 0:00 ` robin
1996-10-03 0:00 Marin David Condic, 407.796.8997, M/S 731-93
1996-10-03 0:00 Marin David Condic, 407.796.8997, M/S 731-93
1996-10-03 0:00 Marin David Condic, 407.796.8997, M/S 731-93
1996-10-14 0:00 Marin David Condic, 407.796.8997, M/S 731-93
1996-10-15 0:00 ` Robert I. Eachus
1996-10-15 0:00 ` Robert Dewar
1996-10-16 0:00 ` Michael F Brenner
1996-10-16 0:00 ` Robert Dewar
1996-10-23 0:00 ` robin
1996-10-16 0:00 Marin David Condic, 407.796.8997, M/S 731-93
1996-10-18 0:00 ` Ken Garlington
1996-10-19 0:00 ` Frank Manning
1996-10-21 0:00 ` Norman H. Cohen
1996-10-21 0:00 Marin David Condic, 407.796.8997, M/S 731-93
1996-10-22 0:00 ` Adam Beneschan
1996-10-28 0:00 Marin David Condic, 561.796.8997, M/S 731-93
1996-10-29 0:00 ` Ken Garlington
1996-11-08 0:00 ` robin
1996-10-28 0:00 Marin David Condic, 561.796.8997, M/S 731-93
1996-10-31 0:00 Marin David Condic, 561.796.8997, M/S 731-93
replies disabled
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox