From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=ham autolearn_force=no version=3.4.4 X-Google-Thread: 103376,7272aa7508a3d83f X-Google-Attributes: gid103376,public X-Google-Language: ENGLISH,ASCII-7-bit Path: g2news1.google.com!news2.google.com!proxad.net!news.cs.univ-paris8.fr!fr.ip.ndsoftware.net!62.253.162.218.MISMATCH!news-in.ntli.net!newsrout1-win.ntli.net!ntli.net!newspeer1-win.ntli.net!newsfe2-win.ntli.net.POSTED!53ab2750!not-for-mail From: "Dr. Adrian Wrigley" Subject: Re: pointer questions User-Agent: Pan/0.14.2 (This is not a psychotic episode. It's a cleansing moment of clarity.) Message-Id: Newsgroups: comp.lang.ada References: <19cfb$4361207d$4995001$19541@ALLTEL.NET> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8bit Date: Wed, 02 Nov 2005 00:52:16 GMT NNTP-Posting-Host: 80.4.127.115 X-Complaints-To: http://www.ntlworld.com/netreport X-Trace: newsfe2-win.ntli.net 1130892736 80.4.127.115 (Wed, 02 Nov 2005 00:52:16 GMT) NNTP-Posting-Date: Wed, 02 Nov 2005 00:52:16 GMT Organization: ntl Cablemodem News Service Xref: g2news1.google.com comp.lang.ada:6102 Date: 2005-11-02T00:52:16+00:00 List-Id: On Mon, 31 Oct 2005 06:21:37 +0000, Jeffrey R. Carter wrote: > Robert A Duff wrote: > >> That's hard to believe. Dr. Wrigley said the hardware failures turned >> from "insideous" to "catastophic" when he changed some sort of Things to >> pointers-to-Things. I take that to mean, he got wrong answers before, >> and crashes after. Is that right, Dr. Wrigley? I Thinks so, but the error rate was very low, so it is hard to tell. > Thinking about it a little more, I get the idea that before he had 1 pointer to > a 1 GB data structure on the heap. Then he changed to a large number of pointers > to smaller data structures on the heap, equivalent to the previous 1 GB structure. This is exactly the situation. The "Things" were about 36 bytes each and I changed to having around 20 million pointers to things. Single bit errors in pointers had a much more significant effect than single bit errors in "Things" (which tended to be ignored for various reasons). > With more pointers, there's a greater chance that one of the pointers might have > a bit flip, resulting in the occasional crash (1 crash every few months, IIRC). Yes. > Of course, this is sheer speculation. well speculated! (was it so unclear?) Another "feature" I observed was that files could stay cached by the OS for months, and accumulate the occasional single-bit error. But when you evict the cached pages and read the data again, the errors disappear. Plenty of scope for very rare Heisenbugs. The warning I give is that these error rates are "normal" for modern SDRAM, but aren't usually noticed because they usually only show up if you have several GB of memory and care about every bit 24/7. Operating a financial business, I care about this! -- Adrian