From mboxrd@z Thu Jan  1 00:00:00 1970
X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me
X-Spam-Level: 
X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,FREEMAIL_FROM
	autolearn=ham autolearn_force=no version=3.4.4
X-Google-Thread: a07f3367d7,56525db28240414a
X-Google-Attributes: gida07f3367d7,public,usenet
X-Google-NewGroupId: yes
X-Google-Language: ENGLISH,ASCII-7-bit
Received: by 10.66.85.231 with SMTP id k7mr991826paz.38.1343206997625;
        Wed, 25 Jul 2012 02:03:17 -0700 (PDT)
Path: 
 b9ni52930305pbl.0!nntp.google.com!npeer02.iad.highwinds-media.com!news.highwinds-media.com!feed-me.highwinds-media.com!postnews.google.com!glegroupsg2000goo.googlegroups.com!not-for-mail
From: Keean Schupke <keean.schupke@googlemail.com>
Newsgroups: comp.lang.ada
Subject: Re: Efficient Sequential Access to Arrays
Date: Wed, 25 Jul 2012 02:03:15 -0700 (PDT)
Organization: http://groups.google.com
Message-ID: <f2f61da1-3139-4dfc-b851-8398f27d558d@googlegroups.com>
References: <01983f1c-f842-4b1f-a180-bcef531dad4c@googlegroups.com>
 <87ipdf4vh6.fsf@mid.deneb.enyo.de> <a72rt6Fq5sU1@mid.individual.net>
 <4ce44d2d-d789-42a0-a6ed-035f7f8d58be@googlegroups.com>
 <a79kcjFufvU1@mid.individual.net>
NNTP-Posting-Host: 82.44.19.199
Mime-Version: 1.0
X-Trace: posting.google.com 1343206997 8299 127.0.0.1 (25 Jul 2012 09:03:17
 GMT)
X-Complaints-To: groups-abuse@google.com
NNTP-Posting-Date: Wed, 25 Jul 2012 09:03:17 +0000 (UTC)
In-Reply-To: <a79kcjFufvU1@mid.individual.net>
Complaints-To: groups-abuse@google.com
Injection-Info: glegroupsg2000goo.googlegroups.com; posting-host=82.44.19.199;
 posting-account=T5Z2vAoAAAB8ExE3yV3f56dVATtEMNcM
User-Agent: G2/1.0
X-Received-Bytes: 6434
Content-Type: text/plain; charset=ISO-8859-1
Date: 2012-07-25T02:03:15-07:00
List-Id: <comp.lang.ada>

On Wednesday, 25 July 2012 08:09:09 UTC+1, Niklas Holsti  wrote:
> On 12-07-24 19:00 , robin vowels wrote:
> &gt; On Monday, 23 July 2012 03:34:26 UTC+10, Niklas Holsti  wrote:
> &gt;&gt; On 12-07-22 19:21 , Florian Weimer wrote:
> &gt;&gt; &amp;gt; * Keean Schupke:
> &gt;&gt; &amp;gt;
> &gt;&gt; &amp;gt;&amp;gt; So assuming we need this level of performance, what would be the
> &gt;&gt; &amp;gt;&amp;gt; best (and most idiomatic Ada) way to package this type of usage
> &gt;&gt; &amp;gt;&amp;gt; pattern as an abstract datatype?
> &gt;&gt; &amp;gt;
> &gt;&gt; &amp;gt; You should look at compiler dumps to figure out why the strength
> &gt;&gt; &amp;gt; reduction optimization does not kick in.
> &gt;&gt; &amp;gt;
> &gt;&gt;
> &gt;&gt; As I understand it, the program is accessing A(I,J), where A is a 2D
> &gt;&gt; array, and also accessing A(I+1,J), A(I,J+1), and the other neighbours
> &gt;&gt; of A(I,J), eight in all.
> &gt;&gt;
> &gt;&gt; I,J are indices that are *not* traversed in sequential order, but in
> &gt;&gt; some scattered order.
> &gt;&gt;
> &gt;&gt; Computing the address of A(I,J) involves multiplications of J and I with
> &gt;&gt; the size of an array element and the size of an array row, respectively.
> &gt;&gt; The problem seems to be that the results of these multiplications are
> &gt;&gt; not reused in computing the address of A(I+1,J) etc.
> &gt;&gt;
> &gt;&gt; If the index constraints (or at least the row length) of A are static
> &gt;&gt; (compile-time) constants, and the address of A(I,J) has been computed,
> &gt;&gt; and c1 and c2 are some other compile-time constants (-1, 0, +1 in this
> &gt;&gt; program), then the address of A(I+c1,J+c2) can be computed by adding a
> &gt;&gt; static offset to the address of A(I,J). This is what happens when A(I,J)
> &gt;&gt; and the neighbours are accessed through a C pointer, or when Keean uses
> &gt;&gt; address arithmetic explicitly in the Ada program. The question is why
> &gt;&gt; the Ada compiler is not doing this optimization.
> &gt;&gt;
> &gt;&gt; As I understand it, &amp;quot;strength reduction&amp;quot; is a loop optimization where an
> &gt;&gt; expression of the form C*i, with &amp;quot;i&amp;quot; the loop index, is replaced by a
> &gt;&gt; new variable that is incremented by C on each loop iteration. But in
> &gt;&gt; Keean&amp;#39;s program the indices I,J are not loop indices, nor linearly
> &gt;&gt; related to loop indices.
> &gt;&gt;
> &gt;&gt; The optimization that would be required here is to replace (X+C)*F by
> &gt;&gt; X*F+C*F, when C and F are static constants and X*F has already been
> &gt;&gt; computed and is available. This would replace the multiplication by an
> &gt;&gt; addition of a static constant. I&#39;ve never seen this special kind of
> &gt;&gt; optimization proposed or described.
> &gt;
> &gt; I think that it&#39;s a standard optimisation.
> 
> I&#39;m willing to be convinced of that. But it seems that GNAT/GCC is not 
> performing this optimization in Keean&#39;s program.
> 
> &gt; For the PL/I optimising compiler for the CDC Cyber (c. 1980),
> &gt; multiplication was eliminated for matrix elements
> &gt; by using a table of offsets into each row.
> &gt; Thus, double subscripting reduced to simple addition.
> 
> That sounds like a specific choice of code generation for arrays, rather 
> than a general optimization.
> 
> Keean could simulate this code by defining the array as a 
> one-dimensional array of one-dimensional arrays (rows), which would 
> replace the row-length multiplication by an additional indexing.
> 
> &gt;&gt; (We are of course (I believe) assuming that index range checks are
> &gt;&gt; turned off.)
> &gt;
> &gt; Index range checks should be left on.
> 
> Sure, in general. But here the OP (Keean) is trying to beat the 
> performance of C++ that uses pointers without index checks. Enabling 
> index checks in the Ada code would probably (in this case, where the 
> algorithm has scatterered indexing of the array) slow it by an amount 
> that swamps the effect of the multiplication optimization.
> 
> -- 
> Niklas Holsti
> Tidorum Ltd
> niklas holsti tidorum fi
>        .      @       .

On the specific point of array index checking, if you can statically prove the algorithm using the array will never use an out of range index, there is no need to run-time index check.

If we generate a random number between one and N, and then lookup the Nth element, we do not need to range check as the number is by definition in range. If the type system can carry such proofs (for example using range types in Ada) so you have:


subtype Idx is Integer range 1 .. 10;
type Arr is array (Idx) of Integer;  
function Random returns Idx;
A : Arr;
A(Random); -- as the random number is already in the range there needs to be no range checks on the array lookup.


I don't know how good the compiler is at doing this?

There are times when the programmer can guarantee that the output of a function will be within range due to extensive unit testing and validation.


I wonder, is there a way to get the compiler (GNAT) to issue a warning when it has to insert a range check?


Cheers,
Keean.