From: mfb@mbunix.mitre.org (Michael F Brenner)
Subject: Re: Strings in Ada83
Date: 1997/10/15
Date: 1997-10-15T00:00:00+00:00 [thread overview]
Message-ID: <622h8m$bd1@top.mitre.org> (raw)
In-Reply-To: 3443D8AF.D9CD4393@morden.csee.usf.edu
Scott said: > ... strings package is at
> http://www.csee.usf.edu/~dick/cstring.html
> ... your comments would be most welcome. Thanks...
If you ever choose to revise it, you might consider these ideas:
(1) Separation of I/O from computation. For example,
The get_line and put_line could be put in a second package
(or a child package in Ada-95). With suitable revisions,
the package would then be a pure package which would be
subject to additional optimizations. In Ada-95 this pureness
can be celebrated with a specific pragma, but an Ada-83
optimizer can still do the data analysis to recoginize the
pureness of the code. Making the string manipulation portion
of the package dependent upon text_io makes it non-portable.
Realtime embedded systems may not have a text_io
connected, consider a satellite in orbit. In addition,
text_io usually brings in a lot of software which is
a memory hog. More important than the practical problems
with forcing the user to use text_io in order to use
strings is the lack of cohesion in the package caused by
mixing I/O with string manipulation; to increase the
cohesion, let the package do one thing (string manipulation)
without a very complex other thing (I/O).
(2) Use of a limited type. Since you chose to make it a limited type,
why not make INSERT and DELETE functions instead of procedures,
taking advantage of a fully functional style?
(3) Use of a limited private type. Consider making type cstrings
a limited private type, and convert all the functions to
procedures, except tostring, length, and the comparision
operators, taking advantage of the safety of limited private
thypes.
(4) Letting the user choose the size of each string. You might wish
to change the type definition to permit each string to have its
own size. Arrays, however, would have to have all strings of the
same maximum length.
(5) The exception cstring_null which is raised when the text_io file
is not open, does not inform the user what the real problem is.
(6) Have you tested this using the full capability of text_io to
process files with end-of-page markers in them?
(7) The loop in the less-than function would be easier to read, by
removing the boolean variable GO and experiencing the corresponding
lowering in the depth of nesting. To do this, replace the
statement IF LEFT... /= RIGHT ... with the statement
EXIT WHEN LEFT /= RIGHT. In most compilers, doing this would
result in faster code because the I:=I+1 would then be the
innermost nesting level of the loop and thus be slightly
better optimized. This would not matter so much in a machine
with a deep instruction pipeline or where the program cache
is not voided by jumps.
(8) The body of the function less-than-or-equal-to can be simplified
by noting that IF CONDITION THEN RETURN TRUE ELSE RETURN FALSE
is equivalent to RETURN CONDITION.
(9) There are no protection against uninitialized cstrings (as you
and this code may go into long or infinite loops when used
on uninitialized cstrings.
(10) Returning a maximum sized Ada string (say 1000000 characters
long) for a 1 character long cstring is not efficient and this
should be documented for the user. Better, why not return an
Ada string that is the same length as the cstring?
(11) In CSTRINGTONUMBER, the expression
IF NOT (('0'<=X) and (x<='9')) THEN
would be more readable if replaced with
IF X NOT IN '0'..'9' THEN
(12) The exception cstring_null in cstringtonumber does not
adequately inform the user that a minus sign, plus sign,
underline, or decimal point was used as part of a number
and this code prohibits those characters.
(13) It might be worth testing why MY_REVERSE does not actually
reverse all cstrings.
(14) The variable FLAG would be obviated by replacing
flag:=True;
WHILE FLAG LOOP
... code ...
if CONDITION then
flag := FALSE;
end if;
end loop
with the simpler loop
loop
... code ...
exit when CONDITION;
end loop;
(15) When back filling with ASCII null characters, some of the
loops fill the cstring up to max_size and some fill it
up to max_size+1. This is often the sign of a bug. In this
case, using suitable checks for non-initialized cstrings,
the appropriate thing to do is not to fill the cstrings at
all, but just add a single ASCII null at then end of the
meat of the string.
(16) The formal parameter INDEX could be renamed AFTER to indicate
its function in the procedure INSERT.
(17) What happens when inserting something at the beginning of
a cstring?
(18) What happens when deleting max_size characters from a string?
(19) Is it true that BM_scan only located strings correctly when
the pattern string being looked for (SUB) does not contain
repetitious strings withing itself? For example, will it
fail if INJINKINL is searched using the pattern INKINL, because
it notices that INKINL doesnot match INJINK (the first 6 letters)
and bumps up the count by 6, next trying to match INKINL against
INL which also fails?
(20) The test programs in your sample drivers do not flesh out all
the bugs. An IF statement is needed for each feature of the
package. This should only appear inside this package if it
done entirely with ASSERT statements and use no I/O.
procedure ASSERT (condition: boolean) is
Scott_made_an_error: exception;
begin
if not condition then
raise Scott_made_an_error;
end if;
end assert;
(21) The data integrity paragraph of the user documentation is not
accurate, promising that cstrings can only be accessed
through the operations defined in the package. In addition,
a private type can be access through the equality operator,
the assignment operator, and through non-initialized
elaboration.
(22) The Notes on Design and Implementation are not adequate to
fix the bugs or to maintain this software.
(23) The above comments, notwithstanding, this is a good piece
of work and, if you concentrate an equal amount of time
writing test programs as you do writing lines of code,
you will solve many problems during your long and happy
career.
next prev parent reply other threads:[~1997-10-15 0:00 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
1997-10-14 0:00 Strings in Ada83 Scott Dick (CS)
1997-10-15 0:00 ` Michael F Brenner [this message]
1997-10-18 0:00 ` Matthew Heaney
replies disabled
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox