comp.lang.ada
 help / color / mirror / Atom feed
From: "Dmitry A. Kazakov" <mailbox@dmitry-kazakov.de>
Subject: Re: Data table text I/O package?
Date: Tue, 21 Jun 2005 14:44:26 +0200
Date: 2005-06-21T14:44:30+02:00	[thread overview]
Message-ID: <nfosboayysqq.18h2l9wtc4081$.dlg@40tude.net> (raw)
In-Reply-To: 42b7fcde$0$15419$9b4e6d93@newsread2.arcor-online.net

On Tue, 21 Jun 2005 13:41:25 +0200, Georg Bauhaus wrote:

> Dmitry A. Kazakov wrote:
>  
>> See below. It is a table. It has bracketing: rows and columns.
> 
> Back to step one: brackets in computer tables are not named,
> a computer doesn't have accountants' abilities in pattern matching
> when looking at rows and colums in a table. Again, I said XML is good
> for parsing of data if you cannot tell in advance that the data stream
> is totally free of errors.

No it is bad, because missing one bracket may lead to loss of the whole
data set. As a medium XML is as awful as readable.

> XML provides means to build robust data
> streams in the absence of tight definitions and reliable procedures.

> As for whitespace, read Stroustrup's article on defining operator
> whitespace.

Delimiter /= whitespace.

>> This form
>> existed for centuries before XML. Who would print tables of logarithms in
>> XML?
> 
> You're missing the point: XML is *not* about rendering data.

Sorry, but the thread's subject reads "Data table text I/O package". Text =
rendered data.

> Logarithms are logarithms, not printed logarithms, this is a second
> step.  Data formats for exchange or storage on the one hand and
> a print-out of some data on the other hand are two very different beasts,
> with different purposes. Consider the MVC paradigm.

This is obviously wrong, clearly print-outs serve both data exchange and
data storage when humans are involved.

>>>If you care about robust data interchange in a "sloppy
>>>field", you employ standard tools to help you get the correct
>>>data.
>> 
>> That is a different problem for which I would use a well-defined binary
>> format instead of fancy 3.15. What is the *accuracy* of this value, huh?
> 
> It is totally unimportant what you or I would want, sorry.
> For a robust data interchange, absent comprehensive definitions
> and guarantees about data production, you need redundancy, period.
> 
> The accuracy is well defined and most importantly,
> it is up to the application, yours and mine repectively.
> We both use the accuracy that is most appropriate, and I won't
> tell you not to use an internal type when it suits your application.
> I expect the same of you. If all I have to do is to store kilometers
> measuring straight lines inside the Netherlands in a relational database,
> I known the datatype I can use, no matter what you think is best
> in your application.

This is a wrong approach of course. Because the accuracy of the data is
*not* defined by the internal type used. And in any case the internal type
is irrelevant to the data format used. Note that binary format has nothing
to do with any internal format.

> This has been discussed for years during the development of
> XML Schema. What do you care about my accuracy as long as
> I compute values from your data that are within application
> bounds? 3.15 is as accurate as can be, and independent of
> bits.

Is it 3.14998751 or 3.150000? Floating-point numbers are intervals.
Transporting them you should either use explicit bounds: [3.1499, 3.1600]
or accuracy: 3.15 +/-0.0001. "As accurate as can be" is nice, but what if
the application is a gateway, which reads 3.15 as accurate as 4 bytes float
is and then sends it away? Two other applications communicating through it
and using long long float will be quite perplexed...

>> But, lack of readability is not in the ugly </> brackets. Tabulated data
>> are readable because they are tabulated.
> 
> This is the *View* in MVC, XML is about *data*. So there is no point in
> talking about final looks, it is important to know how data will have
> to be seen. For example, can you debug datastreams using the simplest
> tools? Think of a log file of a concurrent application, processing data
> from several heterogenous input sources on the net.

Really? A normal log file of our data acquisition and control system (3-4
nodes, 500-1000 channels each) is about 10-100 MB. A trace file of the same
system is typically about 10-100GB. The first is a highly dense binary
format. The second is dense ASCII. Do you know any editor capable to load
10GB? In UltraEdit you need about 10 minutes to wait, before it becomes
ready to do anything. Now, you propose me to convert all that into XML? How
much is SCSI terabyte now? But more importantly each extra byte of rubbish
you write is multiplied by the number of channels and their frequencies,
that costs system performance.

>> That is: the names, the types and
>> units are *factored* out to the table header, which allows the reader to
>> concentrate on the *values*. Thus a table looks as:
>> 
>> Distance [km]   Temperature  [�C]  ...
>> 3.15                29.0     ...
>> 2.10                14.4     ...
>> 
>> This is readable.
> 
> This is irrelevant in data exchange. This is print.
> 
>> To make difference more visible, consider bitmaps stored XML format. Would
>> you be able to recognize a person's face in it?
> 
> You do know about NOTATION?
> I think it is very hard to find someone suggesting that we should recode
> bitmap graphics formats as pixel tags.

So an image is not print whereas a table is?

-- 
Regards,
Dmitry A. Kazakov
http://www.dmitry-kazakov.de



  reply	other threads:[~2005-06-21 12:44 UTC|newest]

Thread overview: 68+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-06-15  9:57 Data table text I/O package? Jacob Sparre Andersen
2005-06-15 11:43 ` Preben Randhol
2005-06-15 13:35   ` Jacob Sparre Andersen
2005-06-15 14:12     ` Preben Randhol
2005-06-15 15:02       ` Jacob Sparre Andersen
2005-06-15 16:17         ` Preben Randhol
2005-06-15 16:58           ` Dmitry A. Kazakov
2005-06-15 17:30             ` Marius Amado Alves
2005-06-15 18:41               ` Dmitry A. Kazakov
2005-06-15 19:09                 ` Marius Amado Alves
2005-06-15 18:58         ` Randy Brukardt
2005-06-16  9:55           ` Jacob Sparre Andersen
2005-06-16 10:53             ` Marius Amado Alves
2005-06-16 12:24               ` Robert A Duff
2005-06-16 14:01               ` Georg Bauhaus
2005-06-16 12:27                 ` Dmitry A. Kazakov
2005-06-16 14:46                   ` Georg Bauhaus
2005-06-16 14:51                     ` Dmitry A. Kazakov
2005-06-20 11:19                       ` Georg Bauhaus
2005-06-20 11:39                         ` Dmitry A. Kazakov
2005-06-20 18:25                           ` Georg Bauhaus
2005-06-20 18:45                             ` Preben Randhol
2005-06-20 18:54                             ` Dmitry A. Kazakov
2005-06-21  9:24                               ` Georg Bauhaus
2005-06-21  9:52                                 ` Jacob Sparre Andersen
2005-06-21 11:10                                   ` Georg Bauhaus
2005-06-21 12:35                                     ` Jacob Sparre Andersen
2005-06-21 10:42                                 ` Dmitry A. Kazakov
2005-06-21 11:41                                   ` Georg Bauhaus
2005-06-21 12:44                                     ` Dmitry A. Kazakov [this message]
2005-06-21 21:01                                       ` Georg Bauhaus
2005-06-22 12:15                                         ` Dmitry A. Kazakov
2005-06-22 22:24                                           ` Georg Bauhaus
2005-06-23  9:03                                             ` Dmitry A. Kazakov
2005-06-23  9:47                                               ` Georg Bauhaus
2005-06-23 10:34                                                 ` Dmitry A. Kazakov
2005-06-23 11:37                                                   ` Georg Bauhaus
2005-06-23 12:59                                                     ` Dmitry A. Kazakov
2005-06-23 14:16                                               ` Marc A. Criley
2005-06-25 16:38                               ` Simon Wright
2005-06-16 13:26                 ` Marius Amado Alves
2005-06-16 18:10                   ` Georg Bauhaus
2005-06-30  3:02             ` Randy Brukardt
2005-06-30 18:43               ` Jacob Sparre Andersen
2005-07-01  1:22                 ` Randy Brukardt
2005-07-01  3:01                   ` Alexander E. Kopilovich
2005-07-01  5:59                     ` Jeffrey Carter
2005-07-02  1:54                     ` Randy Brukardt
2005-07-02 10:24                       ` Dmitry A. Kazakov
2005-07-06 22:04                         ` Randy Brukardt
2005-06-30 19:24               ` Björn Persson
2005-07-01  0:54                 ` Randy Brukardt
2005-07-01 21:36                   ` TSV and CSV Björn Persson
2005-07-01 22:08                     ` Martin Dowie
2005-07-02  0:05                       ` Georg Bauhaus
2005-07-02  1:10                         ` Randy Brukardt
2005-07-02  1:20                           ` Ed
2005-07-03  9:08                           ` Georg Bauhaus
2005-07-02  0:07                   ` Data table text I/O package? Georg Bauhaus
2005-07-02  1:21                     ` Randy Brukardt
     [not found]     ` <20050615141236.GA90053@pvv.org>
2005-06-15 15:40       ` Marius Amado Alves
2005-06-15 19:18         ` Oliver Kellogg
2005-06-17  9:02           ` Jacob Sparre Andersen
     [not found]       ` <7adf1648bb99ca2bb4055ed8e6e381f4@netcabo.pt>
2005-06-15 15:46         ` Preben Randhol
     [not found]         ` <20050615154640.GA1921@pvv.org>
2005-06-15 16:14           ` Marius Amado Alves
     [not found]           ` <f04ccd7efd67fe197cc14cda89340779@netcabo.pt>
2005-06-15 16:20             ` Preben Randhol
2005-06-15 19:30 ` Simon Wright
2005-06-15 22:40 ` Lionel Draghi
replies disabled

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox