From mboxrd@z Thu Jan  1 00:00:00 1970
X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me
X-Spam-Level: 
X-Spam-Status: No, score=-0.9 required=5.0 tests=BAYES_00,FORGED_GMAIL_RCVD,
	FREEMAIL_FROM autolearn=no autolearn_force=no version=3.4.4
X-Google-Thread: 103376,a525118741961e98
X-Google-NewGroupId: yes
X-Google-Attributes: gida07f3367d7,domainid0,public,usenet
X-Google-Language: ENGLISH,ASCII-7-bit
Path: 
 g2news1.google.com!postnews.google.com!u3g2000vbe.googlegroups.com!not-for-mail
From: Emmanuel Briot <briot.emmanuel@gmail.com>
Newsgroups: comp.lang.ada
Subject: Re: xml/ada dropping data when pre-defined entities are separated by
 space?
Date: Mon, 31 Jan 2011 06:56:18 -0800 (PST)
Organization: http://groups.google.com
Message-ID: <782436de-8a75-450d-be79-1efe555c4f5e@u3g2000vbe.googlegroups.com>
References: <05aafe44-cdd9-4c28-8e3f-24ecd9067ab3@u6g2000vbh.googlegroups.com>
 <4d415333$0$6769$9b4e6d93@newsspool3.arcor-online.net>
 <839dbe65-e971-4db7-ad25-269253f02c69@c10g2000vbv.googlegroups.com>
 <m2mxmj3a68.fsf@pushface.org>
 <5e91567e-883f-428c-b01e-ee51e91ca30f@o8g2000vbq.googlegroups.com>
 <4d4600c9$0$6880$9b4e6d93@newsspool2.arcor-online.net>
 <c4e6eada-7ca3-4add-9aca-b6ca828ef73a@d21g2000vbf.googlegroups.com>
NNTP-Posting-Host: 194.98.77.125
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1
X-Trace: posting.google.com 1296485778 23423 127.0.0.1 (31 Jan 2011 14:56:18
 GMT)
X-Complaints-To: groups-abuse@google.com
NNTP-Posting-Date: Mon, 31 Jan 2011 14:56:18 +0000 (UTC)
Complaints-To: groups-abuse@google.com
Injection-Info: u3g2000vbe.googlegroups.com; posting-host=194.98.77.125;
 posting-account=6yLzewoAAABoisbSsCJH1SPMc9UrfXBH
User-Agent: G2/1.0
X-HTTP-UserAgent: Mozilla/5.0 (X11; U; Linux x86_64; en-US) AppleWebKit/534.10
 (KHTML, like Gecko) Chrome/8.0.552.237 Safari/534.10,gzip(gfe)
Xref: g2news1.google.com comp.lang.ada:16814
Date: 2011-01-31T06:56:18-08:00
List-Id: <comp.lang.ada>

> I was just going to ask what makes xml/ada decide why a textnode is
> sometimes split into several nodes. I see the pattern now, of course,
> 'split on predefined entity' but why?

Because XML/Ada tries to be as efficient as possible, and normalizing
the document takes time that a lot of application have no need for. If
indeed your application is only able to deal with normalized document
(it really shouldn't, the XML standard is quite clear that a document
is not necessarily normalized), then indeed you should call Normalize.