comp.lang.ada
 help / color / mirror / Atom feed
From: Stephen Leake <stephen_leake@stephe-leake.org>
Subject: Re: AdaCore xmlada throws XML_Fatal_Error on <script> tag
Date: Thu, 7 Sep 2017 13:54:41 -0700 (PDT)
Date: 2017-09-07T13:54:41-07:00	[thread overview]
Message-ID: <379b5423-a6bf-41cb-9d47-d6a38ce27789@googlegroups.com> (raw)
In-Reply-To: <32182e81-afdc-4309-a48f-ff6115e405d7@googlegroups.com>

On Thursday, September 7, 2017 at 3:52:39 PM UTC-5, Stephen Leake wrote:
> On Thursday, September 7, 2017 at 3:47:56 PM UTC-5, Stephen Leake wrote:
> > On Thursday, September 7, 2017 at 3:05:08 PM UTC-5, gautier...@hotmail.com wrote:
> > > > As Emmanuel pointed out, you need an HTML parser, not an XML parser.
> > > 
> > > Just developing on this topic: a typical random Web page is a mix of the following
> > > 1) HTML (typically, <br> tags)
> > > 2) Ill-formed HTML (closing tags that don't close anything, opening tags that are never closed, singleton tags written as closing tags: </br>)
> > > 3) XHTML (HTML following XML syntax; e.g. <br />)
> > > At some point, there was a will to impose a clean, well-defined standard (XHTML), but it did not succeed because of a typical phenomenon: browsers need to be compatible with 1) & 2) to be in use; a browser accepting only well-formed XHTML would be ignored by users...
> > 
> > So I got wasabee_text.adb to compile, and forgetting momentarily that it wants to fetch a URL, not read a local file, I typed "wasabee_text.exe discogs.html", and it said:
> > 
> > ...
> > 
> > This document is not an XHTML document
> > 
> > So is this supposed to be an HTML 4/5 parser, or an XHTML parser?
> 
> Digging a little deeper, one top level operation is:
> 
>    procedure Load_frame(ho: in out HT_object; from: DOM.Core.Node_List);
> 
> This creates an HTML object from an XML DOM tree. And the XML DOM tree is created by AdaCore xmlada. So this is not going to do me any good.
> 
> Unless you've modified the XML parser in exactly the way I need - I'll assume that and keep digging.

Nope: 

stephe@Takver4$ /Projects/wasabee-code/target/text/wasabee_text.exe file:///Projects/org.stephe_leake.misc/build/discogs.html
protocole : file
host      : -- localhost - unused --
Port      :  0
Ressource : /Projects/org.stephe_leake.misc/build/discogs.html

Execution terminated by unhandled exception
raised SAX.READERS.XML_FATAL_ERROR : /Projects/org.stephe_leake.misc/build/discogs.html:48:5: Name differ for closing tag (expecting link, opened line 47)

  reply	other threads:[~2017-09-07 20:54 UTC|newest]

Thread overview: 17+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-09-07  1:10 AdaCore xmlada throws XML_Fatal_Error on <script> tag Stephen Leake
2017-09-07  6:21 ` briot.emmanuel
2017-09-07 19:56   ` Stephen Leake
2017-09-08 23:31     ` Georg Bauhaus
2017-09-07 15:12 ` gautier_niouzes
2017-09-07 20:03   ` Stephen Leake
2017-09-07 20:05   ` gautier_niouzes
2017-09-07 20:47     ` Stephen Leake
2017-09-07 20:52       ` Stephen Leake
2017-09-07 20:54         ` Stephen Leake [this message]
2017-09-07 21:46       ` gautier_niouzes
2017-09-08  2:12         ` Stephen Leake
2017-09-08  5:38           ` Stephen Leake
2017-09-08 16:55             ` Shark8
2017-09-09 18:07             ` Stephen Leake
2017-09-09 19:35               ` Simon Wright
2017-09-08  5:40           ` gautier_niouzes
replies disabled

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox