From: Preben Randhol
Newsgroups: comp.lang.ada
Subject: Re: HTML parser in Ada ?
Date: Fri, 15 Nov 2002 16:06:35 +0000 (UTC)
Organization: Norwegian university of science and technology
User-Agent: slrn/0.9.7.4 (Linux)

Gautier direct_replies_not_read wrote:
> Is there somewhere Ada source(s) for an (ideally simple) HTML
> parser ? - Before I reinvent the wheel...

If you are not making something that is meant to read the web-pages on
the net, please make something that reads XHTML only, or that follows
the HTML DTD strictly and rejects all faulty pages. Trying to make
something that can read arbitrary web-pages is very difficult, and your
application gets very error-prone. Most web-pages out there are broken
and do not use proper HTML, so if you want to display them correctly
you have to make a lot of exceptions to the HTML DTD.

Hmm, I think it is time I validate my own web-pages again. :-)

It would be great to have an (X)HTML engine that does not crash, though.
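
To illustrate the "reject all faulty pages" approach, here is a minimal sketch in Python (not Ada, only for brevity). It assumes the input is XHTML and checks XML well-formedness only, not validity against the DTD. The function name is_well_formed is made up for this example.

```python
import xml.etree.ElementTree as ET


def is_well_formed(markup: str) -> bool:
    """Return True if markup parses as well-formed XML, False otherwise.

    Hypothetical helper: it only checks well-formedness (balanced tags,
    quoted attributes, a single root element). It does NOT validate the
    document against the XHTML DTD.
    """
    try:
        ET.fromstring(markup)
    except ET.ParseError:
        # Strict policy: refuse the page instead of guessing what was meant.
        return False
    return True


if __name__ == "__main__":
    good = "<html><body><p>ok</p></body></html>"
    bad = "<html><body><p>broken</body></html>"  # <p> is never closed
    print(is_well_formed(good))
    print(is_well_formed(bad))
```

The point of the strict policy is that the parser never has to guess: a page is either accepted as is or refused, so there are no exceptions to the grammar to maintain.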

> MSN 8 with e-mail virus protection service: 2 months FREE*

After two months they will start sending you viruses so you will pay I
guess ;-)

> http://join.msn.com/?page=features/virus

Resistance is futile. After learning that simple javascript on a page
can get IE to delete all your files, I pull out the network cable when I
have to use Windows for something.

-- 
Preben Randhol ------------------------ http://www.pvv.org/~randhol/ --
"There are three things you can do to a woman. You can love her,
 suffer for her, or turn her into literature."
 - Justine, by Lawrence Durrell