comp.lang.ada
 help / color / mirror / Atom feed
From: Georg Bauhaus <sb463ba@l1-hrz.uni-duisburg.de>
Subject: Re: HTML parser in Ada ?
Date: Mon, 18 Nov 2002 14:17:46 +0000 (UTC)
Date: 2002-11-18T14:17:46+00:00	[thread overview]
Message-ID: <arasqa$sur$2@a1-hrz.uni-duisburg.de> (raw)
In-Reply-To: 17cd177c.0211161143.7f8d5842@posting.google.com

Gautier <gautier_niouzes@hotmail.com> wrote:
: OK - I'll take a look at proposed solutions:
: XML/Ada and OpenToken.

Be aware though that you are on the edge of natural language
processing when dealing with real world web pages. The best
one can hope is that you don't need more than full SGML with
tag minimization features. Second best will likely require
context sensitive parsing and some heuristics.

-- georg



  parent reply	other threads:[~2002-11-18 14:17 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2002-11-15 10:49 HTML parser in Ada ? Gautier direct_replies_not_read
2002-11-15 16:06 ` Preben Randhol
2002-11-15 17:00   ` Adrian Knoth
2002-11-16  4:11   ` Randy Brukardt
2002-11-16 19:43   ` Gautier
2002-11-17 12:00     ` Preben Randhol
2002-12-02 19:50       ` Nicolas Seriot
2002-11-18 14:17     ` Georg Bauhaus [this message]
  -- strict thread matches above, loose matches on Subject: below --
2002-11-15 11:08 Grein, Christoph
2002-11-15 14:24 ` Victor Porton
2002-11-18  6:38 Grein, Christoph
replies disabled

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox