From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=unavailable autolearn_force=no version=3.4.4 Path: eternal-september.org!reader01.eternal-september.org!reader02.eternal-september.org!feeder.eternal-september.org!nntp-feed.chiark.greenend.org.uk!ewrotcd!newsfeed.xs3.de!io.xs3.de!news.jacob-sparre.dk!franka.jacob-sparre.dk!pnx.dk!.POSTED.rrsoftware.com!not-for-mail From: "Randy Brukardt" Newsgroups: comp.lang.ada Subject: Re: Encaspulation: What to export Date: Wed, 29 Nov 2017 14:32:04 -0600 Organization: JSA Research & Innovation Message-ID: References: <8666203a-4e42-438d-8fe0-1a63f643955f@googlegroups.com> Injection-Date: Wed, 29 Nov 2017 20:32:05 -0000 (UTC) Injection-Info: franka.jacob-sparre.dk; posting-host="rrsoftware.com:24.196.82.226"; logging-data="27466"; mail-complaints-to="news@jacob-sparre.dk" X-Priority: 3 X-MSMail-Priority: Normal X-Newsreader: Microsoft Outlook Express 6.00.2900.5931 X-RFC2646: Format=Flowed; Original X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.7246 Xref: reader02.eternal-september.org comp.lang.ada:49259 Date: 2017-11-29T14:32:04-06:00 List-Id: wrote in message news:b9353594-3831-4fcb-b27c-b0afca754a60@googlegroups.com... > Shark8: > >> Doesn't there have to be some sort of parsing for HTML? Specifically the >> TABLE-tag? (And possibly keeping track of opening-/closing tags, in >> general.) > > For sure, HTML needs to be parsed. You find a parser here: Really? We don't have any parser (just a lexer) in the search engine crawler. As I recall, section closes are counted rather than anything more complex. I stand by my original statement. Randy.