From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,FREEMAIL_FROM autolearn=ham autolearn_force=no version=3.4.4 X-Google-Thread: a07f3367d7,e5a48ab332277b89 X-Google-Attributes: gida07f3367d7,public,usenet X-Google-NewGroupId: yes X-Google-Language: ENGLISH,ASCII Path: g2news1.google.com!postnews.google.com!15g2000yqi.googlegroups.com!not-for-mail From: Gautier write-only Newsgroups: comp.lang.ada Subject: Re: TIOBE index Date: Mon, 8 Mar 2010 08:26:31 -0800 (PST) Organization: http://groups.google.com Message-ID: <30343fd5-0f76-4802-b423-e00b0649a532@15g2000yqi.googlegroups.com> References: NNTP-Posting-Host: 206.122.158.4 Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Trace: posting.google.com 1268065592 8451 127.0.0.1 (8 Mar 2010 16:26:32 GMT) X-Complaints-To: groups-abuse@google.com NNTP-Posting-Date: Mon, 8 Mar 2010 16:26:32 +0000 (UTC) Complaints-To: groups-abuse@google.com Injection-Info: 15g2000yqi.googlegroups.com; posting-host=206.122.158.4; posting-account=gRqrnQkAAAAC_02ynnhqGk1VRQlve6ZG User-Agent: G2/1.0 X-HTTP-UserAgent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US) AppleWebKit/532.5 (KHTML, like Gecko) Chrome/4.0.249.89 Safari/532.5,gzip(gfe),gzip(gfe) Xref: g2news1.google.com comp.lang.ada:9470 Date: 2010-03-08T08:26:31-08:00 List-Id: Well, newsgroups contents are visible, copied several times, on various search engines... YouTube has a normalized weight of only 5.73%. > Also I noticed: "The first 100 pages per search engine are checked for = =A0 > possible false positives and this is used to define the confidence factor= " =A0 > - the confidence factor for Ada is not publicised. That could be the explanation... Anyway, the idea of a "transparent" index, with the same rules, but with publication of all figures, could be a nice project. Of course automatized... How complicated would it be to parse the HTML page returned by search engines ? For instance, the Google request gives somewhere in the HTML code: *

* Results 1 - 10 of about 1,440,000 for ada programming Matching the above tag, the result is at reach of the hand. Maybe some HTTP request to do with AWS ? Any hint is welcome, I'm doing the rest of the code :-) Gautier