From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=ham autolearn_force=no version=3.4.4 X-Google-Thread: 103376,e2db735fbfbc85e6 X-Google-Attributes: gid103376,public X-Google-Language: ENGLISH,ASCII-7-bit Path: g2news1.google.com!postnews.google.com!g49g2000cwa.googlegroups.com!not-for-mail From: Paul.Jansen@tiobe.com Newsgroups: comp.lang.ada Subject: Re: TIOBE Programming Community Index - update Date: 19 Sep 2005 16:50:39 -0700 Organization: http://groups.google.com Message-ID: <1127173839.479890.146810@g49g2000cwa.googlegroups.com> References: <1597249.XV1kqXaE3M@linux1.krischik.com> NNTP-Posting-Host: 213.46.68.9 Mime-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" X-Trace: posting.google.com 1127173844 14232 127.0.0.1 (19 Sep 2005 23:50:44 GMT) X-Complaints-To: groups-abuse@google.com NNTP-Posting-Date: Mon, 19 Sep 2005 23:50:44 +0000 (UTC) User-Agent: G2/0.2 X-HTTP-UserAgent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322),gzip(gfe),gzip(gfe) Complaints-To: groups-abuse@google.com Injection-Info: g49g2000cwa.googlegroups.com; posting-host=213.46.68.9; posting-account=rfbu1wwAAAD1MltusiYQKehlYMDsWbUd Xref: g2news1.google.com comp.lang.ada:4919 Date: 2005-09-19T16:50:39-07:00 List-Id: Hi Martin, Thanks again for your critical view on the TPC index. Let me first stress that I am not biased as you are suggesting in your review. I don't care about Ada being in 1st, 5th or 34th position. This has nothing to do with our portfolio, because we adjust our portfolio based on the TPC index and not the other way around. What we are trying to do is to measure programming popularity with as simple rules as possible in an objective way. A while ago the language ABC (predecessor of Python) had a high score. This was mainly due to the TV channel ABC and not because of ABC's raising popularity. That's why we excluded "tv" and later on "channel". For ABC the difference is 26,000 hits versus 886 hits! That's what I call a difference (2934.5%)! Now let me give you a brief course on statistics. If we add the "missing" 17.4% to Ada's ratings this will become 0.540% * 1.174 = 0.634%. Now Ada is passing Fortran, wow! But... Fortran misses also 14.3%, so Fortran's ratings will become 0.600 * 1.143 = 0.686%. In other words, the "-tv -channel" addition has no effect on the position of Ada whatsoever! To be honest, I think most staticians will laugh about your conclusion. It is a beginner's mistake. I agree with you that we exclude now also legitimate pages with "-tv -channel". But these are more or less equally distributed over all languages. They hardly influence the ratings of languages because all languages suffer from it more or less to the same degree. Only languages which have a high amount of false positives are corrected in this way. Finally, there is indeed a bug in Google with exclusions. I have submitted this already to Google.com, but apparently this has not been fixed yet. Looking forward to your next review! Regards, Paul