From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=0.2 required=5.0 tests=BAYES_00,INVALID_MSGID, REPLYTO_WITHOUT_TO_CC autolearn=no autolearn_force=no version=3.4.4 X-Google-Language: ENGLISH,ASCII-7-bit X-Google-Thread: 103376,a1868a27625c21e7 X-Google-Attributes: gid103376,public From: kilgallen@eisner.decus.org (Larry Kilgallen) Subject: Re: Looking for keyed file package Date: 1999/09/25 Message-ID: <1999Sep25.114101.1@eisner>#1/1 X-Deja-AN: 529438217 X-Nntp-Posting-Host: eisner.decus.org References: <37E817C6.80ED41E0@easystreet.com> <7saii8$5bl$1@nnrp1.deja.com> <7scrba$bdu$1@clnews.edf.fr> <7semeb$69u$1@nnrp1.deja.com> <7seobv$7ib$1@nnrp1.deja.com> <1999Sep24.130829.1@eisner> <7sijv6$tc8$1@nnrp1.deja.com> X-Trace: news.decus.org 938274065 2931 KILGALLEN [216.44.122.34] Organization: LJK Software Reply-To: Kilgallen@eisner.decus.org.nospam Newsgroups: comp.lang.ada Date: 1999-09-25T00:00:00+00:00 List-Id: In article <7sijv6$tc8$1@nnrp1.deja.com>, Robert Dewar writes: >> but I think that all started back when disk and memory space >> were much more dear. I/O bandwidth one area of computer >> performance that does not seem to be advancing so much as the >> others, so these days typical advice might be to engage in >> extensive caching of the index data if performance is >> important. > > The whole *point* of key compression is to allow this caching > to be more efficient. You want as much of the key index in main > memory as possible (and if you are really pushing things, in > cache as possible). Key compression is vitally important for > this, and in fact it is MORE important with modern architectures > than with older ones, precisely because of the increasing gap > between processor/memory speed and I/O speed. Since you say "in main memory as possible (and ...in cache", I gather you are talking about processor memory cache, whereas what I meant was having the index in the caches created by the indexed file system in main memory. Presuming the actual data will likely require a disk read, I think the difference between index data being in processor cache and being in the file system cache in main memory is not of consequence. The data I have been dealing with this month have a maximum of 16 bytes of key data for a record length of 600 bytes, so the standard recommendation to set the cache size large enough to hold the entire index tree in memory seems reasonable. Memory is cheap enough, however, that even not having the keys compressed would be acceptable in this setting. That would be quite different with 584 bytes of key data for a record length of 600 bytes. Of course particular key data can be more or less susceptible to compression. Larry Kilgallen