From: Peter Brooks <peter.h.m.brooks@gmail.com>
Subject: Re: B-tree performance
Date: Tue, 3 Sep 2013 10:43:47 -0700 (PDT)
Date: 2013-09-03T10:43:47-07:00 [thread overview]
Message-ID: <f00b1750-5d75-4ffa-9e01-b811567301d2@googlegroups.com> (raw)
In-Reply-To: <31834951-571f-4361-b18a-e0abde8fa219@googlegroups.com>
On Tuesday, 3 September 2013 18:14:19 UTC+2, Dan'l Miller wrote:
>
>
> So it would make sense to not use a B-tree. Indeed, it would make sense to not restart the comparison again at each node, because it wastes time and bloats memory-space. Hence, it would make sense to use a radix trie. http://en.wikipedia.org/wiki/Radix_trie
>
Yes, that might make sense. I did something similar many years ago when I produced correlation matrices of letter sequences in words. It wouldn't work as a linked list, but a multi-dimensional associative array might be an option - thank you, I'll look into that.
One pleasant side-effect of this method would be that it'd be much easier to search for the same entry in different URIs.
I'll have to experiment a bit on workable representations of this.
The funny thing about this is that it might be very easy to implement in awk. Not what I was looking for or thinking of. I'm not sure how awk behaves with very large arrays - I'll find out quite soon.
prev parent reply other threads:[~2013-09-03 17:43 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-08-31 7:14 B-tree performance Peter Brooks
2013-08-31 15:26 ` Shark8
2013-08-31 15:47 ` Peter Brooks
2013-09-01 4:57 ` Niklas Holsti
2013-09-01 6:05 ` Peter Brooks
2013-09-01 8:05 ` Niklas Holsti
2013-09-01 8:38 ` Peter Brooks
2013-09-02 4:28 ` Niklas Holsti
2013-09-03 2:15 ` Peter Brooks
2013-09-05 8:25 ` Memory mapping and network file systems (Was: B-tree performance) Jacob Sparre Andersen
2013-09-05 10:12 ` Peter Brooks
2013-09-03 16:14 ` B-tree performance Dan'l Miller
2013-09-03 17:43 ` Peter Brooks [this message]
replies disabled
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox