From: mheaney@on2.com (Matthew Heaney)
Subject: Re: sorting large numbers of large records
Date: 30 Jul 2003 07:55:40 -0700
Message-ID: <1ec946d1.0307300655.2ad33d81@posting.google.com>
In-Reply-To: jyFVa.715$7h6.251@nwrdny03.gnilink.net
Hyman Rosen <hyrosen@mail.com> wrote in message news:<jyFVa.715$7h6.251@nwrdny03.gnilink.net>...
> Keith Thompson wrote:
> > If you're using GNU sort, the length of each line can be unlimited;
>
> So there :-)
>
> > Finally, I'm not sure what GNU sort (or any other Unix-like sort) does
> > with input too big to fit into memory;
>
> As far as I know, UNIX sort has always been able to sort arbitrarily large
> files by doing what I suggested in an earlier message - sort pieces of the
> original into temporary files, then merge them together.

Actually, since these are fixed-size records, you could sort the file
in place by simply exchanging record values on disk. You have random
access via Ada.Direct_IO.

You could also memory-map the entire file and view it as one large
array.

In either case you could use a generic sort algorithm (the one in
Charles would be adequate) to do the work.
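
To make the Direct_IO variant concrete, here is a minimal sketch, not a
tuned implementation. It assumes a made-up record layout (Sample_Record),
a made-up file name ("sort_demo.dat"), and a hand-written generic heap
sort (Generic_Heap_Sort) standing in for a library algorithm; none of
these names come from Charles or from the original poster's data. The
sort touches the data only through Less and Swap on record indices, so
the same generic could just as well be instantiated over a mapped array.

```ada
with Ada.Direct_IO;
with Ada.Text_IO;

procedure Sort_On_Disk is

   --  Hypothetical fixed-size record; the real layout is not given.
   type Sample_Record is record
      Key     : Integer;
      Payload : String (1 .. 60);
   end record;

   package Record_IO is new Ada.Direct_IO (Sample_Record);
   use Record_IO;

   --  In-place heap sort that sees the data only through Less and Swap.
   generic
      type Index is range <>;
      with function Less (L, R : Index) return Boolean;
      with procedure Swap (L, R : Index);
   procedure Generic_Heap_Sort (First, Last : Index);

   procedure Generic_Heap_Sort (First, Last : Index) is

      --  Restore the max-heap property below Root, within First .. Bound.
      procedure Sift_Down (Root, Bound : Index) is
         Parent : Index := Root;
         Child  : Index;
      begin
         loop
            Child := First + 2 * (Parent - First) + 1;  --  left child
            exit when Child > Bound;
            if Child < Bound and then Less (Child, Child + 1) then
               Child := Child + 1;                      --  larger child
            end if;
            exit when not Less (Parent, Child);
            Swap (Parent, Child);
            Parent := Child;
         end loop;
      end Sift_Down;

   begin
      if Last <= First then
         return;  --  zero or one element: nothing to do
      end if;
      --  Build the heap, starting from the last parent node.
      for I in reverse First .. First + (Last - First - 1) / 2 loop
         Sift_Down (I, Last);
      end loop;
      --  Move the current maximum to the end and shrink the heap.
      for Bound in reverse First + 1 .. Last loop
         Swap (First, Bound);
         Sift_Down (First, Bound - 1);
      end loop;
   end Generic_Heap_Sort;

   File : File_Type;

   --  Compare two records on disk by key.
   function Key_Less (L, R : Count) return Boolean is
      A, B : Sample_Record;
   begin
      Read (File, A, From => L);
      Read (File, B, From => R);
      return A.Key < B.Key;
   end Key_Less;

   --  Exchange two records on disk.
   procedure Exchange (L, R : Count) is
      A, B : Sample_Record;
   begin
      Read (File, A, From => L);
      Read (File, B, From => R);
      Write (File, B, To => L);
      Write (File, A, To => R);
   end Exchange;

   procedure Sort_File is new Generic_Heap_Sort
     (Index => Count, Less => Key_Less, Swap => Exchange);

   R : Sample_Record;

begin
   --  Build a small demonstration file with keys in descending order.
   Create (File, Inout_File, "sort_demo.dat");
   for K in reverse 1 .. 10 loop
      Write (File, (Key => K, Payload => (others => ' ')));
   end loop;

   Sort_File (1, Size (File));

   --  Print the keys in their new on-disk order.
   for I in 1 .. Size (File) loop
      Read (File, R, From => I);
      Ada.Text_IO.Put_Line (Integer'Image (R.Key));
   end loop;

   Delete (File);
end Sort_On_Disk;
```

Heap sort is used here only because it needs nothing beyond compare and
exchange and works in place. Every comparison costs two reads and every
exchange two reads and two writes, so for very large files the
sort-and-merge approach Hyman describes may well do less I/O.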