From: "Dmitry A. Kazakov"
Newsgroups: comp.lang.ada
Subject: Re: Ann: Natools.Chunked_Strings, beta 1
Date: Fri, 2 Dec 2011 14:11:19 +0100

On Fri, 02 Dec 2011 10:30:11 +0100, Georg Bauhaus wrote:

> On 02.12.11 09:27, Dmitry A. Kazakov wrote:
>> DNA chain is not a text string. Furthermore it would likely have some
>> specific operations and a representation tailored substring search.
>
> I have tried this once. Sequence information is given and is using just
> a handful of characters. I mapped those to some 4bit type, even tried less.
> Added lots of purportedly smart unchecked conversions, some shifting,
> made my head spin by thinking about what combinations of "characters"
> might suggest there could be clever additions, not shifts and the like
> for obtaining info about substrings or single "characters", noted that
> addition is faster than shifting or logical operations on the processor,
> etc. Tried specializing searching. If there is a solution, it seems tricky.
> Perhaps to be found by someone with more than ordinary combinatorial skills.
> In my case this effort has produced only minuscule advantages, sometimes
> the opposite, but the cost was a large number of specialized subprograms.

There is a simple way to do such things efficiently: you tabulate the
operations you want. For a dyadic operation taking two arguments of 2**8
states each, that is just 2**16 combinations. A 64K array is nothing these
days. The array can be initialized by an aggregate whose text is generated
once.

I use such techniques from time to time, e.g. for character conversion
maps, for small images (icons etc.) embedded into the application, etc.

-- 
Regards,
Dmitry A. Kazakov
http://www.dmitry-kazakov.de
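
To make the tabulation idea concrete, here is a minimal sketch in Python rather than Ada. Everything in it is an assumption made for illustration: the 4-bit base codes, the two-bases-per-byte packing, the choice of "count equal nibbles" as the dyadic operation, and all the names. It shows only the general shape: compute the operation the slow way once for every pair of byte values, store the 2**16 results, and replace the bit fiddling with a single lookup.

```python
# Hypothetical 4-bit codes for the four bases; any mapping would do.
CODE = {"A": 0x1, "C": 0x2, "G": 0x4, "T": 0x8}


def pack(seq):
    """Pack a base string into bytes, two 4-bit codes per byte."""
    if len(seq) % 2:
        raise ValueError("this sketch handles even-length sequences only")
    return bytes((CODE[seq[i]] << 4) | CODE[seq[i + 1]]
                 for i in range(0, len(seq), 2))


def matches_slow(x, y):
    """Reference operation: number of equal nibbles in bytes x and y."""
    return int((x >> 4) == (y >> 4)) + int((x & 0xF) == (y & 0xF))


# Tabulate the dyadic operation once: 2**8 * 2**8 = 2**16 one-byte entries.
TABLE = bytes(matches_slow(x, y) for x in range(256) for y in range(256))


def matching_positions(a, b):
    """Count positions where two equally long packed sequences agree."""
    return sum(TABLE[(x << 8) | y] for x, y in zip(a, b))
```

In Ada the same table would be a constant array indexed by two 8-bit values, with the aggregate text produced once by a small generator program, as described above. Whether the lookup actually beats the shifts and masks depends on the operation and on cache behaviour, so it is worth measuring.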