From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=unavailable autolearn_force=no version=3.4.4 Path: eternal-september.org!reader01.eternal-september.org!reader02.eternal-september.org!news.eternal-september.org!mx02.eternal-september.org!feeder.eternal-september.org!feeder.erje.net!eu.feeder.erje.net!news.albasani.net!.POSTED!not-for-mail From: Martin Trenkmann Newsgroups: comp.lang.ada Subject: Re: Implementing character sets for Wide_Character Date: Fri, 06 Mar 2015 22:02:41 +0100 Organization: albasani.net Message-ID: References: <87385i9emd.fsf@theworld.com> Mime-Version: 1.0 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit X-Trace: news.albasani.net x66om31GpwiEft022UoT8V6LF/n3hSpSB4MRfE62IY4q2KW9yfEpA/iH3znwj7/gmrXK4pOyns4O2/KIO0gOgw== NNTP-Posting-Date: Fri, 6 Mar 2015 21:02:41 +0000 (UTC) Injection-Info: news.albasani.net; logging-data="kj3ZOFYJo9bsK4pzjiBfoTHR4ZLWV2VhgSh+lZ+/ucrG1tGYXLqSLzFjI9c9WLyZkYzGFTl//94QKX5CcS9iingEJ8D/RNb2KcQTC/5vmVc+cTU47bgYL6jOdgbbPegi"; mail-complaints-to="abuse@albasani.net" User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Icedove/31.4.0 In-Reply-To: <87385i9emd.fsf@theworld.com> Cancel-Lock: sha1:fWB/ZSdFp33We+gbshyCTJZI9UM= Xref: news.eternal-september.org comp.lang.ada:25132 Date: 2015-03-06T22:02:41+01:00 List-Id: >> I need to implement a containment check for Wide_Character in a wide >> character set. I have two approaches in mind. > > If you really care about speed, then implement both and measure the > speed. It's not clear that the bit map will be faster -- using more > memory harms cache behavior. Yes that's true. > Also take a look at package Ada.Strings.Wide_Maps. > It contains type Wide_Character_Set, represented as a sorted > sequence of ranges. Measure that one, too. Don't know why I overlooked that package. I don't intend to reinvent the wheel - thanks for the hint. > Why Wide_Character rather than Wide_Wide_Character? Because my pretty old character set I have to deal with assigns Japanese characters to 16-bit values. - Martin