From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: * X-Spam-Status: No, score=1.3 required=5.0 tests=BAYES_00,INVALID_MSGID, MSGID_RANDY autolearn=no autolearn_force=no version=3.4.4 X-Google-Language: ENGLISH,ASCII-7-bit X-Google-Thread: 103376,6bf9d4ba0cfd8cb6 X-Google-Attributes: gid103376,public From: Ted Dennison Subject: Re: Announce: OpenToken 2.0 released Date: 2000/02/04 Message-ID: <87ess9$b1k$1@nnrp1.deja.com>#1/1 X-Deja-AN: 581493422 References: <3890C62B.18309585@telepath.com> <876unj$jcs$1@nnrp1.deja.com> <879fc4$eod$1@nnrp1.deja.com> X-Http-Proxy: 1.0 x25.deja.com:80 (Squid/1.1.22) for client 204.48.27.130 Organization: Deja.com - Before you buy. X-Article-Creation-Date: Fri Feb 04 15:57:01 2000 GMT X-MyDeja-Info: XMYDJUIDtedennison Newsgroups: comp.lang.ada X-Http-User-Agent: Mozilla/4.7 [en] (WinNT; I) Date: 2000-02-04T00:00:00+00:00 List-Id: In article <879fc4$eod$1@nnrp1.deja.com>, Ted Dennison wrote: > In article , > Hyman Rosen wrote: > > Well, at one point I was writing code to parse Adobe PDF files. > > They have a binary format, where arbitrary 8-bit bytes can appear, > > and a structure which I think lends itself well to syntax-oriented > > parsing. > > You along with some emailers have convinced me. I'll make the change > to the analyzer I mentioned in the previous message. That should be > sufficient to allow binaries to be parsed. All references to EOF_Character have now been removed from the analyzer. This has been integrated and tested. Thus the next version of OpenToken should be suitable for use in parsing binaries. As for the protracted discussion here on the best method for determining when the end of the text has been reached, I sort of cheated. I left that problem to the implementors of the text feeders. -- T.E.D. http://www.telepath.com/~dennison/Ted/TED.html Sent via Deja.com http://www.deja.com/ Before you buy.