From: Oleg Goodyckov <og@videoproject.kiev.ua>
Subject: Re: FAQ and string functions
Date: Mon, 5 Aug 2002 18:12:31 +0300
Message-ID: <20020805181231.C2351@videoproject.kiev.ua>
In-Reply-To: b0osku0tktsihgp0hoih183250hq3pjhq5@4ax.com
On Mon, Aug 05, 2002 at 01:50:38PM +0200, Dmitry A. Kazakov wrote:
> >me it is more effective to process only correct data (which is reliably
> >recognized) and simply drop everything else.
>
> Ah, that practice, which makes HTML a disaster because browsers
> silently ignore what they do not understand. The results are known.
It seems you don't like this "known" result. Why?
> The problem of all global methods is that the parameters they need
> cannot be optimal in a large context. Split is an example. It
> requires a separator and a notion of a token which may vary from point
> to point, making the approach useless.
Baseless assertions. Again.
> I remember a project with a config file ~2 MBytes in size (it was a
> Windows registry folder). I wonder how much time it would take to
> parse it using the split technique.
Why did you pick such a nasty example?
> >> that as the complexity of syntax increases it becomes almost impossible at
> >> some point to write a correct pattern and prove that it is correct.
> >
> >What "complexity of syntax"? The syntax could not be simpler: fields
> >with separators (of one type) between them.
>
> It is not a real syntax.
That is what I am trying to say.
> >Take record, split it by separators and enjoy.
>
> Well, how long a record is allowed to be?
There is no need for such a constraint. Any length will do.
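To make the "split it by separators" idea concrete, here is a minimal sketch in Python. The function name `split_record`, the `:` separator, and the sample record are assumptions for illustration only; nothing like them appears in the thread.

```python
def split_record(record: str, sep: str = ":") -> list[str]:
    """Split one record into fields on a single kind of separator."""
    # Strip the trailing newline first so it does not end up in the last field.
    # str.split places no limit on the record length or the number of fields.
    return record.rstrip("\n").split(sep)


# Hypothetical sample record with seven colon-separated fields.
fields = split_record("root:x:0:0:root:/root:/bin/sh\n")
```

Note that the sketch handles only the simple case under discussion: one separator type and no quoting or escaping inside fields.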
> >Really? Empty words. Try and show me. In the skipped example I saw one
> >attempt. Show me another, better one.
> >The task solved in the skipped example has a name: building a histogram of
> >words. Why do you call this task unrealistic?
>
> Because a histogram is also a global method (used, I suppose, for some sort of
> clustering) which also has great limitations and is by no means an end
> product of the program.
OK. That answers my second question (not very impressively, BTW). Now how
about the first one, about a better implementation of the task?
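For reference, a word histogram built with the split approach might look like the sketch below. This is not the skipped example from earlier in the thread; it is a hypothetical Python version that assumes words are separated by whitespace, and the name `word_histogram` and the sample lines are made up for illustration.

```python
from collections import Counter


def word_histogram(lines):
    """Count how often each whitespace-separated word occurs in the lines."""
    counts = Counter()
    for line in lines:
        # split() with no argument splits on any run of whitespace
        # and ignores leading and trailing blanks.
        counts.update(line.split())
    return counts


# Hypothetical input: two short lines of text.
hist = word_histogram(["to be or not to be", "that is the question"])
```

Punctuation and letter case are deliberately left alone here; a real program would have to decide how to treat them.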
> >So, if that 80% of the code is thrown out, will the program still work? Or is
> >it necessary after all?
>
> Not for text processing. I supposed that it does something more than
> only that.
>
> Generally, if you have a problem to solve you must first decompose it
> into subproblems. You should do it properly. Surely one could use
> eigenvalues and vectors to invert a matrix but this would be a *bad*
> idea. To decompose some text analysing problem into a bunch of split
> operations is also a *bad* idea. This is my point.
Baseless point. Exorcisms.