From: Jeffrey Carter <spam@spam.com>
Subject: Re: character matching
Date: Fri, 13 Aug 2004 18:41:19 GMT
Message-ID: <jX7Tc.20005$9Y6.13857@newsread1.news.pas.earthlink.net>
In-Reply-To: <gcYSc.818$Y24.419@cyclops.nntpserver.com>
John J wrote:
> To develop a program that counts words and sentences I'm trying to
> write some code that will recognise a word due to it being a series
> of alphanumeric characters terminated by one or more of a space,
> comma, full stop, colon, exclamation mark or question mark. I'm going
> to use a case to match the condition then perform an advance on a
> wordcounter.
>
> I also need to include a case match that will recognise sentences by
> being a sequence of words terminated by one or more of full stop,
> exclamation mark, question mark or colon. The code also needs to be
> able to accept that there may be spaces between the last word of the
> sentence and the terminating stop, i.e. "hope this works !". I'm not
> sure how to match these conditions and would greatly appreciate
> some assistance. My skeleton code is as follows:
This certainly sounds like a homework assignment.
A good approach is to treat this as a state machine with two states:
you're either in a word or not in a word, and you start out not in a
word. When you're not in a word, terminator characters are junk and
leave you in the same state; characters that can be part of a word put
you into the in-a-word state. When you're in a word, characters that can
be part of a word leave you in the same state; a terminator puts you
into the not-in-a-word state, which is where you increment the word
count, and if it is also a sentence terminator you can increment the
sentence count as well. Because repeated terminators are ignored once
you're out of a word, this approach counts "Help me!!!" as one sentence
rather than three.
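To make the two states concrete, here is a rough sketch of the idea. It is
written in Python only to keep it short; it is not the original poster's
code, and the names (count_words_and_sentences, SENTENCE_TERMINATORS) are
made up for illustration. It assumes that any non-alphanumeric character
ends a word, and that end of input also ends a word.

```python
# Sentence terminators as listed in the original question.
SENTENCE_TERMINATORS = set(".!?:")


def count_words_and_sentences(text):
    """Return (words, sentences) using a two-state machine.

    Assumption: a word is a run of alphanumeric characters, and any
    other character (space, comma, full stop, ...) terminates it.
    """
    in_word = False  # initial state: not in a word
    words = 0
    sentences = 0

    for ch in text:
        if in_word:
            if not ch.isalnum():
                # A terminator ends the current word.
                in_word = False
                words += 1
                if ch in SENTENCE_TERMINATORS:
                    sentences += 1
            # A word character leaves us in the same state.
        else:
            if ch.isalnum():
                # A word character starts a new word.
                in_word = True
            # Terminators seen outside a word are junk and are ignored.

    if in_word:
        # Assumption: end of input terminates a word still in progress.
        words += 1

    return words, sentences


print(count_words_and_sentences("Help me!!!"))
```

With this sketch "Help me!!!" gives two words and one sentence, because the
second and third exclamation marks arrive in the not-in-a-word state. Note
that it follows the description literally, so a terminator separated from
the last word by a space, as in "hope this works !", is not counted as
ending a sentence; handling that case needs a little more state than is
shown here.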
--
Jeff Carter
"Ada has made you lazy and careless. You can write programs in C that
are just as safe by the simple application of super-human diligence."
E. Robert Tisdale