trailing space in RE
Doru-Catalin Togea
doru-cat at ifi.uio.no
Fri Aug 2 11:25:31 EDT 2002
Hi all!
I have written a little script to parse some Bible text, and to this
purpose I defined the following re:
bibleRef = r'(\w+) (\d+):(\d+) (.+)'
I use it to match Bible references of the kind: 'gen 1:1' or '1Co 10:12',
and the pertaining text.
My re looks for the following:
- (\w+) # first 3 characters, either 3 letters, or 1 digit and 2
letters: 'gen', '1co'
- a space
- (\d+) # a number
- : # a collon
- (\d+) # another number
- (.+) # the rest of the text
Everything works fine, but I have a problem in that "the rest of the
text" allways has a trailing space like this:
"Gen 1:1 In the beginning God created the heavens and the earth. "
"1Co 10:12 Therefore let him who thinks he stands take heed lest he
fall. "
So my question is, how do I match "the rest of the text" but not the last
character (which is a space)?
I guess I could strip, or slice my text before matching it, but I would
like to know how to write the re, as described above.
I appreciate your help.
Best regards,
Catalin
<<<< ================================== >>>>
<< We are what we repeatedly do. >>
<< Excellence, therefore, is not an act >>
<< but a habit. >>
<<<< ================================== >>>>
More information about the Python-list
mailing list