find and replace with regular expressions

chrispoliquin at gmail.com chrispoliquin at gmail.com
Thu Jul 31 22:07:16 CEST 2008


I am using regular expressions to search a string (always full
sentences, maybe more than one sentence) for common abbreviations and
remove the periods.  I need to break the string into different
sentences but split('.') doesn't solve the whole problem because of
possible periods in the middle of a sentence.

So I have...

----------------

import re

middle_abbr = re.compile('[A-Za-z0-9]\.[A-Za-z0-9]\.')

# this will find abbreviations like e.g. or i.e. in the middle of a
sentence.
# then I want to remove the periods.

----------------

I want to keep the ie or eg but just take out the periods.  Any
ideas?  Of course newString = middle_abbr.sub('',txt) where txt is the
string will take out the entire abbreviation with the alphanumeric
characters included.



More information about the Python-list mailing list