Changing every other instance of <B> in a file

Emile van Sebille emile at fenx.com
Tue Mar 27 08:56:26 EST 2001


Here's one way (although I'm interested to see how re might do this):

txt = """Blablabla <B> talktalk <B> blabla blabla balbalblabla
Blablabla <B> talktalk <B> blabla blabla balbalblabla
Blablabla <B> talktalk <B> blabla blabla balbalblabla
Blablabla <B> talktalk <B> blabla blabla balbalblabla
Blablabla <B> talktalk <B> blabla blabla balbalblabla
Blablabla <B> talktalk <B> blabla blabla balbalblabla
Blablabla <B> talktalk <B> blabla blabla balbalblabla
Blablabla <B> talktalk <B> blabla blabla balbalblabla"""

parts = txt.split("<B>")

joiners = ("<B>","</B>")
toggle = 0

result = []

for part in parts:
 result.append(part)
 result.append(joiners[toggle])
 toggle = 1 - toggle

newText = "".join(result[:-1])

print newText

--

Emile van Sebille
emile at fenx.com

---------
"Lars Klæboe" <larskl at klassekampen.no> wrote in message
news:8e41ctkk4l38jmj34070j15iotgv4is2cg at 4ax.com...
> I have quite a few (~70000) quark tagged files I'd like to HTMLize, and
> as my tool of choice has been python for every other script, I thought
> I'd stick to what I know. Or thought I knew.
>
> The original file.txt (quark tagged) :
>
> Blablabla <B> talktalk <B> blabla blabla balbalblabla
>
> The resulting file.html (html)
>
> Blablabla <B> talktalk </B> blabla blabla balbalblabla
>
> As you can tell, every other instance of <B> is to be changed into </B>.
>
> How on earth can I do this as easy as I did in sed ? I'm stuck
> mindwise, locked on a useless solution I'm too embarrased
> to show you ;)
>
> All help is appreciated !
>
> --
> Lars Klæboe
>





More information about the Python-list mailing list