Regex help needed!

Oltmans rolf.oltmans at gmail.com
Mon Dec 21 08:06:08 EST 2009


On Dec 21, 5:05 pm, Umakanth <cum... at gmail.com> wrote:
> How about re.findall(r'\d+(?:\.\d+)?',str)
>
> extracts only numbers from any string....
>

Thank you. However, I only need the digits within the ID attribute of
the DIV. Regex that you suggested fails on the following string

----
lksjdfls <div id ='amazon_345343'> kdjff lsdfs </div> sdjfls <div id
=   "amazon_35343433">sdfsd</div><div id='amazon_8898'>welcome</div>
hello, my age is 86 years old and I was born in 1945. Do you know that
PI is roughly 3.1443534534534534534
----




> ~uk
>
> On Dec 21, 4:38 pm, Oltmans <rolf.oltm... at gmail.com> wrote:
>
> > Hello,. everyone.
>
> > I've a string that looks something like
> > ----
> > lksjdfls <div id ='amazon_345343'> kdjff lsdfs </div> sdjfls <div id
> > =   "amazon_35343433">sdfsd</div><div id='amazon_8898'>welcome</div>
> > ----
>
> > From above string I need the digits within the ID attribute. For
> > example, required output from above string is
> > - 35343433
> > - 345343
> > - 8898
>
> > I've written this regex that's kind of working
> > re.findall("\w+\s*\W+amazon_(\d+)",str)
>
> > but I was just wondering that there might be a better RegEx to do that
> > same thing. Can you kindly suggest a better/improved Regex. Thank you
> > in advance.
>
>




More information about the Python-list mailing list