Problem when fetching page using urllib2.urlopen

jitu nair.jitendra at gmail.com
Mon Aug 10 07:39:03 EDT 2009


Hi,

A html page  contains 'anchor' elements with 'href' attribute  having
a semicolon  in the url , while fetching the page using
urllib2.urlopen, all such href's  containing  'semicolons' are
truncated.


For example the href http://travel.yahoo.com/p-travelguide-6901959-pune_restaurants-i;_ylt=AlWSqpkpqhICp1lMgChtJkCdGWoL
get truncated to http://travel.yahoo.com/p-travelguide-6901959-pune_restaurants-i

The page I am talking about can be fetched from
http://travel.yahoo.com/p-travelguide-485468-pune_india_vacations-i;_ylc=X3oDMTFka28zOGNuBF9TAzI3NjY2NzkEX3MDOTY5NTUzMjUEc2VjA3NzcC1kZXN0BHNsawN0aXRsZQ--

Thanks a Lot
Regards
jitu




More information about the Python-list mailing list