download all mib files from a web page

powah wong_powah at yahoo.ca
Wed May 27 14:06:05 EDT 2009


On May 27, 1:50 pm, Jeff McNeil <j... at jmcneil.net> wrote:
> On May 27, 12:29 pm, powah <wong_po... at yahoo.ca> wrote:
>
>
>
>
>
> > I want to download all mib files from the web page:http://www.juniper.net/techpubs/software/junos/junos94/swconfig-net-m...
>
> > All mib filenames are of this format:www.juniper.net/techpubs... .txt
>
> > I write this program but has the following error.
> > Please help.
> > Thanks.
>
> > [code]
> > #!/usr/bin/env python
> > import urllib2,os,urlparse
> > url="http://www.juniper.net/techpubs/software/junos/junos94/swconfig-
> > net-mgmt/juniper-specific-mibs-junos-nm.html#jN18E19"
> > page=urllib2.urlopen(url)
> > f=0
> > links=[]
> > data=page.read().split("\n")
> > for item in data:
> >     if "www.juniper.net/techpubs" in item:
> >         httpind=item.index("www.juniper.net/techpubs")
> >         item=item[httpind:]
> >         #print "item " + item
> >         ind=item.index("<")
> >         links.append(item[:ind]) #grab all links
> > # download all links
> > for link in links:
> >     print "link " + link
> >     filename=link.split("/")[-1]
> >     print "downloading ... " + filename
> >     u=urllib2.urlopen(link)
> >     p=u.read()
> >     open(filename,"w").write(p)
> > [/code]
>
> > $ ~/python/downloadjuniper.py
> > linkwww.juniper.net/techpubs/software/junos/junos94/swconfig-net-mgmt/mib...
> > downloading ... mib-jnx-user-aaa.txt
> > Traceback (most recent call last):
> >   File "/home/powah/python/downloadjuniper.py", line 20, in ?
> >     u=urllib2.urlopen(link)
> >   File "/usr/lib/python2.4/urllib2.py", line 130, in urlopen
> >     return _opener.open(url, data)
> >   File "/usr/lib/python2.4/urllib2.py", line 350, in open
> >     protocol = req.get_type()
> >   File "/usr/lib/python2.4/urllib2.py", line 233, in get_type
> >     raise ValueError, "unknown url type: %s" % self.__original
> > ValueError: unknown url type:www.juniper.net/techpubs/software/junos/junos94/swconfig-net-mgmt/mib...
>
> > $ python
> > Python 2.4.4 (#1, Oct 23 2006, 13:58:00)
> > [GCC 4.1.1 20061011 (Red Hat 4.1.1-30)] on linux2
> > Type "help", "copyright", "credits" or "license" for more information.
>
> > My computer is FC6 linux.
>
> There's only a couple dozen of them, right-click->Save As. I'm sure
> Juniper would appreciate that much more than an automated crawler.
>
> As far as your ValueError is concerned, consider that
> 'www.juniper.com'doesn't start with a protocol specification when
> passed into urllib2.urlopen.
>
> -Jeff
> mcjeff.blogspot.com

Juniper's web page is simple, I am learning python so as to download
files from more complex web page and do other things as well.



More information about the Python-list mailing list