[Tutor] Using Python to access .txt files stored behind a firewall as .exe files
Alan Gauld
alan.gauld at yahoo.co.uk
Mon May 1 17:44:16 EDT 2017
On 01/05/17 18:20, Ian Monat wrote:
> ... I've written a script using the requests module but I
> think a web scraper like Scrapy, Beautiful Soup or Selinium may be
> required.
I'm not sure what you are looking for. Scrapy, BS etc will
help you read the HTML but not to fetch the file. Also do
you want to process the file (extract the text) in Python
too, or is it enough to just fetch the file?
If the problem is with reading the HTML then you need to
give us more detail about the problem areas and HTML
format.
If the problem is fetching the file, it sounds like you
have already done that and it should be a case of fine
tuning/tidying up the code you've written.
What kind of help exactly are you asking for?
--
Alan G
Author of the Learn to Program web site
http://www.alan-g.me.uk/
http://www.amazon.com/author/alan_gauld
Follow my photo-blog on Flickr at:
http://www.flickr.com/photos/alangauldphotos
More information about the Tutor
mailing list