[CentralOH] 2015-01-23 會議 Scribbles 落書/惡文?: dict:get() web scraping next person fileinput raw strings pythonpy crush matrices
jep200404 at columbus.rr.com
jep200404 at columbus.rr.com
Tue Jan 27 20:20:18 CET 2015
Welcome back Raymond!
Thanks to Pillar and Raymond Chandler for hosting us at The Forge.
They are quite generous again, with food catered by Chef Jeff,
more pizza than we could eat, and drink.
Please show us how to play with C++ from within Ipython Notebook.
Jim Prior presented on dict.get()
http://nbviewer.ipython.org/url/colug.net/python/cohpy/20150126/cohpy-20150126-2-dict.get().ipynb
http://colug.net/python/cohpy/20150126/cohpy-20150126-2-dict.get().ipynb
Jan Milosh presented on real-world messy screen scraping
github.com/janmilosh/web_scraper
github.com/janmilosh/web_scraper/wiki
https://github.com/janmilosh/web_scraper/wiki/3.-Tools-used
selenium
beautiful soup
pickling
http://www.ezselect.com/ (requires $ subscription)
print(soup.prettify())
Beware: Thieves troll github for AWS passwords
don't put secrets on github
put secrets in a separate file that is listed in .gitignore
set thresholds for AWS accounts
http://linux.die.net/man/1/wget
-w seconds
--wait=seconds
--random-wait
emacs robots.txt
chmod -w robots.txt
Jim Prior presented on codin
g for next person that has to work on the code.
http://nbviewer.ipython.org/url/colug.net/python/cohpy/20150126/cohpy-20150126-3-next-eyeballs.ipynb
http://colug.net/python/cohpy/20150126/cohpy-20150126-3-next-eyeballs.ipynb
Eric Floehr presented
https://gist.github.com/efloehr/33cf64dc6c074aac1204
#!/usr/bin/env python
import sys
print sys.platform
import fileinput
# https://docs.python.org/2/library/fileinput.html
for line in fileinput.input(): # works like cat
print line,
for line in fileinput.input(): # works like cat
if line.startswith('>f') and line.strip().endswith('.jpg$'):
print line,
faint memory of some security exploit associated with #!/usr/bin/env
https://duckduckgo.com/html/?q=%22%2Fusr%2Fbin%2Fenv%22+exploit
Jim Prior presented on raw strings
Raw strings are not completely raw.
A \ must be followed by another character.
\uXXXX and \UXXXXXXXX escape sequences are processed in ur'' strings.
http://nbviewer.ipython.org/url/colug.net/python/cohpy/20150126/cohpy-20150126-1-string-constants.ipynb
http://colug.net/python/cohpy/20150126/cohpy-20150126-1-string-constants.ipynb
Jason Green presented on pythonpy
pythonpy rocks
https://github.com/Russell91/pythonpy
Ve don't need no steenkin' awk.
Jason, please post your code/commands examples.
wp:Brazil (1985 film)
wp:Red pill and blue pill
wp:The Matrix
wp:bullet time
scribble time
wp:Portishead (band)
wp:Cesária
at after meeting:
wp:Number One Crush
Romeo & Juliet version great bass
Higg's boson confirmed.
Pb on Pb
LHC double energy again
wp:Sherlock (TV series)
three episodes per season
wp:Benedict Cumberbatch
wp:From the Beginning (song)
Boom Boom - John Lee Hooker
wp:Raven's Progressive Matrices
To get good answers, follow the advice in the links below.
http://catb.org/~esr/faqs/smart-questions.html
http://web.archive.org/web/20090627155454/www.greenend.org.uk/rjk/2000/06/14/quoting.html
wp: prefix means Wikipedia
More information about the CentralOH
mailing list