[CentralOH] 2015-01-23 Meeting Scribbles (graffiti / bad writing?): dict.get() web scraping next person fileinput raw strings pythonpy crush matrices

jep200404 at columbus.rr.com
Tue Jan 27 20:20:18 CET 2015


Welcome back Raymond!

Thanks to Pillar and Raymond Chandler for hosting us at The Forge. 
They are quite generous again, with food catered by Chef Jeff,
more pizza than we could eat, and drink.

Please show us how to play with C++ from within IPython Notebook.

Jim Prior presented on dict.get()

    http://nbviewer.ipython.org/url/colug.net/python/cohpy/20150126/cohpy-20150126-2-dict.get().ipynb
    http://colug.net/python/cohpy/20150126/cohpy-20150126-2-dict.get().ipynb
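
    For readers without the notebook at hand, here is a minimal sketch of
    what dict.get() buys you, in Python 3 syntax. The data and names
    (votes, counts) are made up for illustration and are not taken from
    the presentation.

```python
# Hypothetical data for illustration; not from the presentation notebook.
votes = ["spam", "eggs", "spam", "ham", "spam"]

# Counting with dict.get(): the second argument is returned when the key is
# missing, so there is no KeyError and no explicit "if key in counts" test.
counts = {}
for word in votes:
    counts[word] = counts.get(word, 0) + 1

print(counts)

# Without a default, get() returns None for a missing key.
print(counts.get("bacon"))
# With a default, get() returns the default instead.
print(counts.get("bacon", 0))
```

    Note that get() never adds the missing key to the dict; it only
    supplies a value for that one lookup.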

Jan Milosh presented on real-world messy screen scraping

    github.com/janmilosh/web_scraper
    github.com/janmilosh/web_scraper/wiki
    https://github.com/janmilosh/web_scraper/wiki/3.-Tools-used
        selenium
        beautiful soup
        pickling
    http://www.ezselect.com/ (requires $ subscription)
    print(soup.prettify())
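
    One plausible use of pickling in a scraper is caching results so
    that a slow or rate-limited site is not fetched twice. That use is
    an assumption here, not something confirmed by the talk. The sketch
    below uses only the standard library; load_or_build and fake_scrape
    are hypothetical names.

```python
import os
import pickle
import tempfile


def load_or_build(cache_path, build):
    """Return the object pickled at cache_path, building and saving it if absent."""
    if os.path.exists(cache_path):
        with open(cache_path, "rb") as f:
            return pickle.load(f)
    result = build()
    with open(cache_path, "wb") as f:
        pickle.dump(result, f)
    return result


# Stand-in for a slow scrape; records how many times it was really called.
calls = []


def fake_scrape():
    calls.append(1)
    return [{"name": "example", "price": 1.25}]


cache_dir = tempfile.mkdtemp()
cache_file = os.path.join(cache_dir, "scrape.pickle")

first = load_or_build(cache_file, fake_scrape)   # runs fake_scrape and saves
second = load_or_build(cache_file, fake_scrape)  # loads from the pickle file
print(first == second, len(calls))
```

    Only unpickle files you created yourself: pickle.load() can run
    arbitrary code from a malicious file.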

    Beware: Thieves troll github for AWS passwords
    don't put secrets on github
        put secrets in a separate file that is listed in .gitignore
    set thresholds for AWS accounts
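
    A minimal sketch of the "separate file listed in .gitignore" advice,
    assuming a JSON file named secrets.json. The file name, key names,
    and values are all hypothetical placeholders; the demo writes to a
    temporary directory so it can run anywhere.

```python
import json
import os
import tempfile


def load_secrets(path):
    """Read credentials from a file that is kept out of version control."""
    with open(path) as f:
        return json.load(f)


# Demo setup in a throwaway directory (placeholder values, not real keys).
demo_dir = tempfile.mkdtemp()
secrets_path = os.path.join(demo_dir, "secrets.json")
with open(secrets_path, "w") as f:
    json.dump(
        {"aws_access_key_id": "EXAMPLE-ID",
         "aws_secret_access_key": "EXAMPLE-SECRET"},
        f,
    )

# The matching .gitignore entry keeps git from ever adding the file.
with open(os.path.join(demo_dir, ".gitignore"), "w") as f:
    f.write("secrets.json\n")

secrets = load_secrets(secrets_path)
print(sorted(secrets))
```

    The code that gets committed then only ever mentions the file name,
    never the values.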

    http://linux.die.net/man/1/wget

        -w seconds 
        --wait=seconds
        --random-wait
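
    The same politeness can be sketched in Python. This assumes the
    documented wget behavior that --random-wait varies each pause
    between 0.5 and 1.5 times the --wait value. fetch_all and its
    parameters are hypothetical names, and the fetch function is a
    stand-in so that the example does not touch the network.

```python
import random
import time


def fetch_all(urls, fetch, wait=1.0, sleep=time.sleep):
    """Call fetch(url) for each url, pausing a randomized time between calls."""
    results = []
    for i, url in enumerate(urls):
        if i > 0:
            # Pause between requests, varied from 0.5 to 1.5 times wait.
            sleep(wait * random.uniform(0.5, 1.5))
        results.append(fetch(url))
    return results


# Demo with a fake fetch and a fake sleep that just records the pauses.
pauses = []
pages = fetch_all(
    ["page1", "page2", "page3"],
    fetch=lambda url: url.upper(),
    wait=2.0,
    sleep=pauses.append,
)
print(pages, len(pauses))
```

    Passing sleep as a parameter is only there to make the sketch
    testable; real use would leave it as time.sleep.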

    emacs robots.txt
    chmod -w robots.txt

Jim Prior presented on coding for the next person who has to work on the code.

    http://nbviewer.ipython.org/url/colug.net/python/cohpy/20150126/cohpy-20150126-3-next-eyeballs.ipynb
    http://colug.net/python/cohpy/20150126/cohpy-20150126-3-next-eyeballs.ipynb

Eric Floehr presented on fileinput

    https://gist.github.com/efloehr/33cf64dc6c074aac1204

    #!/usr/bin/env python

    import sys
    print sys.platform

    import fileinput
    # https://docs.python.org/2/library/fileinput.html

    for line in fileinput.input(): # works like cat
        print line,

    for line in fileinput.input(): # like cat, but filtered
        # endswith() compares literally; a regex-style '$' anchor is not needed
        if line.startswith('>f') and line.strip().endswith('.jpg'):
            print line,
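
    A Python 3 sketch of the same filter, assuming the intent is to keep
    lines that start with '>f' and end with '.jpg'. The sample log lines
    and the name matching_lines are made up for illustration. With no
    arguments, fileinput.input() reads the files named in sys.argv[1:],
    or standard input if there are none; here the file list is passed
    explicitly so the example is self-contained.

```python
import fileinput
import os
import tempfile


def matching_lines(paths):
    """Yield lines from the given files that start with '>f' and end with '.jpg'."""
    with fileinput.input(files=paths) as lines:
        for line in lines:
            # rstrip() drops the trailing newline before the suffix test.
            if line.startswith(">f") and line.rstrip().endswith(".jpg"):
                yield line


# Hypothetical sample input written to a temporary file.
demo_dir = tempfile.mkdtemp()
log_path = os.path.join(demo_dir, "transfer.log")
with open(log_path, "w") as f:
    f.write(">f+++++++++ photos/cat.jpg\n")
    f.write(">f+++++++++ notes/todo.txt\n")
    f.write("cd+++++++++ photos/\n")

found = list(matching_lines([log_path]))
for line in found:
    print(line, end="")
```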

    faint memory of some security exploit associated with #!/usr/bin/env
    https://duckduckgo.com/html/?q=%22%2Fusr%2Fbin%2Fenv%22+exploit

Jim Prior presented on raw strings

    Raw strings are not completely raw.
    A \ must be followed by another character.
    \uXXXX and \UXXXXXXXX escape sequences are processed in ur'' strings.

    http://nbviewer.ipython.org/url/colug.net/python/cohpy/20150126/cohpy-20150126-1-string-constants.ipynb
    http://colug.net/python/cohpy/20150126/cohpy-20150126-1-string-constants.ipynb
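
    A small sketch of "not completely raw", in Python 3 syntax and not
    taken from the notebook. The helper name compiles is hypothetical.
    The ur'' prefix from the note above exists only in Python 2;
    Python 3 rejects it, and a plain r'' string there leaves \uXXXX
    alone.

```python
def compiles(source):
    """Return True if source is a valid Python expression."""
    try:
        compile(source, "<demo>", "eval")
        return True
    except SyntaxError:
        return False


# A raw string keeps the backslash: backslash + n, not a newline.
two_chars = r"\n"

# A backslash before a quote keeps the string open,
# and the backslash itself stays in the value.
kept = r"\""

# In Python 3, \u is not processed in a raw string: six characters remain.
not_processed = r"\u0041"

# A raw string cannot end in a single backslash ...
odd_backslash_ok = compiles("r'\\'")      # the source text is r'\'
# ... but it can end in two of them.
even_backslash_ok = compiles("r'\\\\'")   # the source text is r'\\'

# The Python 2 ur'' prefix is a syntax error in Python 3.
ur_prefix_ok = compiles("ur'x'")

print(len(two_chars), len(kept), len(not_processed))
print(odd_backslash_ok, even_backslash_ok, ur_prefix_ok)
```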

Jason Green presented on pythonpy

    pythonpy rocks
    https://github.com/Russell91/pythonpy
    Ve don't need no steenkin' awk.

    Jason, please post your code/commands examples.

wp:Brazil (1985 film)
wp:Red pill and blue pill
wp:The Matrix
wp:bullet time
scribble time
wp:Portishead (band)
wp:Cesária
at the after-meeting:
wp:Number One Crush
    Romeo & Juliet version great bass

Higgs boson confirmed.
Pb on Pb
LHC double energy again

wp:Sherlock (TV series)
    three episodes per season
wp:Benedict Cumberbatch
wp:From the Beginning (song)
Boom Boom - John Lee Hooker

wp:Raven's Progressive Matrices

To get good answers, follow the advice in the links below.
http://catb.org/~esr/faqs/smart-questions.html
http://web.archive.org/web/20090627155454/www.greenend.org.uk/rjk/2000/06/14/quoting.html
wp: prefix means Wikipedia

