[GTALUG] Links from my talk on Web Scraping last night

Myles Braithwaite 👾 me at mylesb.ca
Wed Sep 13 10:00:06 EDT 2017


Jupyter Notebook:
<https://github.com/myles/2017-09-12-gtalug-web-scraping/blob/master/presentation.ipynb>

Articles:

* Remedies for Web Scraping by John Gregory,
<http://www.slaw.ca/2013/10/21/remedies-for-web-scraping/>

* Web Scraping and Crawling Are Perfectly Legal, Right? by Benoit
Bernard,
<https://benbernardblog.com/web-scraping-and-crawling-are-perfectly-legal-right/>

Python Libraries:

* Selenium (good for download data from a JavaScript only website) -
<http://www.seleniumhq.org/projects/webdriver/>

* BeautifulSoup (HTML Parser) -
<https://www.crummy.com/software/BeautifulSoup/>

* Pandas (Data Analysis Library) - <http://pandas.pydata.org/>

* Matplotlib (Fancy Graphs) - <https://matplotlib.org/>

>From Questions:

* Good Python library for easily sending emails -
<https://github.com/lavr/python-emails>

* Stewart's talk on OCR, A Bit More Than Mostly Searchable: Scanned
Paper You Can Find - <https://www.youtube.com/watch?v=EH_txB_hJWw&t=9s>


More information about the talk mailing list