[GTALUG] Links from my talk on Web Scraping last night
Myles Braithwaite 👾
me at mylesb.ca
Wed Sep 13 10:00:06 EDT 2017
Jupyter Notebook:
<https://github.com/myles/2017-09-12-gtalug-web-scraping/blob/master/presentation.ipynb>
Articles:
* Remedies for Web Scraping by John Gregory,
<http://www.slaw.ca/2013/10/21/remedies-for-web-scraping/>
* Web Scraping and Crawling Are Perfectly Legal, Right? by Benoit
Bernard,
<https://benbernardblog.com/web-scraping-and-crawling-are-perfectly-legal-right/>
Python Libraries:
* Selenium (good for download data from a JavaScript only website) -
<http://www.seleniumhq.org/projects/webdriver/>
* BeautifulSoup (HTML Parser) -
<https://www.crummy.com/software/BeautifulSoup/>
* Pandas (Data Analysis Library) - <http://pandas.pydata.org/>
* Matplotlib (Fancy Graphs) - <https://matplotlib.org/>
>From Questions:
* Good Python library for easily sending emails -
<https://github.com/lavr/python-emails>
* Stewart's talk on OCR, A Bit More Than Mostly Searchable: Scanned
Paper You Can Find - <https://www.youtube.com/watch?v=EH_txB_hJWw&t=9s>
More information about the talk
mailing list