My name’s Evan and I’m a recovering physicist. I currently run a web scraping and data processing company called Intoli. I spend most of my days writing code, analyzing data, designing algorithms, and learning about web scraping use cases.
PhD in High Energy Nuclear Physics, 2014
University of California Davis
MS in Physics, 2010
University of California Davis
BA in Physics and Classical Music Composition, 2008
Bard College
A guide to scraping historical snapshots of webpages from the Archive.org Wayback Machine.
An analysis of which stories are removed from the front page of Hacker News due to moderator intervention.
A data analysis of how many deaths the DST transition causes due to tired driving.
A data-driven exploration of how the Hacker News ranking algorithm works.