WebDriver For Journalists: Scraping the Web To Report the Truth

10th October 2023
  • Locked
Phil Wells's profile
Phil Wells

Senior Software Engineer

WebDriver For Journalists: Scraping the Web To Report the Truth image
Talk Description
Did you know that in 2021, a Pulitzer Prize was awarded to a project that had WebDriver code at its core?

The New York Times COVID data-tracking project became the United States' most-watched dashboard for tracking changes in the spread of the pandemic. It worked by aggregating data from municipalities across the nation. These sources ranged in sophistication from sanitized data available for download, to bespoke HTML maps.

In this presentation, we'll discuss the role of WebDriver and other web-crawling technologies in that and other journalistic endeavours. 
We'll review using selectors to find data for journalists, cleaning source data, and the value of agility in deadline-driven workflows. 
We'll also explore how the lessons learned in this line of work are applicable to the practice of software testing and beyond.
What you’ll learn

By the end of this talk, you'll be able to:

  • Reflect about WebDriver as a tool for something other than just application testing
  • Recite the story of the New York Times COVID dashboard; how it works and what it accomplished
  • Consider alternative information-gathering methods for cases when WebDriver is not the best tool in the box
Phil Wells's profile'

Phil Wells

Senior Software Engineer

Phil Wells has been a software quality practitioner for over a decade. Now, Phil is a senior software engineer with Epic Games. His team builds shared services to help developers get their systems tested and delivered safely. Phil likes to go beyond writing tests and building infrastructure for delivery. He also acts as a coach for his peers in web development, teaching and advocating for modern test practices and technologies. People have all sorts of funny ideas about what Phil does every day. Phil does not play Fortnite all day. Phil lives in the hills of New Jersey, USA. He has three little kids who also do not play Fortnite all day.
Suggested Content
Our Dotfuscator uses a multi-tiered defence strategy to protect your apps and the people using them. Discover more about Dotfuscator and its features, including obfuscation, runtime checks, and attack detection to protect your code from security breaches.
Explore MoT
Episode Five: TestBash Special image
A free monthly virtual community gathering
A Software Tester’s Guide To Chrome Devtools
Learn how to dig deeper into the Web with the use of Devtools
This Week in Testing
Debrief the week in Testing via a community radio show hosted by Simon Tomes and members of the community