Collecting and Vetting Public Data for Research

CS4137.01
Course System Home Terms Fall 2023 Collecting and Vetting Public Data for Research

Course Description

Summary

In this course we will go over major methods for collecting and vetting public data to be used in research or computing settings. The course will start by learning about publicly available data sets, then progress through using APIs to call data providers, web-scraping public data, and finally capturing streaming data and converting it into usable datasets. This course will be taught in Python using Jupyter Notebooks. Students will be expected to be fluent in Python or R for data analysis before starting the course and to have undertaken basic coursework in statistics. This course will be especially helpful for students who are preparing STEM or Social Science plan projects that require data for analysis. It may also be of interest to CS students looking to learn web-scraping and how to capture streaming data.

Prerequisites

Students who have at least one class each in statistics or data science and data visualization or mapping should contact faculty directly at michaelcorey@bennington.edu.

Please contact the faculty member : michaelcorey@bennington.edu

Instructor

  • Michael Corey

Day and Time

Academic Term

Fall 2023

Credits

4

Course Level

4000

Maximum Enrollment

16