In reverse-order, the list of things we've learned (or will soon be learning). In addition to the articles/tutorials listed below, here are a couple of overall references:
Homework: Wednesday, March 11 - Collecting Dallas Officer-Involved Shootings
Collect and parse the Dallas Police Department's officer-involved shooting data and make an interactive map. |
Extra Credit: Tuesday, March 10 - The Celebrity (Tw)It List
Finding out who the most-followed users follow on Twitter. |
Homework: Tuesday, February 24 - Draft proposal of a final project
Use your computational methods to solve a computational problem of your own choosing. |
Extra Credit: Friday, February 20 - Build face-grep in Python
Taking the Unix philosophy to Python and computer vision object-detection algorithms. |
The overview to a four-part homework assignment in looking up and comparing lobbying and U.S. Congressional activity. |
Text formatting and templates and loops |
How HTML works, and some of its terminology
|
A quick introduction to a few of the useful endpoints of the Spotify music service. |
Homework: Friday, February 13 - Analyzing Tweets in CSV form
Connect to the Twitter API, download a user's tweets as CSV, and count frequency of hashtags and words. |
An exercise in using the Instagram API to get your own media collection. |
A walkthrough of basic API usage, JSON parsing, and ImageMagick magic to create some fun static montages. |
Extra Credit: Tuesday, March 10 - 404-Finder
Write a program to auto-detect broken links |
Some dev-ops steps needed to get Twitter-related tools onto our Farmshare accounts
|
Homework: Friday, February 6 - Collecting and analyzing job listings from the USAJobs.gov API
Ask what you can do for your country, and what your country can pay you. |
Using curl to fetch metadata about a webpage's existence
|
Homework: Tuesday, February 3 - Using baby names to classify names by gender
Use the SSA baby name data to make a naive filter for guessing the gender of a name. |
Homework: Friday, January 30 - Basic if-else practice
Practice the logic of if-elif-else conditional branching |
How to write a program that can branch into more than one path of execution.
|
Homework: Tuesday, January 27 - Exploring Congressional Twitter data as JSON
Basic JSON parsing exercise using what Congress tweets. |
JSON is a lightweight format that is nearly ubiquitous for data-exchange. jq is a command-line tool for parsing JSON.
|
Extra Credit: Tuesday, February 10 - Firsts in American baby-naming
Even more practice with text filters, this time to find when baby names first became known. |
Extra Credit: Tuesday, January 27 - More analysis of trends in American baby-naming
More practice with text filters to find interesting trends in the SSA baby name data. |
Homework: Friday, January 30 - Death Row rows parsing
Collect and aggregate data from three different states' death row listings. |
A quick exercise in HTML/CSS selectors and dirty data
|
Extra Credit: Tuesday, February 17 - Listing the BuzzFeed listicles
Practicing web-scraping and regexes on BuzzFeed listicle titles |
Using the pup tool to more sanely extract data from HTML files
|
Homework: Friday, January 16 - Managing baby names and data projects with Github
A sampler project that demonstrates how your code and data should be organized for minimal head-smashing. |
|
How to design and package your code so that it can be re-used in future scenarios.
|
Homework: Friday, January 16 - Basic word analysis of the White House Press Briefings
After collecting the list of WH Briefings, it's time to get each briefing. |
Homework: Thursday, January 22 - Parsing the White House Press Briefings as HTML
Data analysis of all the words used in the White House press briefings |
How to wrap your code into a file and run it from the command-line
|
How to multitask from the command-line interface
|
How to save programs as files and execute them with bash
|
How to loop, aka designing a program to do repetitive work for you
|
A syntax for describing patterns of text, i.e. Steroids for Grep
|
The fastest way to search text from the command-line
|
Homework: Wednesday, January 14 - Collecting the White House Press Briefings
The first step in analyzing web data is to just collect the webpages. |
How to download files straight from the command-line interface
|
Using variables to refer to data, including the results of a command.
|
Thinking of programs as independent filters working on streams of data
|
One of Unix's most fundamental features: how programs can be joined together through the use of pipes and redirections.
|
An overview of how commands are interpreted and executed by the shell.
|
Homework: Wednesday, January 7 - Setup Prep
Setting up our programming toolbox and environment. |
An overview of how Bash interprets text, both literally and symbolically.
|
A quick overview of the Bash command-line interpreter and the mechanics of its prompt
|