Skip to content

DIRECT Software Engineering for Data Scientists, Homework 2

Notifications You must be signed in to change notification settings

UWDIRECT/SEDS-HW2

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

SEDS-HW2

Homework #2 from SEDS

Due Jan. 25, 2017 at 5PM

Assignment

  1. Create a new Jupyter notebook called 'SEDS-HW2.ipynb' and finish the in class excersize of creating a block of code to download the HCEPDB data, unzip it and load it into a data frame in Python (not using %% bash magic!) so that it works for a single file.
  2. Run the code you put together for the three files below. Use Markdown between the set of cells to offset and explain your work.
URL filename csv_filename
http://faculty.washington.edu/dacb/HCEPDB_moldata_set1.zip HCEPDB_moldata_set1.zip HCEPDB_moldata_set1.csv
http://faculty.washington.edu/dacb/HCEPDB_moldata_set2.zip HCEPDB_moldata_set2.zip HCEPDB_moldata_set2.csv
http://faculty.washington.edu/dacb/HCEPDB_moldata_set3.zip HCEPDB_moldata_set3.zip HCEPDB_moldata_set3.csv
  1. In a separate set of cells, use lists, tuples, dictionaries, for loops, if statements, and whatever other tasty Python bits you want to run your code on the three files so that you don't replicate code.
  2. Use the Markdown in the notebook to describe how using these features of Python made your code shorter, more easy to read, and easier to maintain.
  3. Pick one of the data transformations you used in homework 1 (or that you find interesting) and add it to your code in #3 above.

Grading rubric

Dave will fill this section in

About

DIRECT Software Engineering for Data Scientists, Homework 2

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published