Homework #2 from SEDS
- Create a new Jupyter notebook called 'SEDS-HW2.ipynb' and finish the in-class exercise: write a block of code that downloads the HCEPDB data, unzips it, and loads it into a data frame in Python (not using the %%bash magic!) so that it works for a single file.
- Run the code you put together for the three files below. Use Markdown cells between the code cells to set off and explain your work.
| URL | filename | csv_filename |
|---|---|---|
| http://faculty.washington.edu/dacb/HCEPDB_moldata_set1.zip | HCEPDB_moldata_set1.zip | HCEPDB_moldata_set1.csv |
| http://faculty.washington.edu/dacb/HCEPDB_moldata_set2.zip | HCEPDB_moldata_set2.zip | HCEPDB_moldata_set2.csv |
| http://faculty.washington.edu/dacb/HCEPDB_moldata_set3.zip | HCEPDB_moldata_set3.zip | HCEPDB_moldata_set3.csv |
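
For the single-file case, one possible shape is sketched below. It is not the expected solution, only an illustration under some assumptions: the function names `fetch_and_unzip` and `load_frame` are hypothetical, the CSV is assumed to sit at the top level of the archive under the name given in the table, and pandas is assumed to be the data frame library.

```python
import os
import urllib.request
import zipfile


def fetch_and_unzip(url, zip_name, csv_name, workdir="."):
    """Download `url` to `zip_name` and extract `csv_name`; return the CSV path.

    Hypothetical helper. Files that already exist in `workdir` are reused,
    so rerunning a notebook cell does not download the data again.
    """
    zip_path = os.path.join(workdir, zip_name)
    csv_path = os.path.join(workdir, csv_name)
    if not os.path.exists(zip_path):
        urllib.request.urlretrieve(url, zip_path)
    if not os.path.exists(csv_path):
        # Assumption: the archive holds `csv_name` at its top level.
        with zipfile.ZipFile(zip_path) as archive:
            archive.extract(csv_name, path=workdir)
    return csv_path


def load_frame(csv_path):
    """Read the extracted CSV into a pandas DataFrame (pandas is assumed)."""
    import pandas as pd  # imported here so the helper above needs only the stdlib
    return pd.read_csv(csv_path)
```

A call for the first row of the table would look like `load_frame(fetch_and_unzip("http://faculty.washington.edu/dacb/HCEPDB_moldata_set1.zip", "HCEPDB_moldata_set1.zip", "HCEPDB_moldata_set1.csv"))`.
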
- In a separate set of cells, use lists, tuples, dictionaries, for loops, if statements, and whatever other tasty Python bits you want to run your code on the three files so that you don't replicate code.
- Use the Markdown in the notebook to describe how using these features of Python made your code shorter, easier to read, and easier to maintain.
- Pick one of the data transformations you used in homework 1 (or one that you find interesting) and add it to your code from step 3 above (the version that loops over the three files).
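
As a rough illustration of how lists, tuples, dictionaries, a for loop, and an if statement can remove the repetition, the sketch below builds the three table rows from a pattern and applies one loader to each. The names `BASE_URL`, `DATASETS`, `load_all`, and the `load_one` argument are hypothetical; `load_one` stands for whatever single-file function you wrote yourself.

```python
BASE_URL = "http://faculty.washington.edu/dacb/"

# One (url, zip filename, csv filename) tuple per row of the table above.
DATASETS = [
    (
        BASE_URL + f"HCEPDB_moldata_set{i}.zip",
        f"HCEPDB_moldata_set{i}.zip",
        f"HCEPDB_moldata_set{i}.csv",
    )
    for i in (1, 2, 3)
]


def load_all(datasets, load_one):
    """Call load_one(url, zip_name, csv_name) for each tuple in `datasets`.

    Returns a dictionary keyed by CSV filename, so each result can be
    looked up by name later in the notebook.
    """
    results = {}
    for url, zip_name, csv_name in datasets:
        if csv_name in results:
            # Skip a row that was listed twice by mistake.
            continue
        results[csv_name] = load_one(url, zip_name, csv_name)
    return results
```

Adding a fourth file then means changing one tuple of indices rather than copying a block of cells, which is the kind of maintainability gain the Markdown discussion can point to.
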
Dave will fill this section in