Just wanted to point you to some similar functionality we have in statsmodels that just pulls from the Rdatasets repo. https://github.com/statsmodels/statsmodels/blob/master/statsmodels/datasets/utils.py#L246