This project downloads and extracts Divvy trip data CSV files using Python. It's fully containerized using Docker for consistent, reproducible execution.
If you havenβt already:
git clone https://github.com/saccini/data-downloader.gitUse Docker Compose:
docker-compose up --buildThis will:
- Build the Docker image
- Run main.py inside the container
- Download and extract CSVs into downloads/ folder
Extracted .csv files will appear in the downloads/ folder (ignored by Git).
π³ Project Structure
data-downloader/
βββ main.py # Main Python script
βββ requirements.txt # Python dependencies
βββ Dockerfile # Docker build instructions
βββ docker-compose.yml # Docker Compose setup
βββ .gitignore # Excludes downloads and more
βββ .dockerignore # Optimizes Docker build
βββ downloads/ # Downloaded CSVs (auto-generated)
βββ README.md # You're here!
π¦ Dependencies
All Python dependencies are listed in requirements.txt. These are installed inside the Docker container automatically.
π§Ό Clean Up
To stop the container:
docker-compose down
To delete downloaded files (optional):
rm -rf downloads/
π‘ Notes
Invalid or broken URLs are handled gracefully with error messages. Make sure Docker and Docker Compose are installed on your system.