raulk commented Jan 22, 2026

Summary

  • Add scripts/copy_data_to_dist.py that copies parquet files for rendered dates to site/dist/data/
  • Add /data/ page listing available downloads by date with file sizes and row counts
  • Update justfile with copy-data recipe included in publish
  • Update sync workflow to run copy-data before R2 upload
  • Add "Data downloads" link to sidebar navigation (desktop and mobile)
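As a rough illustration of what the /data/ listing needs, the sketch below walks a published data directory and collects file names and sizes per date, newest first. The function names `list_downloads` and `human_size` and the `{date}/{query}.parquet` directory layout are assumptions made for this example, not code from the PR. Row counts are left out here because they require a Parquet reader (for instance, the footer metadata exposed by `pyarrow.parquet.read_metadata`).

```python
from pathlib import Path


def human_size(num_bytes: int) -> str:
    """Format a byte count for display, e.g. 2048 -> '2.0 KB'."""
    size = float(num_bytes)
    for unit in ("B", "KB", "MB"):
        if size < 1024:
            return f"{size:.0f} {unit}" if unit == "B" else f"{size:.1f} {unit}"
        size /= 1024
    return f"{size:.1f} GB"


def list_downloads(data_root: Path) -> dict[str, list[tuple[str, int]]]:
    """Map each date directory to its (file name, size in bytes) pairs.

    Assumes a hypothetical layout of data_root/{date}/{query}.parquet.
    Dates are returned newest first; ISO dates sort correctly as strings.
    """
    listing: dict[str, list[tuple[str, int]]] = {}
    if not data_root.is_dir():
        return listing
    for date_dir in sorted(data_root.iterdir(), reverse=True):
        if not date_dir.is_dir():
            continue
        files = [(p.name, p.stat().st_size) for p in sorted(date_dir.glob("*.parquet"))]
        if files:
            listing[date_dir.name] = files
    return listing
```

Calling `list_downloads(Path("site/dist/data"))` after the copy step would give the page template everything it needs except row counts.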

URL structure

  • Data index: /data/
  • Individual files: /data/{date}/{query}.parquet

Example: /data/2026-01-20/blobs_per_slot.parquet
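For reference, the URL scheme above can be expressed as a small helper. This is only a sketch: `data_file_url` and its `base` parameter are hypothetical names introduced here, and the PR does not say whether such a helper exists.

```python
from datetime import date


def data_file_url(day: date, query: str, base: str = "") -> str:
    """Build the download path /data/{date}/{query}.parquet.

    `base` is an optional site prefix (hypothetical); a trailing slash
    on it is dropped so the result never contains a double slash.
    """
    return f"{base.rstrip('/')}/data/{day.isoformat()}/{query}.parquet"


print(data_file_url(date(2026, 1, 20), "blobs_per_slot"))
# prints /data/2026-01-20/blobs_per_slot.parquet
```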

Add infrastructure to publish parquet data files alongside the site:

- Add copy_data_to_dist.py script that copies parquet files for
  rendered dates from notebooks/data/ to site/dist/data/
- Add /data/ page listing available downloads by date with file
  sizes and row counts
- Update justfile with copy-data recipe and include in publish
- Update sync workflow to run copy-data before R2 upload
- Add "Data downloads" link to sidebar navigation

Files will be accessible at /data/{date}/{query}.parquet URLs.
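
The copy step described above might look roughly like the following. It is a minimal sketch, not the contents of copy_data_to_dist.py: the function name `copy_data`, the `rendered_dates` argument, and the assumption that source files live under `notebooks/data/{date}/` are all illustrative.

```python
import shutil
from pathlib import Path


def copy_data(
    rendered_dates: list[str],
    src_root: Path = Path("notebooks/data"),
    dst_root: Path = Path("site/dist/data"),
) -> int:
    """Copy parquet files for the given dates and return how many were copied.

    Assumes a hypothetical source layout of src_root/{date}/{query}.parquet.
    Dates with no source directory are skipped rather than treated as errors.
    """
    copied = 0
    for day in rendered_dates:
        src_dir = src_root / day
        if not src_dir.is_dir():
            continue
        dst_dir = dst_root / day
        dst_dir.mkdir(parents=True, exist_ok=True)
        for src in sorted(src_dir.glob("*.parquet")):
            # copy2 preserves modification times along with the contents
            shutil.copy2(src, dst_dir / src.name)
            copied += 1
    return copied
```

Restricting the copy to rendered dates keeps the published /data/ tree in step with the pages that actually exist on the site.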

raulk commented Jan 22, 2026

@misilva73 here's the PR to track for the parquet files listing.

@github-actions

Preview Deployment

🚀 Preview is ready!

URL: https://observatory-staging.ethp2p.dev/pr-51/

Updated: 2026-01-22T15:21:57.862Z

@raulk raulk merged commit 6cdf107 into main Jan 22, 2026
1 check passed

2 participants