
Conversation

@mitchhs12 (Contributor) commented Jan 7, 2026

Summary

  • Added GET /datasets/{namespace}/{name}/versions/{revision}/sync-progress endpoint.
  • Returns per-table sync progress including current_block, start_block, job_status, and file stats.
  • Uses TableSnapshot::synced_range() with canonical_chain logic to accurately report sync progress, handling gaps and reorgs.

Tests

  • Endpoint returns correct structure for valid dataset
  • Returns 404 for non-existent dataset
  • Verifies RUNNING status while job is actively syncing
  • Verifies COMPLETED status when end block is reached
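
A minimal sketch of what the 404 test might look like, assuming an axum Router exercised with tower's oneshot helper and a hypothetical build_app() constructor; the repo's actual test harness may differ:

```rust
use axum::{
    body::Body,
    http::{Request, StatusCode},
};
use tower::ServiceExt; // provides `oneshot`

#[tokio::test]
async fn returns_404_for_missing_dataset() {
    // `build_app()` is a placeholder for however the test suite constructs the API router.
    let app = build_app().await;

    let response = app
        .oneshot(
            Request::builder()
                .uri("/datasets/ethereum/does-not-exist/versions/0.0.0/sync-progress")
                .body(Body::empty())
                .unwrap(),
        )
        .await
        .unwrap();

    assert_eq!(response.status(), StatusCode::NOT_FOUND);
}
```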

Response format:

{
  "dataset_namespace": "ethereum",
  "dataset_name": "mainnet",
  "revision": "0.0.0",
  "manifest_hash": "2dbf16e8a4d1c526e3893341d1945040d51ea1b68d1c420e402be59b0646fcfa",
  "tables": [
    {
      "table_name": "blocks",
      "current_block": 950000,
      "start_block": 0,
      "job_id": 1,
      "job_status": "RUNNING",
      "files_count": 47,
      "total_size_bytes": 2147483648
    }
  ]
}
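
A rough sketch of Rust types and route wiring that would serialize to the shape above, assuming axum and serde. Everything beyond the field names shown in the JSON (the handler skeleton, the router setup, the placeholder values) is an assumption; the real handler would populate tables from TableSnapshot::synced_range() over the canonical chain.

```rust
use axum::{extract::Path, routing::get, Json, Router};
use serde::Serialize;

#[derive(Serialize)]
struct SyncProgressResponse {
    dataset_namespace: String,
    dataset_name: String,
    revision: String,
    manifest_hash: String,
    tables: Vec<TableSyncProgress>,
}

#[derive(Serialize)]
struct TableSyncProgress {
    table_name: String,
    current_block: u64,
    start_block: u64,
    job_id: u64,
    job_status: String, // e.g. "RUNNING" or "COMPLETED"
    files_count: u64,
    total_size_bytes: u64,
}

// Skeleton only: the real handler resolves the dataset revision, walks its tables,
// and derives current_block from TableSnapshot::synced_range().
async fn get_sync_progress(
    Path((namespace, name, revision)): Path<(String, String, String)>,
) -> Json<SyncProgressResponse> {
    Json(SyncProgressResponse {
        dataset_namespace: namespace,
        dataset_name: name,
        revision,
        manifest_hash: String::new(), // placeholder
        tables: Vec::new(),           // placeholder
    })
}

fn routes() -> Router {
    // `{param}` captures assume axum 0.8-style path syntax.
    Router::new().route(
        "/datasets/{namespace}/{name}/versions/{revision}/sync-progress",
        get(get_sync_progress),
    )
}
```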

@mitchhs12 requested review from LNSD and leoyvens January 7, 2026 23:00
@mitchhs12 self-assigned this Jan 7, 2026
@mitchhs12 linked an issue Jan 7, 2026 that may be closed by this pull request
Contributor

Do we want to do this to get the sync progress? I find this ad hoc and not integrated with the Amp "data lake" (or a data store).

@mitchhs12 (Contributor, Author) Jan 8, 2026

Not sure. This approach does couple us quite tightly, though...

The problem with a query-based approach is that the query itself might take a while to execute. For example, SELECT MAX(block_number) FROM eth_rpc.logs will be pretty slow if there are a lot of files.

Do you think it would be better to update the worker to save its progress whenever it finishes writing a file? Then we could just query the new column (i.e., update the jobs table in the database).
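
For reference, that alternative might look roughly like the sketch below, assuming sqlx with Postgres and a hypothetical last_synced_block column on the jobs table; neither is part of this PR.

```rust
use sqlx::PgPool;

// Hypothetical hook the worker could call after flushing each file, so that reading
// progress becomes a cheap lookup on `jobs` instead of a MAX() scan over many files.
async fn record_progress(pool: &PgPool, job_id: i64, last_block: i64) -> sqlx::Result<()> {
    sqlx::query("UPDATE jobs SET last_synced_block = $1 WHERE id = $2")
        .bind(last_block)
        .bind(job_id)
        .execute(pool)
        .await?;
    Ok(())
}
```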

…p + reorg handling via canonical_chain logic

Development

Successfully merging this pull request may close these issues.

Job progress reporting

3 participants