Refactor database migration to use SYSTEM RESTORE DATABASE REPLICA #443

otselnik · 2026-02-06T10:28:22Z

Summary

Refactored database migration to use the native SYSTEM RESTORE DATABASE REPLICA command (ClickHouse >= 25.8) instead of manual ZooKeeper management.

Key Changes

migration.py: New DatabaseMigrator class with simplified migration flow and version validation
database_replica.py: Added restore_replica_with_system_command() as the preferred method; legacy code moved to ZookeeperDatabaseManager (deprecated)
zookeeper.py: New ZKTransactionBuilder class for atomic ZooKeeper operations
database_group.py: Updated CLI to use system command with automatic fallback
Tests: Updated version requirements to 25.8, removed manual restart steps

Benefits

Simpler, more reliable code using native ClickHouse functionality
Backward compatible with fallback for older versions
Better error handling and maintainability

TODO

Replace supervisorctl restart clickhouse-server with proper restart functionality once merged

Summary by Sourcery

Refactor ClickHouse database migration and replica restore flows to use the native SYSTEM RESTORE DATABASE REPLICA command when available, with a structured ZooKeeper-based fallback and updated helpers.

New Features:

Introduce a DatabaseMigrator abstraction to handle migrations between Atomic and Replicated database engines.
Add support for restoring database replicas via SYSTEM RESTORE DATABASE REPLICA with automatic version detection and fallback to ZooKeeper-based recovery.
Provide configuration-driven ClickHouse server restart and wait commands for migrations that require a restart.
Support restoring replicas for databases whose table names contain special characters via proper ZooKeeper name escaping.

Bug Fixes:

Ensure table metadata paths and names are correctly escaped in ZooKeeper so replicas for tables with special characters can be restored reliably.
Improve error reporting for double migrations by surfacing a clear message when replica ZooKeeper nodes already exist with unexpected values.

Enhancements:

Encapsulate ZooKeeper transaction handling in a reusable ZKTransactionBuilder context manager with validation and error handling.
Move database existence checks and detach/attach operations into dedicated helpers to simplify database management logic.
Simplify the database restore-replica CLI command by delegating restore logic to the system command or a single fallback path and removing redundant detach/attach sequences.

Tests:

Update migration and restore-replica feature tests to target ClickHouse 25.8+, cover unsupported-version behavior, remove manual restart and detach/attach steps, and add scenarios for tables with special characters in names.

sourcery-ai · 2026-02-06T10:28:32Z

Reviewer's Guide

Refactors database migration and replica restore logic to prioritize ClickHouse’s native SYSTEM RESTORE DATABASE REPLICA (CH >= 25.8), encapsulate migration behavior in a DatabaseMigrator, centralize ZooKeeper transaction handling, and update CLI/tests for the new flow with a ZooKeeper-based fallback for older versions.

Sequence diagram for restore_replica_command with SYSTEM RESTORE and fallback

sequenceDiagram
    actor Operator
    participant CLI as restore_replica_command
    participant DBMeta as parse_database_metadata
    participant DBUtil as DatabaseModule
    participant DBReplica as DatabaseReplicaModule
    participant ZKMgr as ZookeeperDatabaseManager
    participant ZK as ZooKeeper

    Operator->>CLI: invoke restore_replica_command(database)
    CLI->>DBUtil: is_database_exists(ctx, database)
    DBUtil-->>CLI: exists or not
    CLI->>DBMeta: parse_database_metadata(database)
    DBMeta-->>CLI: db_metadata (engine, zookeeper_path)
    CLI->>CLI: validate engine is_replicated

    CLI->>DBReplica: supports_system_restore_database_replica(ctx)
    DBReplica-->>CLI: bool
    alt CH >= 25.8 and supported
        CLI->>DBReplica: system_restore_database_replica(ctx, database)
        DBReplica->>DBReplica: execute SYSTEM RESTORE DATABASE REPLICA
        DBReplica-->>CLI: success or exception
        alt restore failed
            CLI->>CLI: log error and raise
        else restore ok
            CLI-->>Operator: completed
        end
    else CH < 25.8 (fallback)
        CLI->>DBReplica: _restore_replica_fallback(ctx, database, zookeeper_path)
        DBReplica->>ZKMgr: __init__(ctx)
        DBReplica->>DBReplica: check_database_exists_in_zk(ctx, database, zookeeper_path)
        DBReplica-->>CLI: first_replica flag
        alt first replica
            DBReplica->>ZKMgr: create_database_structure(database, zookeeper_path)
            ZKMgr->>ZK: create database structure nodes
            ZK-->>ZKMgr: ok or NodeExistsError
            alt NodeExistsError
                ZKMgr-->>DBReplica: signal concurrent creation
                DBReplica->>DBReplica: treat as non-first replica
            end
        end
        DBReplica->>ZKMgr: create_replica_nodes(database, first_replica, zookeeper_path)
        ZKMgr->>ZK: transaction with ZKTransactionBuilder
        ZK-->>ZKMgr: committed
        ZKMgr-->>DBReplica: success
        DBReplica-->>CLI: fallback restore complete

        CLI->>DBUtil: detach_database(ctx, database)
        DBUtil-->>CLI: detached
        CLI->>DBUtil: attach_database(ctx, database)
        DBUtil-->>CLI: attached
        CLI-->>Operator: completed via fallback
    end
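
The branching in the sequence diagram above can be sketched roughly as follows. This is only an illustration, not the PR's code: `parse_version`, `supports_system_restore`, and `restore_replica` are hypothetical names, the restore, detach, and attach steps are passed in as plain callables instead of real ClickHouse or ZooKeeper calls, and the 25.8 threshold is taken from the PR description.

```python
from typing import Callable, Tuple

# Minimum ClickHouse version with SYSTEM RESTORE DATABASE REPLICA, per the PR description.
MIN_SYSTEM_RESTORE_VERSION = (25, 8)


def parse_version(version: str) -> Tuple[int, int]:
    """Extract (major, minor) from a version string such as '25.8.1.1234'."""
    major, minor = version.split(".")[:2]
    return int(major), int(minor)


def supports_system_restore(version: str) -> bool:
    """Hypothetical stand-in for supports_system_restore_database_replica(ctx)."""
    return parse_version(version) >= MIN_SYSTEM_RESTORE_VERSION


def restore_replica(
    version: str,
    database: str,
    system_restore: Callable[[str], None],
    zk_fallback: Callable[[str], None],
    detach: Callable[[str], None],
    attach: Callable[[str], None],
) -> str:
    """Pick the restore path the way the diagram does and report which one ran."""
    if supports_system_restore(version):
        # Native path: any exception propagates to the caller (fail fast).
        system_restore(database)
        return "system"
    # Legacy path: rebuild the ZooKeeper nodes, then detach and re-attach the database.
    zk_fallback(database)
    detach(database)
    attach(database)
    return "fallback"
```

Passing the steps in as callables keeps the version gate testable without a running server; the real command works with a click `Context` instead.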

Class diagram for database migration and replica management refactor

classDiagram
    class AttacherContext {
        -ctx: Context
        -database: str
        +__init__(ctx: Context, database: str)
        +__enter__() None
        +__exit__(exc_type: Optional[type], exc_value: Optional[Exception], traceback: Optional[Any]) bool
    }

    class DatabaseMigrator {
        -ctx: Context
        +__init__(ctx: Context)
        +migrate_to_atomic(database: str, clean_zookeeper: bool) None
        +migrate_to_replicated(database: str) None
        -_validate_version_support() None
        -_validate_database_engine(database: str) Any
        -_check_tables_consistent(database_name: str, local_tables: list~TableInfo~) None
        -_compare_table_schemas(table_name: str, local_metadata: str, zk_metadata: str) bool
        -_sync_table_uuids(tables_info: list~TableInfo~) bool
    }

    class ZookeeperDatabaseManager {
        -ctx: Context
        +__init__(ctx: Context)
        +create_database_structure(database_name: str, db_replica_path: Optional[str]) None
        +create_replica_nodes(database_name: str, first_replica: bool, db_replica_path: Optional[str]) None
        -_generate_counter(zk: KazooClient, db_zk_path: str) str
        -_create_first_replica_name_node(builder: ZKTransactionBuilder, prefix_db_zk_path: str, database_name: str) None
        -_create_query_log_entry(builder: ZKTransactionBuilder, prefix_db_zk_path: str, counter: str) None
        -_create_replica_registration(builder: ZKTransactionBuilder, database_name: str, prefix_db_zk_path: str) None
        -_create_table_metadata_nodes(builder: ZKTransactionBuilder, database_name: str, prefix_db_zk_path: str) None
        -_get_host_id(database_name: str, replica: str) str
        -_get_server_uuid() str
        -_get_shard_and_replica() Tuple~str, str~
        -_get_default_db_path(database_name: str) str
    }

    class ZKTransactionBuilder {
        -ctx: Context
        -zk: KazooClient
        -txn
        -path_to_nodes: List~str~
        -_committed: bool
        +__init__(ctx: Context, zk: KazooClient)
        +__enter__() ZKTransactionBuilder
        +__exit__(exc_type: Any, exc_val: Any, exc_tb: Any) None
        +create_node(path: str, value: str) ZKTransactionBuilder
        +delete_node(path: str) ZKTransactionBuilder
        +commit() None
        +reset() None
        +_check_result_txn(results: List, no_throw: bool) bool
    }

    class DatabaseModule {
        +list_databases(ctx: Context, database: Optional[str], like: Optional[str], engine: Optional[str], with_stats: bool, with_active_parts: bool, skip_temporary: bool, format_: Optional[str]) Any
        +is_database_exists(ctx: Context, database_name: str) bool
        +detach_database(ctx: Context, database: str) None
        +attach_database(ctx: Context, database: str) None
    }

    class DatabaseReplicaModule {
        +system_database_drop_replica(ctx: Context, database_zk_path: str, replica: str, dry_run: bool) None
        +supports_system_restore_database_replica(ctx: Context) bool
        +system_restore_database_replica(ctx: Context, database_name: str) None
        +check_database_exists_in_zk(ctx: Context, database_name: str, db_replica_path: Optional[str]) bool
        +get_default_table_in_db_path(database_name: str, table_name: str) str
        +get_tables_metadata(ctx: Context, database_name: str) dict~str, str~
        +_restore_replica_fallback(ctx: Context, database_name: str, db_replica_path: Optional[str]) None
    }

    AttacherContext --> DatabaseModule : uses detach_database
    AttacherContext --> DatabaseModule : uses attach_database

    DatabaseMigrator --> AttacherContext : manages detach_attach
    DatabaseMigrator --> DatabaseReplicaModule : uses check_database_exists_in_zk
    DatabaseMigrator --> DatabaseReplicaModule : uses get_default_table_in_db_path
    DatabaseMigrator --> DatabaseReplicaModule : uses get_tables_metadata
    DatabaseMigrator --> DatabaseReplicaModule : uses supports_system_restore_database_replica
    DatabaseMigrator --> DatabaseReplicaModule : uses system_restore_database_replica

    DatabaseReplicaModule --> ZookeeperDatabaseManager : uses in _restore_replica_fallback
    ZookeeperDatabaseManager --> ZKTransactionBuilder : builds_transactions
    ZookeeperDatabaseManager --> DatabaseModule : uses is_database_exists via queries

Flow diagram for DatabaseMigrator.migrate_to_replicated

flowchart TD
    A["Start migrate_to_replicated(database)"] --> B[Validate ClickHouse version with supports_system_restore_database_replica]
    B -->|version < 25.8| C[Raise RuntimeError: unsupported version]
    B -->|version >= 25.8| D[Validate database engine is Atomic]
    D -->|not Atomic| E[Raise RuntimeError: only Atomic supported]
    D -->|Atomic| F[Check ZooKeeper for existing database<br/>first_replica = not check_database_exists_in_zk]

    F -->|first_replica true| G[Log migrating as first replica]
    F -->|first_replica false| H[Log migrating as non-first replica]

    G --> I[Detach database]
    H --> I

    I -->|first_replica false| J[List tables in database]
    J --> K[Check tables consistent with ZooKeeper metadata]
    K --> L{UUID sync needed?}
    L -->|no| N[Set engine to Replicated in metadata]
    L -->|yes| M[Sync table UUIDs via ZooKeeper metadata]
    M --> O[need_restart = True]

    G --> N

    N --> P{need_restart?}
    P -->|yes| Q[Restart ClickHouse via restart_command]
    P -->|no| R[Attach database]

    Q --> S[Run SYSTEM RESTORE DATABASE REPLICA]
    R --> S
    S --> T[Log success and finish]

File-Level Changes

Change	Details	Files
Refactor migration flow into DatabaseMigrator and switch to detach/attach helpers and system-based replica restore with version validation and automatic UUID sync.	Introduce DatabaseMigrator class to handle Atomic↔Replicated migrations with engine checks and version gating; Replace inline DETACH/ATTACH SQL with detach_database/attach_database helpers and restart via restart_command when UUIDs change; Use check_database_exists_in_zk and ZooKeeper table metadata to decide first vs non-first replica behavior and to validate schemas; Add table-UUID synchronization against ZooKeeper metadata and call system_restore_database_replica after changing engine	`ch_tools/chadmin/internal/migration.py`
Add SYSTEM RESTORE DATABASE REPLICA support and encapsulate legacy ZooKeeper replica management in ZookeeperDatabaseManager with reusable helpers.	Add supports_system_restore_database_replica and system_restore_database_replica helpers that wrap the native SYSTEM commands; Introduce ZookeeperDatabaseManager to create database ZK structure, register replicas, and write table metadata using ZKTransactionBuilder; Provide helpers for checking DB existence in ZK and reading table metadata, including support for escaped table names and special characters; Implement _restore_replica_fallback to restore replicas via ZooKeeper when SYSTEM RESTORE DATABASE REPLICA is unavailable	`ch_tools/chadmin/internal/database_replica.py`
Extract generic ZooKeeper transaction handling into a reusable ZKTransactionBuilder with context-manager support and stricter validation.	Create ZKTransactionBuilder in zookeeper.py with create_node/delete_node/commit/reset operations and path tracking; Implement context manager semantics to ensure transactions are reset after use; Enhance transaction result validation to optionally log all failing operations and prevent reuse after commit	`ch_tools/chadmin/internal/zookeeper.py`
Extend database utilities with reusable existence and detach/attach helpers used by migration and CLI flows.	Add is_database_exists helper that queries system.databases via templated SQL; Add detach_database and attach_database helpers that wrap DETACH/ATTACH DATABASE statements with proper quoting; Document module intent for database management utilities	`ch_tools/chadmin/internal/database.py`
Update CLI restore-replica command to prefer SYSTEM RESTORE DATABASE REPLICA with ZooKeeper fallback and clean detach/attach behavior.	Add short docstring for restore_replica_command describing SYSTEM RESTORE behavior; Use is_database_exists and DatabaseMetadata checks before restore; Call system_restore_database_replica when supported and fail fast on error, otherwise invoke _restore_replica_fallback; Detach and re-attach the database after legacy restore to ensure proper synchronization	`ch_tools/chadmin/cli/database_group.py`
Adjust migration and restore tests for new version requirements, behavior, and restart-free flow, adding coverage for special table names and unsupported versions.	Bump migration scenarios from @require_version_24.8 to @require_version_25.8 and add @require_version_less_than_25.8 negative case for unsupported migration; Inline chadmin restart/ping configuration in the background to replace explicit supervisorctl restart steps, and remove manual sleeps; Remove explicit DETACH/ATTACH steps in restore-replica tests now handled internally by the new flow; Add a restore-replica scenario covering tables with special characters in names and verifying data replication on both replicas	`tests/features/database_migrate.feature` `tests/features/database_restore_replica.feature`

- Implement __enter__ and __exit__ for 'with' statement support
- Add method chaining (return self from create_node/delete_node)
- Add transaction state protection with _committed flag
- Add reset() method for builder reuse
- Improve documentation with usage examples

BREAKING CHANGE: None (fully backward compatible)

Update all ZKTransactionBuilder usage to use context manager pattern:
- database_replica.py: create_database_structure(), create_replica_nodes()
- migration.py: _migrate_non_first_replica()

Improves resource management and code readability.

…REPLICA
- Add support for SYSTEM RESTORE DATABASE REPLICA command (CH >= 25.8)
- Implement version detection and automatic fallback for older versions
- Simplify migration flow by using built-in restore command
- Refactor restore_replica_command to use new restoration methods
- Add _restore_replica_fallback for legacy ZooKeeper-based restoration
- Update migrate_to_replicated to use SYSTEM RESTORE DATABASE REPLICA
- Remove manual ZooKeeper transaction handling from migration flow
- Improve error handling and logging throughout restoration process
- Clean up docstrings and remove redundant code

…d version check test
- Update @require_version from 24.8 to 25.8 for all migration scenarios
- Add new scenario to test migration on unsupported version (< 25.8)
- Remove unnecessary clickhouse-server restart commands from tests
- Simplify test scenarios by removing manual restart steps

- Simplify database_replica.py logic (220 -> reduced lines)
- Refactor migration.py implementation (81 -> reduced lines)
- Clean up zookeeper.py utilities (35 -> reduced lines)
- Update database_group.py CLI commands (11 changes)

Total: -81 lines of code

…and comprehensive documentation
- Add creation of 'first_replica_database_name' ZK node in create_database_structure() to match ClickHouse implementation (DatabaseReplicated.cpp:689-692)
- Add escape_for_zookeeper() for table names in metadata nodes to properly handle special characters (matches ClickHouse escapeForFileName logic)
- Add comprehensive ZooKeeper structure documentation to ZookeeperDatabaseManager class describing all nodes, their purpose, and replica registration process
- Add detailed docstrings to all key methods explaining ZK node creation and structure
- Improve inline comments for better code maintainability

All changes verified against ClickHouse source code and tested:
- Lint: passed (176 files)
- Unit tests: 120 passed
- Integration tests: 4 scenarios, 97 steps passed

sourcery-ai

Hey - I've found 3 issues, and left some high-level feedback:

In DatabaseMigrator._sync_table_uuids you build ZK paths with get_default_table_in_db_path(database_name, table_name), but ZookeeperDatabaseManager._create_table_metadata_nodes now stores metadata under an escaped table name, so UUID sync will fail for tables with special characters; consider using the same escaping logic (or a shared helper) when reading and writing table metadata paths.
The system_restore_database_replica helper constructs SYSTEM RESTORE DATABASE REPLICA {database_name} without quoting the database identifier, which may break for database names with special characters; consider wrapping the name in backticks or using parameterized / macro-based substitution similar to other queries.
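
One way to address the quoting concern is sketched below. `quote_identifier` and `build_restore_query` are hypothetical helpers, not functions from this PR, and the escaping rule (backslash-escape backslashes and backticks inside a backtick-quoted identifier) is an assumption about ClickHouse's identifier syntax that should be checked against its documentation before use.

```python
def quote_identifier(name: str) -> str:
    """Wrap an identifier in backticks, backslash-escaping backslashes and backticks.

    Assumption: ClickHouse accepts backslash escapes inside backtick-quoted identifiers.
    """
    escaped = name.replace("\\", "\\\\").replace("`", "\\`")
    return f"`{escaped}`"


def build_restore_query(database_name: str) -> str:
    """Build the restore statement with a quoted database identifier."""
    return f"SYSTEM RESTORE DATABASE REPLICA {quote_identifier(database_name)}"
```

For a database named `my-db`, `build_restore_query` produces a statement whose identifier is wrapped in backticks, so the dash is no longer parsed as an operator.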

## Individual Comments

### Comment 1
<location> `ch_tools/chadmin/internal/database_replica.py:60-69` </location>
<code_context>
+def get_default_table_in_db_path(database_name: str, table_name: str) -> str:
</code_context>

<issue_to_address>
**issue (bug_risk):** Table name escaping for ZooKeeper metadata is inconsistent between readers and writers.

`_create_table_metadata_nodes` escapes `table_name` via `escape_for_zookeeper(table_name)`, but `get_default_table_in_db_path` uses the raw `table_name` when building the path used by `_sync_table_uuids`. For tables whose raw and escaped names differ, this will cause `get_zk_node` lookups to fail. Align these call sites so both read and write paths use the same escaping semantics for the ZK node name.
</issue_to_address>
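
A shared helper along the following lines would keep readers and writers on the same node name. This is a sketch under assumptions: the escaping rule (keep ASCII letters, digits, and underscore; encode every other byte as `%XX` with uppercase hex) is what ClickHouse's `escapeForFileName` is understood to do, `unescape_from_zookeeper` and `table_metadata_zk_path` are hypothetical names, and the root and metadata subpath are passed in as parameters because their real values are not shown here.

```python
from urllib.parse import unquote

# Bytes that pass through unchanged (assumed to match ClickHouse's word characters).
_SAFE_BYTES = frozenset(
    b"abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789_"
)


def escape_for_zookeeper(name: str) -> str:
    """Percent-encode every byte that is not an ASCII letter, digit, or underscore."""
    parts = []
    for byte in name.encode("utf-8"):
        if byte in _SAFE_BYTES:
            parts.append(chr(byte))
        else:
            parts.append(f"%{byte:02X}")
    return "".join(parts)


def unescape_from_zookeeper(escaped: str) -> str:
    """Reverse escape_for_zookeeper by decoding %XX sequences as UTF-8."""
    return unquote(escaped)


def table_metadata_zk_path(
    zk_root: str, database_name: str, metadata_subpath: str, table_name: str
) -> str:
    """Build the metadata node path, always escaping the table name."""
    return f"{zk_root}/{database_name}/{metadata_subpath}/{escape_for_zookeeper(table_name)}"
```

If both `_create_table_metadata_nodes` and `_sync_table_uuids` went through one such path builder, a table named `my-table.v1` would resolve to the same node on write and on read.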

### Comment 2
<location> `ch_tools/chadmin/internal/database_replica.py:64-73` </location>
<code_context>
+

-from ch_tools.chadmin.internal.utils import execute_query
+def get_tables_metadata(ctx: Context, database_name: str) -> dict[str, str]:
+    """Retrieve table metadata from ZooKeeper, returns dict mapping table names to CREATE statements."""
+    zk_tables_metadata: dict[str, str] = {}
+
+    with zk_client(ctx) as zk:
+        zk_metadata_path = f"{DEFAULT_ZK_ROOT}/{database_name}/{ZK_METADATA_SUBPATH}"
+        children = zk.get_children(zk_metadata_path)
+        if not children:
+            return zk_tables_metadata
+
+        for table_name in children:
+            try:
+                table_metadata_path = f"{zk_metadata_path}/{table_name}"
+                metadata_data = zk.get(table_metadata_path)
+                if metadata_data and metadata_data[0]:
+                    zk_tables_metadata[table_name] = metadata_data[0].decode().strip()
+            except NoNodeError:
+                logging.warning(f"Table {table_name} metadata removed concurrently")
+
</code_context>

<issue_to_address>
**suggestion:** Handling of missing/empty ZooKeeper metadata nodes may hide partial failures.

If `get_children` returns tables but individual `get` calls fail or return empty data, those tables are dropped from the result and only appear as `missing_in_zk`, losing the signal that their ZK nodes existed but were unreadable. It would help to distinguish “no children” from “children with missing/empty metadata” by at least logging the affected table name and path when `metadata_data` is missing/empty, so partial ZK corruption is easier to diagnose.

Suggested implementation:

```python
import logging
from typing import Optional, Tuple

from click import Context
from kazoo.client import KazooClient
from kazoo.exceptions import NodeExistsError, NoNodeError

```

```python
def get_default_table_in_db_path(database_name: str, table_name: str) -> str:
    return f"{DEFAULT_ZK_ROOT}/{database_name}/{ZK_METADATA_SUBPATH}/{table_name}"


def get_tables_metadata(ctx: Context, database_name: str) -> dict[str, str]:
    """Retrieve table metadata from ZooKeeper, returns dict mapping table names to CREATE statements."""
    zk_tables_metadata: dict[str, str] = {}

    with zk_client(ctx) as zk:
        zk_metadata_path = f"{DEFAULT_ZK_ROOT}/{database_name}/{ZK_METADATA_SUBPATH}"
        children = zk.get_children(zk_metadata_path)
        if not children:
            return zk_tables_metadata

        for table_name in children:
            table_metadata_path = f"{zk_metadata_path}/{table_name}"
            try:
                metadata_data = zk.get(table_metadata_path)
                if metadata_data and metadata_data[0]:
                    zk_tables_metadata[table_name] = metadata_data[0].decode().strip()
                else:
                    # Node exists but is empty or unreadable; log for easier diagnosis of partial failures.
                    logging.warning(
                        "Empty or missing ZooKeeper metadata for table %s at path %s",
                        table_name,
                        table_metadata_path,
                    )
            except NoNodeError:
                logging.warning(
                    "ZooKeeper metadata node for table %s was removed concurrently at path %s",
                    table_name,
                    table_metadata_path,
                )

    return zk_tables_metadata

```
</issue_to_address>

### Comment 3
<location> `ch_tools/chadmin/internal/zookeeper.py:37-46` </location>
<code_context>
         return False


-class ZKTransactionBuilder:
-    ctx: Context
-    txn: TransactionRequest
</code_context>

<issue_to_address>
**suggestion (bug_risk):** Context manager behavior for ZKTransactionBuilder may surprise callers who expect automatic commit/rollback.

`__enter__` currently returns the builder and `__exit__` always calls `reset()`, so `with ZKTransactionBuilder(...) as b:` will drop all operations unless `b.commit()` is explicitly called inside the block. This is easy to misuse given the context manager interface. Consider either committing automatically on successful exit (and resetting afterward), or clearly documenting this pattern and/or warning when exiting with uncommitted operations.
</issue_to_address>
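
The auto-commit option could look roughly like the sketch below. `AutoCommitTransaction` is a hypothetical class, not the PR's `ZKTransactionBuilder`; it only assumes a client whose `transaction()` returns an object with `create`, `delete`, and `commit` methods, as kazoo's `KazooClient.transaction()` does. `FakeZK` and `FakeTransaction` are in-memory stand-ins so the example runs without a ZooKeeper server.

```python
from typing import Any, Dict, List, Optional, Tuple


class AutoCommitTransaction:
    """Hypothetical builder that commits on clean exit and discards on error."""

    def __init__(self, zk: Any) -> None:
        self._zk = zk
        self._txn = zk.transaction()
        self._committed = False

    def __enter__(self) -> "AutoCommitTransaction":
        return self

    def __exit__(self, exc_type: Any, exc_val: Any, exc_tb: Any) -> None:
        try:
            # Commit only if the block did not raise and the caller
            # has not already committed explicitly.
            if exc_type is None and not self._committed:
                self.commit()
        finally:
            # Always start from a fresh transaction afterwards.
            self.reset()

    def create_node(self, path: str, value: str) -> "AutoCommitTransaction":
        self._ensure_open()
        self._txn.create(path, value.encode())
        return self

    def delete_node(self, path: str) -> "AutoCommitTransaction":
        self._ensure_open()
        self._txn.delete(path)
        return self

    def commit(self) -> None:
        self._ensure_open()
        results = self._txn.commit()
        self._committed = True
        # kazoo reports per-operation failures as exception instances in the results.
        failures = [r for r in results if isinstance(r, Exception)]
        if failures:
            raise RuntimeError(f"ZooKeeper transaction failed: {failures!r}")

    def reset(self) -> None:
        self._txn = self._zk.transaction()
        self._committed = False

    def _ensure_open(self) -> None:
        if self._committed:
            raise RuntimeError("Transaction already committed; call reset() first")


class FakeTransaction:
    """In-memory stand-in for a ZooKeeper transaction (illustration only)."""

    def __init__(self, store: Dict[str, bytes]) -> None:
        self._store = store
        self._ops: List[Tuple[str, str, Optional[bytes]]] = []

    def create(self, path: str, value: bytes = b"") -> None:
        self._ops.append(("create", path, value))

    def delete(self, path: str) -> None:
        self._ops.append(("delete", path, None))

    def commit(self) -> List[Any]:
        for op, path, value in self._ops:
            if op == "create":
                self._store[path] = value or b""
            else:
                self._store.pop(path, None)
        return [True] * len(self._ops)


class FakeZK:
    """In-memory stand-in for a ZooKeeper client (illustration only)."""

    def __init__(self) -> None:
        self.store: Dict[str, bytes] = {}

    def transaction(self) -> FakeTransaction:
        return FakeTransaction(self.store)
```

With this shape, `with AutoCommitTransaction(zk) as txn: txn.create_node("/a", "1")` applies the operation when the block exits normally, while an exception inside the block leaves ZooKeeper untouched.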

…ilder behavior, and database name quoting
- Add escape/unescape functions for ZK table names matching ClickHouse escapeForFileName logic
- Improve logging for missing/empty ZK metadata nodes
- Make ZKTransactionBuilder auto-commit on successful context exit
- Add backticks to database name in SYSTEM RESTORE DATABASE REPLICA query

otselnik added 13 commits February 10, 2026 10:37

refactoring

3f1b2db

fix(database_replica): remove incorrect @staticmethod decorator

84fb268

Method get_default_table_in_db_path is called via self.zk_manager, so it cannot be a static method.

fix(database): prevent SQL injection in database queries

9d511d1

Replace f-string interpolation with Jinja2 template parameters in:
- is_database_exists()
- create_table_metadata_nodes()
- _get_host_id()

BREAKING CHANGE: None

docs(database): add module docstrings

99e2a67

Add comprehensive module-level documentation for:
- zookeeper.py
- database_replica.py
- database.py

test: simplify error message assertion in database migrate test

038841a

Replace detailed ZooKeeper error trace with single meaningful error message about unexpected digest value in replica node

Add TODO for replacing server restart command with proper functionality

fc24c74

Add test for table name escaping in database replica restoration

9cdda97

Test validates escape_for_zookeeper() for tables with special characters (dash, dots, percent). Ensures compatibility with ClickHouse escapeForFileName() logic.

otselnik force-pushed the refactoring_new_hope branch from 532a79e to 9cdda97 Compare February 10, 2026 07:37

otselnik added 3 commits February 10, 2026 11:21

Replace hardcoded server restart with restart_command function

90cfa6d

Removed TODO and replaced execute() call with ctx.invoke(restart_command) in migration.py

Integrate detach/attach into database restore-replica command

eacd24f

Automatically perform DETACH/ATTACH after restore. Extract reusable detach_database() and attach_database() functions to database.py.

Fix restore-replica fallback logic and remove unused import

e69b5a0

Remove unused ZookeeperDatabaseManager and fix early return in restore-replica command

otselnik marked this pull request as ready for review February 10, 2026 10:32

sourcery-ai bot reviewed Feb 10, 2026

otselnik requested review from aalexfvk and kirillgarbar February 10, 2026 15:03
