fix problems with the new versions by KFilippopolitis · Pull Request #49 · madgik/mipdb

KFilippopolitis · 2025-10-30T12:45:41Z

No description provided.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2025-10-30T12:53:12Z

mipdb/usecases.py

+    def _validate_datasets(self, csv_path, cdes, copy_from_file):
        csv_columns = pd.read_csv(csv_path, nrows=0).columns.tolist()
        if DATASET_COLUMN_NAME not in csv_columns:
            raise InvalidDatasetError(
                "The 'dataset' column is required to exist in the csv."
            )
-        metadata_table = MetadataTable.from_db(data_model, self.sqlite_db)
-        cdes = metadata_table.table
+
+        if self.monetdb is None:
+            LOGGER.debug("MonetDB disabled – skipping primary data table creation.")
+            df = pd.read_csv(csv_path, usecols=["dataset"])
+            unique_datasets = df["dataset"].unique().tolist()
+            return unique_datasets


Skipping dataset schema validation when MonetDB is disabled

When ValidateDataset._validate_datasets sees that self.monetdb is None, it now returns immediately after reading only the dataset column. This short-circuit means that every other validation (type checks, min/max boundaries, and enumeration checks other than the dataset name) is skipped whenever the CLI is run with --no-monetdb. Previously the same method always invoked validate_csv/validate_csv_with_volume, so invalid rows were rejected before import. After this change, any CSV with wrong types or values is accepted silently in SQLite-only mode, allowing bad data into the system. The MonetDB dependency is only needed for the COPY-FROM-FILE branch; the pure Pandas validation should still run without MonetDB. Consider removing the early return and running the same validation logic regardless of MonetDB availability.
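One way to picture the suggested shape is the minimal sketch below. It is not mipdb's actual code: `validate_rows`, the `cdes` dictionary layout, and the local `InvalidDatasetError` class are hypothetical stand-ins, and it uses the standard-library `csv` module instead of Pandas so that it runs on its own. The point it illustrates is only that the row-level checks run before any branch that depends on MonetDB.

```python
import csv
import io

DATASET_COLUMN_NAME = "dataset"


class InvalidDatasetError(Exception):
    """Local stand-in for the project's error type (assumption)."""


def validate_rows(rows, cdes):
    """Backend-independent checks: type, min/max and enumerations.

    `cdes` is a hypothetical mapping of column name to a dict with the
    optional keys 'type' ('int' | 'real' | 'text'), 'min', 'max' and
    'enumerations'. 'min'/'max' are assumed to be set on numeric columns only.
    """
    casts = {"int": int, "real": float, "text": str}
    for line_no, row in enumerate(rows, start=2):  # the header is line 1
        for column, spec in cdes.items():
            raw = row.get(column)
            if raw is None or raw == "":
                continue  # empty cells are treated as missing values
            type_name = spec.get("type", "text")
            try:
                value = casts[type_name](raw)
            except ValueError:
                raise InvalidDatasetError(
                    f"line {line_no}: {column}={raw!r} is not of type {type_name}"
                ) from None
            if "min" in spec and value < spec["min"]:
                raise InvalidDatasetError(f"line {line_no}: {column} is below min")
            if "max" in spec and value > spec["max"]:
                raise InvalidDatasetError(f"line {line_no}: {column} is above max")
            if "enumerations" in spec and raw not in spec["enumerations"]:
                raise InvalidDatasetError(
                    f"line {line_no}: {column}={raw!r} is not an allowed value"
                )


def validate_datasets(csv_text, cdes, monetdb=None):
    """Validate the CSV and return its unique dataset names in file order."""
    reader = csv.DictReader(io.StringIO(csv_text))
    if DATASET_COLUMN_NAME not in (reader.fieldnames or []):
        raise InvalidDatasetError(
            "The 'dataset' column is required to exist in the csv."
        )
    rows = list(reader)

    # Runs whether or not MonetDB is configured: no early return before it.
    validate_rows(rows, cdes)

    if monetdb is not None:
        pass  # MonetDB-only work (for example a copy-from-file path) goes here

    # dict.fromkeys keeps first-seen order while dropping duplicates.
    return list(dict.fromkeys(row[DATASET_COLUMN_NAME] for row in rows))
```

With this layout, calling `validate_datasets(csv_text, cdes, monetdb=None)` on a file with a non-numeric value in an integer column raises `InvalidDatasetError` instead of returning the dataset names, which is the behavior the review asks to preserve in SQLite-only mode.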


fix problems with the new versions

482a6ab

KFilippopolitis merged commit f8b00c3 into main Oct 30, 2025
3 checks passed

KFilippopolitis deleted the monetdb-toggle branch October 30, 2025 12:45

chatgpt-codex-connector bot reviewed Oct 30, 2025

KFilippopolitis merged 1 commit into main from monetdb-toggle
