
Feature/ensembl ref db#18

Open
jweiser wants to merge 18 commits into main from feature/ensembl-ref-db


@jweiser
Member

@jweiser jweiser commented Feb 11, 2026

This PR updates Species.json and the inference code so it no longer creates reference databases for EnsEMBL.

Instead, it pulls the EnsEMBL reference databases that already exist in the MySQL release database (derived from gk_central). Three EnsEMBL reference databases are needed for inference: main, protist, and fungi.

It also pulls each "alternative" reference database from the release database if it already exists, and creates it only if it does not.
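
The lookup behaviour described above could be sketched roughly as follows. This is not the code in this PR: the class, method, and database names are hypothetical, and an in-memory map stands in for the MySQL release database. It only illustrates the two rules, namely that EnsEMBL reference databases must already exist, while "alternative" ones are reused if present and created otherwise.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: the map below stands in for the release database.
final class RefDbFetchOrCreateSketch {

    /** Minimal stand-in for a reference database instance. */
    static final class RefDb {
        final String name;
        final boolean preexisting; // true if it was already in the release database

        RefDb(String name, boolean preexisting) {
            this.name = name;
            this.preexisting = preexisting;
        }
    }

    private final Map<String, RefDb> releaseDb = new LinkedHashMap<>();

    RefDbFetchOrCreateSketch(String... existingNames) {
        for (String name : existingNames) {
            releaseDb.put(name, new RefDb(name, true));
        }
    }

    /** EnsEMBL reference databases are never created here; a missing one is an error. */
    RefDb fetchEnsembl(String name) {
        RefDb db = releaseDb.get(name);
        if (db == null) {
            throw new IllegalStateException(
                "EnsEMBL reference database '" + name + "' not found in the release database");
        }
        return db;
    }

    /** "Alternative" reference databases are reused if present, created only if missing. */
    RefDb fetchOrCreateAlternative(String name) {
        return releaseDb.computeIfAbsent(name, n -> new RefDb(n, false));
    }

    public static void main(String[] args) {
        // "main", "protist", "fungi" follow the PR description; "example-alt" is made up.
        RefDbFetchOrCreateSketch sketch = new RefDbFetchOrCreateSketch("main", "protist", "fungi");
        System.out.println(sketch.fetchEnsembl("fungi").preexisting);
        System.out.println(sketch.fetchOrCreateAlternative("example-alt").preexisting);
    }
}
```

Failing fast on a missing EnsEMBL database, rather than silently creating one, matches the intent stated above that inference should no longer create these instances.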

@jweiser jweiser self-assigned this Feb 11, 2026
@jweiser jweiser added the enhancement New feature or request label Feb 11, 2026
@adamjohnwright adamjohnwright self-requested a review February 11, 2026 20:32
adamjohnwright previously approved these changes Feb 11, 2026
Contributor

@adamjohnwright adamjohnwright left a comment

"The new code adds imports for FileNotFoundException and JSONParser/ParseException. I'd want to verify that failures in fetchSchema or
fetchOrCreateEnsemblDbInstance (e.g., missing refdb.json, malformed JSON, DB query returning unexpected results) produce clear error messages rather than opaque stack traces." (Claude)

I agree that, where possible, we should add proper error messages. If this is easy to do, please do.
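
One possible shape for this, as a rough sketch only: wrap the file read and the parse step separately so that each failure names the file and the cause. Everything here is an assumption for illustration; `ConfigErrorSketch`, `Parser`, and `load` are hypothetical names, and the `Parser` hook stands in for whatever JSON parser the real code uses.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.NoSuchFileException;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical sketch: not the PR's fetchSchema or fetchOrCreateEnsemblDbInstance.
final class ConfigErrorSketch {

    /** Hypothetical parser hook; stands in for a real JSON parser. */
    interface Parser<T> {
        T parse(String text) throws Exception;
    }

    /** Reads and parses a config file, reporting each failure mode with context. */
    static <T> T load(Path path, Parser<T> parser) {
        String text;
        try {
            text = Files.readString(path);
        } catch (NoSuchFileException e) {
            throw new IllegalStateException(
                "Config file not found: " + path.toAbsolutePath(), e);
        } catch (IOException e) {
            throw new IllegalStateException(
                "Could not read config file " + path.toAbsolutePath() + ": " + e.getMessage(), e);
        }
        try {
            return parser.parse(text);
        } catch (Exception e) {
            throw new IllegalStateException(
                "Config file " + path.toAbsolutePath() + " could not be parsed: " + e.getMessage(), e);
        }
    }

    public static void main(String[] args) {
        try {
            // The identity "parser" is a placeholder; a real caller would parse JSON here.
            load(Paths.get("refdb.json"), text -> text);
        } catch (IllegalStateException e) {
            System.err.println(e.getMessage());
        }
    }
}
```

Keeping the original exception as the cause preserves the stack trace for debugging while the top-level message stays readable.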

@jweiser jweiser requested a review from adamjohnwright March 17, 2026 18:02

Labels

enhancement New feature or request


2 participants