Hi
I recently read your work on the Enzyme Engineering Database (EnzEngDB) and found the platform's ability to consolidate sequence-function relationships for non-natural chemistries highly impressive.
I am particularly interested in the reaction datasets described in your results:
The manually curated gold-standard dataset containing 635 reactions.
The automated extraction dataset containing 1,078 reactions pulled via the LLM pipeline.
As I am working on a related project in computational enzyme design, I was wondering if these specific reaction datasets (in SMILES or CSV format) are available for direct download, or if you could provide a link to the specific repository containing these curated records?
Thank you for your contribution to the field and for providing such a valuable resource.