Skip to content

AIChemEco 47k amide coupling conditions (#228)#229

Merged
skearnes merged 4 commits intomainfrom
#228
Jul 30, 2025
Merged

AIChemEco 47k amide coupling conditions (#228)#229
skearnes merged 4 commits intomainfrom
#228

Conversation

@bdeadman
Copy link
Copy Markdown
Collaborator

This is the 47k amide coupling conditions dataset from AIChemEco and Guangzhou National Laboratory which I have been pre-reviewing with @pengj28.

This is the 47k amide coupling conditions dataset from AIChemEco and Guangzhou National Laboratory which I have been pre-reviewing with the authors.
@bdeadman bdeadman marked this pull request as draft July 25, 2025 08:38
@bdeadman bdeadman self-assigned this Jul 25, 2025
@github-actions
Copy link
Copy Markdown

Change summary:

Filename Added Removed Changed
AIChemEco_dataset.pb.gz 0 0 0
0 0 0

@bdeadman bdeadman closed this Jul 25, 2025
@bdeadman bdeadman reopened this Jul 25, 2025
@bdeadman
Copy link
Copy Markdown
Collaborator Author

Reaction count by Github Actions incorrect. This happens sometimes so may not be a real problem. I've closed and reopened the PR to trigger the checks again.

@github-actions
Copy link
Copy Markdown

Change summary:

Filename Added Removed Changed
AIChemEco_dataset.pb.gz 0 0 0
0 0 0

@bdeadman bdeadman closed this Jul 25, 2025
@bdeadman bdeadman reopened this Jul 25, 2025
@github-actions
Copy link
Copy Markdown

Change summary:

Filename Added Removed Changed
data/47/ord_dataset-47eaacc46c3a4487bbdf99adb1a15e41.pb.gz 47015 0 0
47015 0 0

@bdeadman
Copy link
Copy Markdown
Collaborator Author

submission support files.zip
Includes the following files:

  • csv file (semicolon separated) of reaction data
  • conditions dictionary in json (merged into data table)
  • template reaction in pbtxt format
  • Jupyter notebook which includes sections for:
    • mapping the conditions dictionary onto the data table from csv file
    • checking the chemical naming and smiles assignment for the reagents
    • enumeration of the template over the data table, with allowance for blank reagents
    • validation and some checks of the dataset

@bdeadman bdeadman marked this pull request as ready for review July 25, 2025 21:56
@bdeadman bdeadman requested review from connorcoley and skearnes July 25, 2025 21:57
@bdeadman
Copy link
Copy Markdown
Collaborator Author

@connorcoley @skearnes this is the 47k amide coupling conditions dataset which we have recently discussed by email. Their student has prepared the template and data table with some early review support from me. This should be ready to go but let me know if you want an additional review conducted.

@skearnes skearnes closed this Jul 30, 2025
@skearnes skearnes reopened this Jul 30, 2025
@github-actions
Copy link
Copy Markdown

Change summary:

Filename Added Removed Changed
data/47/ord_dataset-47eaacc46c3a4487bbdf99adb1a15e41.pb.gz 47015 0 0
47015 0 0

@skearnes skearnes merged commit 9685114 into main Jul 30, 2025
4 checks passed
@skearnes skearnes deleted the #228 branch July 30, 2025 18:06
@ctapobep
Copy link
Copy Markdown

ctapobep commented Jul 31, 2025

I loaded the dataset to Meve, but the ORD page doesn't open. Has the data not reached the ORD web app?

PS: also noticed that the Scientists attribute is filled with the company name. Not sure if it's important though.

@skearnes
Copy link
Copy Markdown
Member

I loaded the dataset to Meve, but the ORD page doesn't open. Has the data not reached the ORD web app?

PS: also noticed that the Scientists attribute is filled with the company name. Not sure if it's important though.

I'm updating the web app today; I'll post here when it's ready.

@skearnes
Copy link
Copy Markdown
Member

https://open-reaction-database.org/dataset/ord_dataset-47eaacc46c3a4487bbdf99adb1a15e41

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants