Example showing how to update a dataset in ord-data #758

Draft
bdeadman wants to merge 7 commits into main from example-update

Conversation

@bdeadman
Collaborator

No description provided.

@bdeadman
Collaborator Author

This new example shows how I updated the Golden dataset (open-reaction-database/ord-data#214) after we were notified that some reactant and product SMILES strings were incorrect. The example documents the process of:

  1. loading a dataset and the replacement data from a .csv file
  2. looping over the reactions in the dataset and updating the required fields, including adding a new record_modified entry to each changed reaction
  3. comparing the old and new datasets to review the changes
  4. preparing the file for upload to ord-data
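
For readers who want the shape of steps 1 to 3 without opening the notebook, here is a minimal sketch. It is not the code in this PR: plain dictionaries stand in for the ord-schema protobuf messages, and the column names, reaction IDs, SMILES strings, and the `update_dataset` and `diff_datasets` helpers are all hypothetical.

```python
import copy
import csv
import io
from datetime import datetime, timezone

# Hypothetical replacement data; a real update would read this from a .csv file.
REPLACEMENTS_CSV = """reaction_id,field,new_smiles
rxn-1,product,CCO
rxn-2,reactant,c1ccccc1Br
"""

# Stand-in for a loaded dataset (the real one is a protobuf Dataset message).
dataset = {
    "name": "example",
    "reactions": [
        {"reaction_id": "rxn-1", "reactant": "CC=O", "product": "CCC", "record_modified": []},
        {"reaction_id": "rxn-2", "reactant": "c1ccccc1Cl", "product": "c1ccccc1O", "record_modified": []},
        {"reaction_id": "rxn-3", "reactant": "CCN", "product": "CCNC", "record_modified": []},
    ],
}

FIELDS = ("reactant", "product")


def load_replacements(text):
    """Step 1: map (reaction_id, field) to the corrected SMILES string."""
    rows = csv.DictReader(io.StringIO(text))
    return {(row["reaction_id"], row["field"]): row["new_smiles"] for row in rows}


def update_dataset(old, replacements, person, details):
    """Step 2: return an updated copy; the original is kept for comparison."""
    new = copy.deepcopy(old)
    for reaction in new["reactions"]:
        changed = False
        for field in FIELDS:
            smiles = replacements.get((reaction["reaction_id"], field))
            if smiles is not None and smiles != reaction[field]:
                reaction[field] = smiles
                changed = True
        if changed:
            # Record who changed the reaction, when, and why.
            reaction["record_modified"].append({
                "time": datetime.now(timezone.utc).isoformat(),
                "person": person,
                "details": details,
            })
    return new


def diff_datasets(old, new):
    """Step 3: list (reaction_id, field, old_value, new_value) for each change."""
    changes = []
    for before, after in zip(old["reactions"], new["reactions"]):
        for field in FIELDS:
            if before[field] != after[field]:
                changes.append((before["reaction_id"], field, before[field], after[field]))
    return changes


replacements = load_replacements(REPLACEMENTS_CSV)
updated = update_dataset(dataset, replacements, "example user", "Corrected SMILES")
changes = diff_datasets(dataset, updated)
for change in changes:
    print(change)
```

Working on a deep copy keeps the original dataset available, so the review in step 3 is a direct field-by-field comparison and unchanged reactions get no new record_modified entry.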

Example enumerating over two template files (to account for differences in reaction analysis) and then merging the resulting datasets.
Write the merged dataset to a pb.gz file with the assigned ORD ID.
@bdeadman
Collaborator Author

Additional example showing how to generate a dataset by enumerating template files over spreadsheets. Two templates are used to accommodate differences in the tabulated data, and the resulting datasets are then merged.
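
As a rough illustration of the enumerate-then-merge idea, the sketch below fills two templates from two small tables, merges the results, and writes a gzipped file. It is not the code in this PR: JSON strings stand in for the real template files, CSV text stands in for the spreadsheets, the output is gzipped JSON rather than a protobuf pb.gz, and the dataset ID, template contents, and helper name are hypothetical.

```python
import csv
import gzip
import io
import json
import os
import tempfile

# Two hypothetical templates that differ only in how the outcome was analysed.
TEMPLATE_YIELD = '{"product_smiles": "$product$", "yield_percent": $value$}'
TEMPLATE_CONVERSION = '{"product_smiles": "$product$", "conversion_percent": $value$}'

# Stand-ins for the two spreadsheets, one per template.
TABLE_YIELD = "product,value\nCCO,85\nCCN,62\n"
TABLE_CONVERSION = "product,value\nCCC,97\nCCCl,40\n"


def enumerate_template(template, table_text):
    """Fill the template once per table row, replacing each $column$ placeholder."""
    reactions = []
    for row in csv.DictReader(io.StringIO(table_text)):
        filled = template
        for column, value in row.items():
            filled = filled.replace(f"${column}$", value)
        reactions.append(json.loads(filled))
    return reactions


# Enumerate each template over its own table, then merge into one dataset.
dataset_id = "ord_dataset-example"  # hypothetical placeholder, not a real ORD ID
merged = {
    "dataset_id": dataset_id,
    "reactions": (
        enumerate_template(TEMPLATE_YIELD, TABLE_YIELD)
        + enumerate_template(TEMPLATE_CONVERSION, TABLE_CONVERSION)
    ),
}

# Write the merged dataset to a gzipped file named after the dataset ID,
# then read it back to confirm the round trip.
with tempfile.TemporaryDirectory() as folder:
    path = os.path.join(folder, f"{dataset_id}.json.gz")
    with gzip.open(path, "wt", encoding="utf-8") as handle:
        json.dump(merged, handle)
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        reloaded = json.load(handle)

print(len(reloaded["reactions"]), "reactions written")
```

Keeping one template per table layout means each template stays simple, and the merge is just a concatenation of the enumerated reactions.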
