-
Notifications
You must be signed in to change notification settings - Fork 62
Ease packaging and publishing process in python #180
Copy link
Copy link
Open
Labels
area/modelModel related functions, including model warehouse, model compression, model evaluation, etc.Model related functions, including model warehouse, model compression, model evaluation, etc.enhancementNew feature or requestNew feature or requestkind/designCategorizes issue or PR as related to design.Categorizes issue or PR as related to design.kind/featureCategorizes issue or PR as related to a new feature.Categorizes issue or PR as related to a new feature.
Metadata
Metadata
Assignees
Labels
area/modelModel related functions, including model warehouse, model compression, model evaluation, etc.Model related functions, including model warehouse, model compression, model evaluation, etc.enhancementNew feature or requestNew feature or requestkind/designCategorizes issue or PR as related to design.Categorizes issue or PR as related to design.kind/featureCategorizes issue or PR as related to a new feature.Categorizes issue or PR as related to a new feature.
/kind feature
What happened:
My team is working with the Kubeflow platform and we're investigating using
ormbto share and publish our ML models and other stateful artifacts like transformers (e.g. standard scaler, pca, tf-idf vectorizer) on Harbor.As far as I understand, to publish a stateful artifact after it processed data, the following steps need to be performed:
<artifact_name>/model/directory<artifact_name>/ormbfile.yamlartifact config file containing the artifact's metadataormbsaveandpushcommands to package and publish the stateful artifactAs some of the metadata can:
createddatetime,sizeof the artifact , run-dependenthyperparameters,metrics)revision,frameworkwith its version used)<artifact_name>/ormbfile.yamlartifact config file needs to be programmatically written/modified. This step – without any utilities – requires to write a lot of logic on the user side.What you expected to happen:
Have a process of publishing ML stateful artifacts as convenient & automated as possible for the end user, i.e. the data scientist.
Maybe we could implement some utilities within
ormbpython sdk to make the process more convenient in practice.How to reproduce it (as minimally and precisely as possible):
Anything else we need to know?:
I'm not that familiar with image based registries, the underlying concepts, and the tools of that ecosystem, so feel free to correct me or suggest me any useful materials.