databricks-aml-mlops-workshop

Step 1: Data lake structure

Set-up common data lake structure to

raw: this is where you can upload the sample sensor CSV file
delta: this is where you save the delta tables
curated: this is where you can save the ML-ready and ML-predictions datasets

Step 2: Databricks

Create a Databricks cluster with runtime 10.4 LTS (includes Apache Spark 3.2.1, Scala 2.12) and ensure that these two libraries are installed:

azureml-core
azureml-mlflow

You will want to create a mount point from Databricks to the data lake using the utils/mount.py example using access key for blob storage option (simplest option without any other dependencies).

You will also want to set up Repo integration with Azure Repos and Databricks Repos: https://learn.microsoft.com/en-us/azure/databricks/repos/repos-setup

Step 3: Spark analysis

Run the delta table and feature engineering / modeling scripts to

Create delta table and enable SQL-queries
Run feature engineering and modeling
Register model with Azure ML and mlflow

Step 4: Azure DevOps

See the example DevOps pipeline for how to create a model training and model deployment pipeline. Here is a really good example for using the Databricks APIs: https://github.com/crflynn/databricks-api

Name		Name	Last commit message	Last commit date
Latest commit History 112 Commits
aml		aml
data		data
databricks-connect		databricks-connect
functions		functions
pipeline		pipeline
poc		poc
utils		utils
README.md		README.md
azure-pipelines-aml-ado.yml		azure-pipelines-aml-ado.yml
azure-pipelines-aml-v2.yml		azure-pipelines-aml-v2.yml
azure-pipelines.yml		azure-pipelines.yml
model_training.py		model_training.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

databricks-aml-mlops-workshop

Step 1: Data lake structure

Step 2: Databricks

Step 3: Spark analysis

Step 4: Azure DevOps

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

databricks-aml-mlops-workshop

Step 1: Data lake structure

Step 2: Databricks

Step 3: Spark analysis

Step 4: Azure DevOps

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages