MLflow Experiment Setup

Prerequisites

Ensure that the you have access to a Databricks SQL Warehouse
An .env file exists in the project's root directory with the following information:

DATABRICKS_TOKEN=<databricks-personal-access-token>
DATABRICKS_HOST=https://<workspace-name>.cloud.databricks.com
MLFLOW_TRACKING_URI=databricks
MLFLOW_REGISTRY_URI=databricks-uc
MLFLOW_TRACING_SQL_WAREHOUSE_ID=<SQL_WAREHOUSE_ID>
MLFLOW_TRACING_DESTINATION=<catalog.schema> # Replace with your schema

Process

1.0 Code Execution

Execute the following sample script with the correct parameters:

# Example values for the placeholders below:
# MLFLOW_TRACING_SQL_WAREHOUSE_ID: "abc123def456" (found in SQL warehouse URL)
# experiment_name: "/Users/[email protected]/traces"
# catalog_name: "main" or "my_catalog"
# schema_name: "mlflow_traces" or "production_traces"

import os
import mlflow
from mlflow.exceptions import MlflowException
from mlflow.entities import UCSchemaLocation
from mlflow.tracing.enablement import set_experiment_trace_location
from dotenv import load_dotenv

# Load environment variables from .env file
load_dotenv()

mlflow.set_tracking_uri("databricks")

# Specify the ID of a SQL warehouse you have access to.
os.environ["MLFLOW_TRACING_SQL_WAREHOUSE_ID"] = "<SQL_WAREHOUSE_ID>"
# Specify the name of the MLflow Experiment to use for viewing traces in the UI.
experiment_name = "<MLFLOW_EXPERIMENT_NAME>"
# Specify the name of the Catalog to use for storing traces.
catalog_name = "<UC_CATALOG_NAME>"
# Specify the name of the Schema to use for storing traces.
schema_name = "<UC_SCHEMA_NAME>"

if experiment := mlflow.get_experiment_by_name(experiment_name):
    experiment_id = experiment.experiment_id

Prerequisites

Ensure that the you have access to a Databricks SQL Warehouse

An .env file exists in the project's root directory with the following information:

DATABRICKS_TOKEN=<databricks-personal-access-token> DATABRICKS_HOST=https://<workspace-name>.cloud.databricks.com MLFLOW_TRACKING_URI=databricks MLFLOW_REGISTRY_URI=databricks-uc MLFLOW_TRACING_SQL_WAREHOUSE_ID=<SQL_WAREHOUSE_ID> MLFLOW_TRACING_DESTINATION=<catalog.schema> # Replace with your schema

Process

1.0 Code Execution

Execute the following sample script with the correct parameters:

# Example values for the placeholders below: # MLFLOW_TRACING_SQL_WAREHOUSE_ID: "abc123def456" (found in SQL warehouse URL) # experiment_name: "/Users/[email protected]/traces" # catalog_name: "main" or "my_catalog" # schema_name: "mlflow_traces" or "production_traces" import os import mlflow from mlflow.exceptions import MlflowException from mlflow.entities import UCSchemaLocation from mlflow.tracing.enablement import set_experiment_trace_location from dotenv import load_dotenv # Load environment variables from .env file load_dotenv() mlflow.set_tracking_uri("databricks") # Specify the ID of a SQL warehouse you have access to. os.environ["MLFLOW_TRACING_SQL_WAREHOUSE_ID"] = "<SQL_WAREHOUSE_ID>" # Specify the name of the MLflow Experiment to use for viewing traces in the UI. experiment_name = "<MLFLOW_EXPERIMENT_NAME>" # Specify the name of the Catalog to use for storing traces. catalog_name = "<UC_CATALOG_NAME>" # Specify the name of the Schema to use for storing traces. schema_name = "<UC_SCHEMA_NAME>" if experiment := mlflow.get_experiment_by_name(experiment_name): experiment_id = experiment.experiment_id

Databricks Mlflow Setup

MLflow Experiment Setup

Prerequisites

Process

1.0 Code Execution

Databricks Mlflow Setup

MLflow Experiment Setup

Prerequisites

Process

1.0 Code Execution

Bluebubbles

Add Tracing

Analytics Events

Add Expert

Arthas

Arthas Eagleeye Traceid