Legacy dbt Semantic Layer migration guide
Introduction
The legacy Semantic Layer will be deprecated in H2 2023. Additionally, the dbt_metrics
package will not be supported in dbt v1.6 and later. If you are using dbt_metrics
, you'll need to upgrade your configurations before upgrading to v1.6. This guide is for people who have the legacy dbt Semantic Layer setup and would like to migrate to the new dbt Semantic Layer. The estimated migration time is two weeks.
Migrate metric configs to the new spec
The metrics specification in dbt Core is changed in v1.6 to support the integration of MetricFlow. It's strongly recommended that you refer to Build your metrics and before getting started so you understand the core concepts of the Semantic Layer.
dbt Labs recommends completing these steps in a local dev environment (such as the dbt Cloud CLI) instead of the dbt Cloud IDE:
-
Create new Semantic Model configs as YAML files in your dbt project.*
-
Upgrade the metrics configs in your project to the new spec.*
-
Delete your old metrics file or remove the
.yml
file extension so they're ignored at parse time. Remove thedbt-metrics
package from your project. Remove any macros that referencedbt-metrics
, likemetrics.calculate()
. Make sure that any packages you’re using don't have references to the old metrics spec. -
Install the dbt Cloud CLI to run MetricFlow commands and define your semantic model configurations.
- If you're using dbt Core, install the MetricFlow CLI with
python -m pip install "dbt-metricflow[your_adapter_name]"
. For example:
python -m pip install "dbt-metricflow[snowflake]"
Note - MetricFlow commands aren't yet supported in the dbt CLoud IDE at this time.
- If you're using dbt Core, install the MetricFlow CLI with
-
Run
dbt parse
. This parses your project and creates asemantic_manifest.json
file in your target directory. MetricFlow needs this file to query metrics. If you make changes to your configs, you will need to parse your project again. -
Run
mf list metrics
to view the metrics in your project. -
Test querying a metric by running
mf query --metrics <metric_name> --group-by <dimensions_name>
. For example:mf query --metrics revenue --group-by metric_time
-
Run
mf validate-configs
to run semantic and warehouse validations. This ensures your configs are valid and the underlying objects exist in your warehouse. -
Push these changes to a new branch in your repo.
ref
not supportedThe dbt Semantic Layer API doesn't support ref
to call dbt objects. This is currently due to differences in architecture between the legacy Semantic Layer and the re-released Semantic Layer. Instead, use the complete qualified table name. If you're using dbt macros at query time to calculate your metrics, you should move those calculations into your Semantic Layer metric definitions as code.
*To make this process easier, dbt Labs provides a custom migration tool that automates these steps for you. You can find installation instructions in the README. Derived metrics aren’t supported in the migration tool, and will have to be migrated manually.
Audit metric values after the migration
You might need to audit metric values during the migration to ensure that the historical values of key business metrics are the same.
-
In the CLI, query the metric(s) and dimensions you want to test and include the
--explain
option. For example:mf query --metrics orders,revenue --group-by metric_time__month,customer_type --explain
-
Use SQL MetricFlow to create a temporary model in your project, like
tmp_orders_revenue audit.sql
. You will use this temporary model to compare against your legacy metrics. -
If you haven’t already done so, create a model using
metrics.calculate()
for the metrics you want to compare against. For example:select *
from {{ metrics.calculate(
[metric('orders)',
metric('revenue)'],
grain='week',
dimensions=['metric_time', 'customer_type'],
) }} -
Run the dbt-audit helper on both models to compare the metric values.
Setup the Semantic Layer in a new environment
This step is only relevant to users who want the legacy and new semantic layer to run in parallel for a short time. This will let you recreate content in downstream tools like Hex and Mode with minimal downtime. If you do not need to recreate assets in these tools skip to step 5.
-
Create a new deployment environment in dbt Cloud and set the dbt version to 1.6 or higher.
-
Select Only run on a custom branch and point to the branch that has the updated metric definition.
-
Set the deployment schema to a temporary migration schema, such as
tmp_sl_migration
. Optional, you can create a new database for the migration. -
Create a job to parse your project, such as
dbt parse
, and run it. Make sure this job succeeds. There needs to be a successful job in your environment in order to set up the semantic layer. -
Select Account Settings -> Projects -> Project details and choose Configure the Semantic Layer.
-
Under Environment, select the deployment environment you created in the previous step. Save your configuration.
-
In the Project details page, click Generate service token and grant it Semantic Layer Only and Metadata Only permissions. Save this token securely. You will need it to connect to the semantic layer.
At this point, both the new semantic layer and the old semantic layer will be running. The new semantic layer will be pointing at your migration branch with the updated metrics definitions.
Update connection in downstream integrations
Now that your Semantic Layer is set up, you will need to update any downstream integrations that used the legacy Semantic Layer.
Migration guide for Hex
To learn more about integrating with Hex, check out their documentation for more info. Additionally, refer to dbt Semantic Layer cells to set up SQL cells in Hex.
-
Set up a new connection for the dbt Semantic Layer for your account. Something to note is that your legacy connection will still work.
-
Re-create the dashboards or reports that use the legacy dbt Semantic Layer.
-
For specific SQL syntax details, refer to Querying the API for metric metadata to query metrics using the API.
- Note — You will need to update your connection to your production environment once you merge your changes to main. Currently, this connection will be pointing at the semantic layer migration environment
Migration guide for Mode
-
Set up a new connection for the semantic layer for your account. Follow Mode's docs to setup your connection.
-
Re-create the dashboards or reports that use the legacy dbt Semantic Layer.
-
For specific SQL syntax details, refer to Querying the API for metric metadata to query metrics using the API.
Merge your metrics migration branch to main, and upgrade your production environment to 1.6.
-
Upgrade your production environment to 1.6 or higher.
- Note — The old metrics definitions are no longer valid so your dbt jobs will not pass.
-
Merge your updated metrics definitions to main. At this point the legacy semantic layer will no longer work.
If you created a new environment in Step 3:
-
Update your Environment in Account Settings -> Project Details -> Edit Semantic Layer Configuration to point to your production environment
-
Delete your migration environment. Be sure to update your connection details in any downstream tools to account for the environment change.