Skip to main content

How to ingest Snowflake metadata into DataHub

Datahub can ingest Snowflake metadata by connecting to Snowflake directly.

Configure DataHub Snowflake Source

To set up Snowflake ingestion in DataHub, follow these steps:

  1. Create a DataHub secret: This example requires a secret named svc_datahub_psw for authentication. Ensure this secret is created before proceeding.
  2. Paste the YAML configuration: In DataHub's Snowflake Ingestion wizard, switch to the YAML view and insert the following configuration.
source:
type: snowflake
config:
account_id: <your snowflake account withouth .snowflakecomputing.com >
convert_urns_to_lowercase: true
include_table_lineage: true
include_view_lineage: true
include_tables: true
include_views: true
include_usage_stats: true
email_domain: example.com
format_sql_queries: true
profiling:
enabled: true
profile_table_level_only: false
profile_if_updated_since_days: 1
stateful_ingestion:
enabled: true
remove_stale_metadata: false
warehouse: wh_catalog
username: svc_datahub
role: catalog
password: '${svc_datahub_psw}'
schema_pattern:
deny:
- DBT_ARTIFACTS
- DBT_TEST__AUDIT
database_pattern:
allow:
- '^(?!.*(?:PR|pr)).*$'

Notes:

  • The database_pattern setting excludes databases with PR or pr in their names.
  • Ensure your Snowflake credentials and DataHub secrets are correctly set up before running the ingestion process.