Destination
A destination is the location where dlt creates and maintains the current version of the schema and loads your data. Destinations come in various forms: databases, data lakes, vector stores, or files. dlt deals with this variety via modules, which you declare when creating a pipeline.
We maintain a set of built-in destinations that you can use right away.
Declare the destination type
We recommend that you declare the destination type when creating a pipeline instance with dlt.pipeline. This allows the run method to synchronize your local pipeline state with the destination, and extract and normalize to create compatible load packages and schemas. You can also pass the destination to the run and load methods.
- Use destination shorthand type
import dlt
pipeline = dlt.pipeline("pipeline", destination="filesystem")
Above, we use the filesystem built-in destination. You can use shorthand types only for built-in destinations.
- Use full destination class type
import dlt
pipeline = dlt.pipeline("pipeline", destination="dlt.destinations.filesystem")
Above, we use the built-in filesystem destination by providing its class type filesystem from the module dlt.destinations. You can pass destinations from external modules as well.
- Import destination class
import dlt
from dlt.destinations import filesystem
pipeline = dlt.pipeline("pipeline", destination=filesystem)
Above, we import the destination class for filesystem and pass it to the pipeline.
All examples above create the same destination class with default parameters and pull the required config and secret values from configuration - they are equivalent.
Pass explicit parameters and a name to a destination
You can instantiate a destination class yourself to configure it explicitly. When doing this, you work with destinations the same way you work with sources:
import dlt
from dlt.destinations import filesystem
azure_bucket = filesystem("az://dlt-azure-bucket", destination_name="production_az_bucket")
pipeline = dlt.pipeline("pipeline", destination=azure_bucket)
Above, we import and instantiate the filesystem destination class. We pass an explicit bucket URL and name the destination production_az_bucket.
If a destination is not named, its shorthand type (the Python class name) serves as the destination name. Name your destination explicitly if you need several separate configurations of destinations of the same type (e.g., you wish to maintain credentials for development, staging, and production storage buckets in the same config file). The destination name is also stored in the load info and pipeline traces, so use explicit names when you need something more descriptive than, for example, filesystem. See the sketch below.
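For example, here is a minimal sketch of two named configurations of the same filesystem destination (the names dev_az_bucket and production_az_bucket are illustrative; bucket URLs and credentials are expected in the matching config sections):
import dlt
from dlt.destinations import filesystem
# hypothetical names for two separately configured buckets of the same type
dev_bucket = filesystem(destination_name="dev_az_bucket")
prod_bucket = filesystem(destination_name="production_az_bucket")
# each instance resolves its own config section, e.g. [destination.production_az_bucket]
pipeline = dlt.pipeline("pipeline", destination=prod_bucket)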
Configure a destination
We recommend passing the credentials and other required parameters to configuration via TOML files, environment variables, or other config providers. This allows you, for example, to easily switch to production destinations after deployment.
We recommend using the default config section layout, as below:
[destination.filesystem]
bucket_url="az://dlt-azure-bucket"
[destination.filesystem.credentials]
azure_storage_account_name="dltdata"
azure_storage_account_key="storage key"
or via environment variables:
DESTINATION__FILESYSTEM__BUCKET_URL=az://dlt-azure-bucket
DESTINATION__FILESYSTEM__CREDENTIALS__AZURE_STORAGE_ACCOUNT_NAME=dltdata
DESTINATION__FILESYSTEM__CREDENTIALS__AZURE_STORAGE_ACCOUNT_KEY="storage key"
For named destinations, you use their names in the config section:
[destination.production_az_bucket]
bucket_url="az://dlt-azure-bucket"
[destination.production_az_bucket.credentials]
azure_storage_account_name="dltdata"
azure_storage_account_key="storage key"
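or, as a sketch following the same section-to-variable naming convention as the environment variables above:
DESTINATION__PRODUCTION_AZ_BUCKET__BUCKET_URL=az://dlt-azure-bucket
DESTINATION__PRODUCTION_AZ_BUCKET__CREDENTIALS__AZURE_STORAGE_ACCOUNT_NAME=dltdata
DESTINATION__PRODUCTION_AZ_BUCKET__CREDENTIALS__AZURE_STORAGE_ACCOUNT_KEY="storage key"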
Note that when you use the dlt init command to create or add a data source, dlt creates a sample configuration for the selected destination.
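For example (the chess source and duckdb destination are just illustrative choices):
dlt init chess duckdb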
Pass explicit credentials
You can pass credentials explicitly when creating a destination class instance. This replaces the credentials argument in the dlt.pipeline and pipeline.load methods, which is now deprecated. You can pass the required credentials object, its dictionary representation, or a supported native form, as below:
import dlt
from dlt.destinations import postgres
# pass full credentials - together with the password (not recommended)
pipeline = dlt.pipeline(
    "pipeline",
    destination=postgres(credentials="postgresql://loader:loader@localhost:5432/dlt_data"),
)
You can create and pass partial credentials, and dlt will fill in the missing data. Below, we pass a Postgres connection string without the password and expect that it will be present in environment variables (or any other config provider):
import dlt
from dlt.destinations import postgres
# pass credentials without password
# dlt will retrieve the password from e.g. DESTINATION__POSTGRES__CREDENTIALS__PASSWORD
prod_postgres = postgres(credentials="postgresql://loader@localhost:5432/dlt_data")
pipeline = dlt.pipeline("pipeline", destination=prod_postgres)
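You can also pass a credentials object and fill in only some of its fields, letting dlt resolve the rest from config providers: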
import dlt
from dlt.destinations import filesystem
from dlt.sources.credentials import AzureCredentials
credentials = AzureCredentials()
# fill only the account name, leave key to be taken from secrets
credentials.azure_storage_account_name = "production_storage"
pipeline = dlt.pipeline(
    "pipeline", destination=filesystem("az://dlt-azure-bucket", credentials=credentials)
)
Please read how to use the various built-in credential types.
Access a destination
When loading data, dlt will access the destination in two cases:
- At the beginning of the run method, to sync the pipeline state with the destination (or if you call pipeline.sync_destination explicitly).
- In the pipeline.load method, to migrate the schema and load the load package.
Obviously, dlt will access the destination when you instantiate sql_client.
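For example, a minimal sketch (duckdb and the letters table are illustrative; any destination and previously loaded table behave the same way):
import dlt
pipeline = dlt.pipeline("pipeline", destination="duckdb")
pipeline.run([{"letter": "a"}], table_name="letters")
# opening the sql_client connects to the destination
with pipeline.sql_client() as client:
    rows = client.execute_sql("SELECT letter FROM letters")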
dlt will not import the destination dependencies or access the destination configuration if access is not needed. You can build multi-stage pipelines where steps are executed in separate processes or containers - the extract and normalize steps do not need destination dependencies, configuration, or an actual connection.
import dlt
from dlt.destinations import filesystem
# just declare the destination.
pipeline = dlt.pipeline("pipeline", destination="filesystem")
# no destination credentials or config needed to extract
pipeline.extract(["a", "b", "c"], table_name="letters")
# same for normalize
pipeline.normalize()
# here, dependencies will be imported, secrets pulled, and the destination accessed
# we pass bucket_url explicitly and expect credentials to be passed by the config provider
load_info = pipeline.load(destination=filesystem(bucket_url="az://dlt-azure-bucket"))
load_info.raise_on_failed_jobs()
Declare external destination
You can implement your own destination and pass the destination class type or instance to a dlt pipeline.
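For instance, here is a minimal sketch of a function-based destination declared with the dlt.destination decorator (my_sink and batch_size=10 are illustrative):
import dlt
@dlt.destination(batch_size=10)
def my_sink(items, table) -> None:
    # receives batches of normalized rows together with the table schema
    print(table["name"], len(items))
pipeline = dlt.pipeline("pipeline", destination=my_sink)
pipeline.run([{"letter": "a"}, {"letter": "b"}], table_name="letters")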