Skip to content

Clickhouse Server

The primary way to use splinkclickhouse is with data in a running Clickhouse server.

To use Clickhouse with Splink you will need to connect to your Clickhouse instance with clickhouse-connect.

You will need to configure a client to your running version, and use this to create a Splink DatabaseAPI:

import clickhouse_connect

from splinkclickhouse import ClickhouseServerAPI

client = clickhouse_connect.get_client(
    host="127.0.0.1",
    port=8443,
    username="splink_user",
    password="password",
    database="database",
)

db_api = ClickhouseServerAPI(client)

From this point onwards your code should generally be the same as that appearing in the Splink docs. The exceptions will be if you want to take advantage of Clickhouse-specific comparisons or need advanced usage.

Dates

If you are working with dates, particularly those before 1900, you may wish to read the guide on working with dates.