
Add dbt-cloud integration command to dp cli #99

Open
wants to merge 10 commits into base: develop

Conversation

rdziadosz

Adds a configure-cloud command to the dp CLI that creates a dbt Cloud project with a configured connection to BigQuery.
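The conversation below does not show the exact invocation; presumably the command is run as follows (any flags it may take are not shown in this thread):

dp configure-cloud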

(Several outdated, resolved review threads on data_pipelines_cli/cli_commands/cloud.py and data_pipelines_cli/dbt_cloud_api_client.py.)
@p-pekala (Collaborator)

We will also need to update the documentation in the docs dir in this repo.

@p-pekala (Collaborator)

I don't see any documentation. I have no idea what dbtcloud.yml should look like.
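For later readers: piecing together the fields discussed in the threads below, dbtcloud.yml presumably ends up looking roughly like this sketch. The key names (in particular config_dir, the renamed bq_config_dir) are assumptions, since the final file is never shown in this conversation:

project_name: my-pipeline             # name of the dbt Cloud project to create
environments:
  - name: dev                         # environment name created in dbt Cloud
    type: development                 # "development" or "deployment"
    dbt_version: "1.4.0"              # dbt version used in this environment
    config_dir: dev                   # dp config dir; GCP project/dataset are read from its bigquery.yml
  - name: prod
    type: deployment
    dbt_version: "1.4.0"
    config_dir: prod
    schedule_interval: "0 12 * * *"   # cron for the example job (deployment environments only)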

data_pipelines_cli/cli_commands/dbtcloud.py (outdated review thread, resolved)
    credentials_id = client.create_credentials(environment["dataset"], project_id)
else:
    credentials_id = None
environment_id = client.create_environment(project_id, environment["type"], environment["name"],
@p-pekala (Collaborator)

I would extract the if + environment creation into a separate method create_environment.

@rdziadosz (Author)

Done
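Judging from the truncated excerpt above and the call site later in this conversation (create_environment(client, environment, project_id)), the extracted helper is presumably close to the sketch below; the guarding condition and the trailing arguments of client.create_environment are assumptions, since the diff is cut off:

def create_environment(client, environment, project_id):
    # Assumed condition: only deployment environments get BigQuery credentials;
    # the excerpt above starts mid-if.
    if environment["type"] == "deployment":
        credentials_id = client.create_credentials(environment["dataset"], project_id)
    else:
        credentials_id = None
    # The remaining arguments are truncated in the diff; credentials_id is
    # assumed to be among them.
    return client.create_environment(project_id, environment["type"],
                                     environment["name"], credentials_id)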

data_pipelines_cli/cli_commands/dbtcloud.py (review thread, resolved)
data_pipelines_cli/dbt_cloud_api_client.py (outdated review thread, resolved)
     - string
     - The cron expression with which the example job will be run
   * - default_gcp_project
@p-pekala (Collaborator)

I think there shouldn't be a default one; it should always be taken from bigquery.yml.

@rdziadosz (Author)

Done

     - string
     - Target dataset for this environment
   * - dbt_version
@p-pekala (Collaborator)

Why is it per environment? Can we make it global?

@rdziadosz (Author)

In dbt Cloud this is set per environment: https://docs.getdbt.com/docs/collaborate/environments/dbt-cloud-environments#common-environment-settings. We could make the setting the same for all environments, but then we would be limited to a single version. I assume someone might want to test the code in a separate environment, e.g. before upgrading dbt for the whole project. Are you sure I should make this change?

     - string
     - The dbt version used in this environment
   * - bq_config_dir
@p-pekala (Collaborator)

I would either remove the "bq_" prefix or use the env name, e.g. dev/prod.

@rdziadosz (Author)

Done

     - string
     - Name of the environment that will be created in dbt Cloud
   * - dataset
@p-pekala (Collaborator)

The dataset should also be taken from bigquery.yml.

@rdziadosz (Author)

Done

     - Details of the environments to be created in dbt Cloud

Configuration of the environments:
@p-pekala (Collaborator)

The field "type" is missing.

@rdziadosz (Author)

Done

    bq_config = read_bigquery_config(environment["bq_config_dir"])
    environments_projects[environment["name"]] = bq_config["project"]

client.create_environment_variable(project_id, dbtcloud_config["default_gcp_project"],
@p-pekala (Collaborator)

You don't need default_gcp_project. This default is in the "base" config dir, and you already read it in the read_bigquery_config method.

@rdziadosz (Author)

Okay, I changed it to be taken from the "base" config. I assume the only use of this value would be if someone added another environment and did not update this environment variable.

dbtcloud_config = read_dbtcloud_config()
with open(keyfile) as file:  # context manager so the keyfile handle is closed
    keyfile_data = json.load(file)
project_id = client.create_project(dbtcloud_config["project_name"])
@p-pekala (Collaborator)

I would change the name of the variable to dbtcloud_project_id.

@rdziadosz (Author)

Done

for environment in dbtcloud_config["environments"]:
    environment_id = create_environment(client, environment, project_id)
    if environment["type"] == "deployment":
        client.create_job(project_id, environment_id, dbtcloud_config["schedule_interval"],
@p-pekala (Collaborator)

The schedule interval could be per environment, I think.

@rdziadosz (Author)

Done
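Presumably the fix moves the interval onto each environment entry, so the create_job call above would become something like this (the key placement is an assumption; the final diff is not shown):

        client.create_job(project_id, environment_id, environment["schedule_interval"],
                          "Job - " + environment["name"])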

new_env = {
    "env_var": env_var
}
print(new_env)
@p-pekala (Collaborator)

Please remove the print or replace it with logging.

@rdziadosz (Author)

Done
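A minimal sketch of the logging variant, assuming a module-level logger (the PR's actual replacement is not shown in this thread):

import logging

logger = logging.getLogger(__name__)

new_env = {
    "env_var": env_var  # env_var is built by the surrounding code in the PR
}
logger.debug("Environment variable payload: %s", new_env)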

        client.create_job(project_id, environment_id, dbtcloud_config["schedule_interval"],
                          "Job - " + environment["name"])
    bq_config = read_bigquery_config(environment["bq_config_dir"])
    environments_projects[environment["name"]] = bq_config["project"]
@p-pekala (Collaborator)

Does it resolve the project properly? In bigquery.yml we currently have project: "{{ env_var('GCP_PROJECT') }}". It should be taken from the env during deployment, shouldn't it?

@rdziadosz (Author)

I added resolving of env vars using the dbt show command.
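For readers unfamiliar with the expression in question: the PR shells out to dbt, but a standalone equivalent of resolving the env_var() expression from bigquery.yml could look like this sketch (illustrative only, not the PR's implementation):

import os
from jinja2 import Template

def resolve_env_vars(raw: str) -> str:
    # Render dbt-style "{{ env_var('GCP_PROJECT') }}" expressions
    # against the current process environment.
    return Template(raw).render(
        env_var=lambda name, default=None: os.environ.get(name, default)
    )

# resolve_env_vars("{{ env_var('GCP_PROJECT') }}") -> value of $GCP_PROJECT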

# remove resolving jinja / env vars
Labels: none yet · Projects: none yet · 2 participants