New dbt recipe allowing you to rerun your dbt DAG from a failed node - by Alex Streed

anna_geller · April 19, 2022, 4:49pm

Hi Prefectionists!

If you are using Prefect with dbt, we have a great new recipe you should try out!
@desertaxle has just released a recipe allowing you to easily rerun failed dbt models from a failed dbt DAG node.

Get the recipe code

https://github.com/PrefectHQ/customer-success-recipes/tree/main/prefect/flows/dbt/rerun-models-from-failure

The problem this recipe helps to solve

When running a suite of dbt models, sometimes a few models may fail due to exogenous circumstances (e.g., network issues, improper authorization, etc.). Retrying the entire DbtShellTask with all your dbt models may be expensive and time-consuming, especially if only one of those models failed.

Solution

You can extend your dbt Prefect flow by adding a task that can rerun only failed models. The recipe uses the --select flag together with --defer and --state to only rerun your dbt DAG from a failed node:

dbt build --select result:error+ --defer --state ./target

The value for the --state flag can be modified to point to another location where dbt state artifacts are stored.
The flow uses the all_failed trigger to ensure that the rerun task only runs if the initial dbt task fails.
For more details, check out the README.

The flow code is located here:
https://github.com/PrefectHQ/customer-success-recipes/blob/main/prefect/flows/dbt/rerun-models-from-failure/flow.py

noahholm · April 21, 2022, 12:38pm

Wow this is a really nice use case of all the capabilities the dbt cli offers.

I have a bit of a hard time seeing when an immediate retry is useful as most failures rely on upstream issues and not flakes, network issues and such. But it could be very different with different warehouses or organisations of course. Of course, it probably doesn’t hurt either. If it can avoid some days waking up to alerts of failures that’s probably worth it.

Another idea here could be to add a manual trigger for the rerun task, so that you just have to click on the rerun manual approval once you’ve fixed any upstream issues if any. I’ve found myself kicking of a new flow and manually giving it a rerun command like dbt run --select failed_model+

anna_geller · April 27, 2022, 11:39pm

@noahholm I’ve recently got a response from dbt about WHEN to use this feature - sharing in case this might be useful:

Timeout issues with the database( you can simply rerun the prefect flow as is)
Lack of permissions to read/write with tables/schemas etc (need to manually invoke grant statements in the database before running this prefect flow again)
Another dbt job is touching the same table as the current job and causing concurrency issues at the database level (you can simply rerun the prefect flow as is)

serina · October 3, 2022, 8:02pm

Flow code and README location have changed to the following location:

Topic		Replies	Views
Checkpoint/restart capability Archive	1	970	August 31, 2022
Ideas on future dbt integrations with Prefect Archive prefect-1-0 , prefect-2-0 , dbt , task-library	0	928	March 3, 2022
How can I send Slack alert on failure including dbt output? Archive prefect-1-0 , dag-flow-structure , dbt , state_handlers , task-library , slack-notifications , notifications , failure , data-dependencies	7	3320	May 17, 2023
Trigger_dbt_cli_command does not returns log when the dbt command fails Help prefect-2-0 , dbt , task-runner	1	671	July 4, 2023
Are there any thoughts out there on how to make a DbtShellTask dynamic? Archive prefect-1-0 , dbt	0	378	April 2, 2022

New dbt recipe allowing you to rerun your dbt DAG from a failed node - by Alex Streed

Get the recipe code

The problem this recipe helps to solve

Solution

Related topics