Release resources on task timeout

giorgiobasile · December 1, 2022, 5:29pm

I’m having an issue with proper clean up of acquired resources on task timeouts.
I have a Process deployment for the following flow and task, that connects to a Dask.distributed cluster in the following way:

from distributed import Client
from prefect import task, flow

@task(timeout_seconds=5)
def my_task():
    client = Client("127.0.0.1:8787")
    # some long computational operation
    client.close()

@flow
def flow():
    my_task()

Unfortunately, when the timeout goes off, even though there is an exception thrown, the client seems not to be deallocated, and the execution of the computation on the Dask graph keeps going.
I was hoping the Dask client would go out of scope, its connection be forcefully closed, and the computation be canceled from the cluster, but apparently, it is more complicated than this.
Any suggestion on how to make sure resources are properly released? Thanks!

anna_geller · December 2, 2022, 12:31am

We have a WIP about server-side timeout implementation. Can you try doing it on a future level? e.g. when you submit a task to a DaskTaskRunner, you can do:

@flow(task_runner=DaskTaskRunner(...))
def test_flow():
	future = test_task.submit()
	future.wait(10)

here 10 means 10 seconds before timeout

giorgiobasile · December 2, 2022, 10:01am

I’m not sure I understand your suggestion, I do not use a DaskTaskRunner, but a SequentialTaskRunner, and I want the task to create a connection to the Dask.distributed cluster from within the task itself - I could go into the reasons why I choose to do it like this, but not sure it is relevant.
I believe I would have the same issue with an SQL connection not being released on timeout, or other similar situations. Does this make sense?

anna_geller · December 2, 2022, 2:16pm

Yes it does, I understand. As I said:

Topic		Replies	Views
TimeoutError when Flow is finishing Archive prefect-2-0 , dask , slurm	6	1904	November 2, 2022
Running numerous tasks on dask cluster with large data inputs/outputs causes timeout errors Archive prefect-2-0 , dask , parallel-processing , timeout	3	1118	August 22, 2022
Long running tasks on prefect-dask clusters Archive prefect-2-0 , dask , timeout	5	1660	August 11, 2022
Timeout_seconds functionality not working while running tasks concurrently Help prefect-2-0 , deployment , ui , timeout , task	0	29	September 30, 2024
Server TimeoutError in db stmt execute creating task run when flow submits task for concurrent execution Help prefect-2-0	2	567	August 22, 2023

Release resources on task timeout

Related topics