Why do completed tasks still show as running on the timeline

simon_mackenzie · March 18, 2022, 2:15pm

I have mapped tasks that feed into a reducing task. The mapped tasks complete in around 2 minutes and are shown as completed in the table on the server display. However the timeline display shows them as still running as long as the reducing task is running. The bar gets longer and if I hover it shows duration increasing.

When the reduce task completes the bars for those tasks shrink to their correct duration. Is this a bug or is there some reason for it?

anna_geller · March 19, 2022, 2:23am

If I understood correctly, I’d say it’s normal - it takes some time to update the task run states, and until then, the task is shown as Running.

But if you could share your flow code and some UI screenshots, it would be easier to assess what’s happening.

Also, it would be helpful if you could share the output of the command prefect diagnostics.

simon_mackenzie · March 19, 2022, 11:09am

The blue bar cannot start until the five long green bars have finished. Those tasks finished in 3 minutes but still show as a bar. When I hover they show duration 10 minutes increasing. Once the blue task has finished the bars revert to their correct duration of 3 minutes.

The flow is as below. The running blue task is “predict”

basename_pngs = basename.map(pngs)
rotated = pngtasks.rotate.map(pngs, unmapped(csv), basename_pngs)
boxes = pngtasks.OCR.map(rotated, unmapped(csv), basename_pngs)
parsed = pngtasks.parse.map(boxes, basename_pngs)
matched = pngtasks.match.map(parsed, unmapped(csv), basename_pngs)

# merge
merged = jobtasks.merge(matched, csv)
data = jobtasks.prepare_data(merged)
preds = jobtasks.predict(data)

anna_geller · March 19, 2022, 11:19am

Thanks for explaining. The only issue I see here is that your mapped tasks accept two mapped inputs. You should only have one input to map over and all other arguments to your mapped tasks should be wrapped with unmapped.

basename_pngs = basename.map(pngs)
rotated = pngtasks.rotate.map(pngs, unmapped(csv), unmapped(basename_pngs))
boxes = pngtasks.OCR.map(rotated, unmapped(csv), unmapped(basename_pngs))
parsed = pngtasks.parse.map(boxes, unmapped(basename_pngs))
matched = pngtasks.match.map(parsed, unmapped(csv), unmapped(basename_pngs))

simon_mackenzie · March 19, 2022, 11:24am

pngs is a list of paths. basename_pngs is the basename of those paths which is used in a target template. So I need to map both as it won’t work as a list. Are you saying it is not possible to do that?

EDIT: if only the first parameter can be mapped then why is there a need for the unmapped function?

anna_geller · March 19, 2022, 11:35am

This is needed so that Prefect knows which mapped input to iterate over, and which values are supposed to be used as constant values. This documentation page explains it best.

As with almost anything in Prefect, it certainly is possible the only question is what’s the right way to design a given flow. For instance, if you need to map over both of those paths, the input to your pngtasks.rotate could be a list of dictionaries rather than a list of strings (paths). This way, within your task, you can grab the right element of that dict.

[{"png": "aaa", "basename_png": "xxx"}, 
{"png": "bbb", "basename_png": "yyy"},
{"png": "ccc", "basename_png": "zzz"}, ...]

simon_mackenzie · March 19, 2022, 11:47am

The docs you link to imply you can have more than one mapped input and indeed it does seem to work except for the progress bar issue. Would not work as a dictionary as I have @task(target=“path/to/{basename}”) which requires basename to be one of the arguments.

In next iteration I will move to prefect2 and use the database rather than targets anyway. Will update if the progress bar issue still happens or not.

anna_geller · March 19, 2022, 11:54am

Where do you see it? If you mean apply_map, that’s a completely different thing.

I can only see:

Sometimes, we don’t want to iterate over one of the inputs – perhaps it’s a constant value, or a list that’s required in its entirety. Prefect supplies a convenient unmapped() annotation for this case.

Sounds like a good plan!

simon_mackenzie · March 19, 2022, 12:04pm

Sometimes, we don’t want to iterate over one of the inputs

If you had 3 inputs and don’t want to iterate over one then that implies you do want to iterate over the other two.

Topic		Replies	Views
How can I report progress on a long-running task run in the UI? Archive prefect-1-0 , artifacts-api , progress-bar	3	2585	March 6, 2024
On task mark as success, still inner loops are in progress Archive prefect-1-0 , graphql	12	1581	May 6, 2022
Flow/task run runtime estimate Archive prefect-2-0 , getting-started	2	483	March 8, 2023
Flow remains in a "Running" state, and my Task is stuck in a "Pending" state Help prefect-2-0 , deployment , server , concurrency-limits , task	0	108	July 8, 2024
My flow won't stop running in the background Archive	7	1405	December 3, 2022

Why do completed tasks still show as running on the timeline

Related topics