-
In a simple use case - where DAG is allowed to run in max 1 instance, and when there is a daily processing i.e. DAG should be executed once a day - I would like to know if there is a way to reset DAG's last dataset event trigger - i.e. manually define that your last dataset triggered date is midnight. What is the problemTwo waiting DAGs - WAIT1 and WAIT2 are waiting for something and then setting DATASET1 and DATASET2 as outlets. MyDAG should be triggered after both updates, but in the current day. This works when you set up everything at once, but got out of sync if you re-execute one of the WAIT dags. For example:
I tried to manually ruyn MyDAG - hopin that it will reset the Datasets time, so basically it will show 0/2 datasets ready, but no, does not help. Possible workaroundSo the only option is to rerun at the same time ALL WAIT dags and then let MyDAG to trigger again. But this is costly (as there can be many downstream dags) - and it can be out of sync if somebody manually runs WAIT2. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 3 replies
-
And my propose solution with "RESET" function:
|
Beta Was this translation helpful? Give feedback.
-
Came up with this workaround, so whenever the dataset scheduled DAG gets " out of sync" I reset the datasets state by running this helper DAG. Essentially I am clearing out the DAG's dataset queue.
|
Beta Was this translation helpful? Give feedback.
Came up with this workaround, so whenever the dataset scheduled DAG gets " out of sync" I reset the datasets state by running this helper DAG. Essentially I am clearing out the DAG's dataset queue.