-
Notifications
You must be signed in to change notification settings - Fork 14.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tasks in DAGs with depends_on_past
or task_concurrency
are not being scheduled
#12659
Comments
On Master, this was fixed by #7402 & further optimised by #7503 in 1.10.13 -- this was clubbed by the following 2 commits:
I will investigate this further |
@nathadfield has confirmed the issue does not exist on 2.0.0b3 |
Since v1.10.13 we also noticed, for some dags, the tasks are not being scheduled. They stay forever with a None state. Nothing in the scheduler logs (DEBUG level). Running the tasks manually work fine though. In our case, some/most of the dags have indeed |
Can you check if the other DAGs (not using |
I did not check absolutely all dags we have but yes, the "broken" dags either have |
depends_on_past
are not being scheduleddepends_on_past
or task_concurrency
are not being scheduled
I can confirm the bug. I was able to reproduce it with task with from airflow import models
from airflow.operators.dummy_operator import DummyOperator
from airflow.operators.bash_operator import BashOperator
from datetime import datetime, timedelta
default_args = {
'owner': 'airflow',
'start_date': datetime(2018, 10, 31),
'retries': 3,
'retry_delay': timedelta(minutes=5)
}
dag_name = 'dag-bugcheck'
with models.DAG(dag_name,
default_args=default_args,
schedule_interval='0 0 * * *',
catchup=False,
max_active_runs=5,
) as dag:
test1 = DummyOperator(
task_id='test1',
task_concurrency=10,
)
test2 = BashOperator(
task_id='test2',
bash_command='echo hi',
depends_on_past=True,
)
test3 = BashOperator(
task_id='test3',
bash_command='echo hi',
) |
#12663 should fix it @nathadfield @mthoretton |
Nice one @kaxil! Will this force the need for a 1.10.14 then? |
Yup, indeed. I hope to get it out by early next week |
closes apache#12659 (cherry picked from commit 67807ee)
Closed by #12663 |
closes apache#12659 (cherry picked from commit 67807ee) (cherry picked from commit 56e4468)
…663) closes apache/airflow#12659 GitOrigin-RevId: 2a7944da85c3ec7cc913640ff89a66e06d486480
…663) closes apache/airflow#12659 GitOrigin-RevId: 2a7944da85c3ec7cc913640ff89a66e06d486480
Apache Airflow version: 1.10.13
What happened:
After performing an upgrade to
v1.10.13
we noticed that tasks in some of our DAGs were not be scheduled. After a bit of investigation we discovered that by commenting out'depends_on_past': True
the issue went away.What you expected to happen:
We think the issue might have something to do with this which was introduced to
1.10.13
[AIRFLOW-3607] Only query DB once per DAG run for TriggerRuleDep (#4751)
How to reproduce it:
The text was updated successfully, but these errors were encountered: