Not trying to muddy the waters, but the observation of Jeremiah (non
deterministic outcomes) might have to do something with #3. I didn’t dive in
deeper, yet.
======================================================================
ERROR: test_backfill_examples (tests.BackfillJobTest)
----------------------------------------------------------------------
Traceback (most recent call last):
File "/home/travis/build/apache/incubator-airflow/tests/jobs.py", line 164,
in test_backfill_examples
job.run()
File "/home/travis/build/apache/incubator-airflow/airflow/jobs.py", line 200,
in run
self._execute()
File "/home/travis/build/apache/incubator-airflow/airflow/jobs.py", line
1999, in _execute
raise AirflowException(err)
AirflowException: ---------------------------------------------------
Some task instances failed:
set([('example_short_circuit_operator', 'condition_is_True',
datetime.datetime(2016, 1, 1, 0, 0))])
https://s3.amazonaws.com/archive.travis-ci.org/jobs/204780706/log.txt
<https://s3.amazonaws.com/archive.travis-ci.org/jobs/204780706/log.txt>
Bolke
> On 25 Feb 2017, at 09:07, Bolke de Bruin <[email protected]> wrote:
>
> Hi Dan,
>
> - Backfill indeed runs only one dagrun at the time, see line 1755 of jobs.py.
> I’ll think about how to fix this over the weekend (I think it was my change
> that introduced this). Suggestions always welcome. Depending the impact it is
> a blocker or not. We don’t often use backfills and definitely not at your
> size, so that is why it didn’t pop up with us. I’m assuming blocker for now,
> btw.
> - Speculation on the High DB Load. I’m not sure what your benchmark is here
> (1.7.1 + multi processor dags?), but as you mentioned in the code
> dependencies are checked a couple of times for one run and even task
> instance. Dependency checking requires aggregation on the DB, which is a
> performance killer. Annoying but not a blocker.
> - Skipped tasks potentially cause a dagrun to be marked failure/success
> prematurely. BranchOperators are widely used if it affects these operators,
> then it is a blocker.
>
> - Bolke
>
>> On 25 Feb 2017, at 02:04, Dan Davydov <[email protected]> wrote:
>>
>> Update on old pending issues:
>> - Black Squares in UI: Fix merged
>> - Double Trigger Issue That Alex G Mentioned: Alex has a PR in flight
>>
>> New Issues:
>> - Backfill seems to be having issues (only running one dagrun at a time),
>> we are still investigating - might be a blocker
>> - High DB Load (~8x more than 1.7) - We are still investigating but it's
>> probably not a blocker for the release
>> - Skipped tasks potentially cause a dagrun to be marked as failure/success
>> prematurely - not sure whether or not to classify this as a blocker (only
>> really an issue for users who use the BranchingPythonOperator, which AirBnB
>> does)
>>
>> On Thu, Feb 23, 2017 at 5:59 PM, siddharth anand <[email protected]> wrote:
>>
>>> IMHO, a DAG run without a start date is non-sensical but is not enforced
>>> That said, our UI allows for the manual creation of DAG Runs without a
>>> start date as shown in the images below:
>>>
>>>
>>> - https://www.dropbox.com/s/3sxcqh04eztpl7p/Screenshot%
>>> 202017-02-22%2016.00.40.png?dl=0
>>> <https://www.dropbox.com/s/3sxcqh04eztpl7p/Screenshot%
>>> 202017-02-22%2016.00.40.png?dl=0>
>>> - https://www.dropbox.com/s/4q6rr9dwghag1yy/Screenshot%
>>> 202017-02-22%2016.02.22.png?dl=0
>>> <https://www.dropbox.com/s/4q6rr9dwghag1yy/Screenshot%
>>> 202017-02-22%2016.02.22.png?dl=0>
>>>
>>>
>>> On Wed, Feb 22, 2017 at 2:26 PM, Maxime Beauchemin <
>>> [email protected]> wrote:
>>>
>>>> Our database may have edge cases that could be associated with running
>>> any
>>>> previous version that may or may not have been part of an official
>>> release.
>>>>
>>>> Let's see if anyone else reports the issue. If no one does, one option is
>>>> to release 1.8.0 as is with a comment in the release notes, and have a
>>>> future official minor apache release 1.8.1 that would fix these minor
>>>> issues that are not deal breaker.
>>>>
>>>> @bolke, I'm curious, how long does it take you to go through one release
>>>> cycle? Oh, and do you have a documented step by step process for
>>> releasing?
>>>> I'd like to add the Pypi part to this doc and add committers that are
>>>> interested to have rights on the project on Pypi.
>>>>
>>>> Max
>>>>
>>>> On Wed, Feb 22, 2017 at 2:00 PM, Bolke de Bruin <[email protected]>
>>> wrote:
>>>>
>>>>> So it is a database integrity issue? Afaik a start_date should always
>>> be
>>>>> set for a DagRun (create_dagrun) does so I didn't check the code
>>> though.
>>>>>
>>>>> Sent from my iPhone
>>>>>
>>>>>> On 22 Feb 2017, at 22:19, Dan Davydov <[email protected].
>>> INVALID>
>>>>> wrote:
>>>>>>
>>>>>> Should clarify this occurs when a dagrun does not have a start date,
>>>> not
>>>>> a
>>>>>> dag (which makes it even less likely to happen). I don't think this
>>> is
>>>> a
>>>>>> blocker for releasing.
>>>>>>
>>>>>>> On Wed, Feb 22, 2017 at 1:15 PM, Dan Davydov <
>>> [email protected]>
>>>>> wrote:
>>>>>>>
>>>>>>> I rolled this out in our prod and the webservers failed to load due
>>> to
>>>>>>> this commit:
>>>>>>>
>>>>>>> [AIRFLOW-510] Filter Paused Dags, show Last Run & Trigger Dag
>>>>>>> 7c94d81c390881643f94d5e3d7d6fb351a445b72
>>>>>>>
>>>>>>> This fixed it:
>>>>>>> - </a> <span id="statuses_info"
>>>>>>> class="glyphicon glyphicon-info-sign" aria-hidden="true"
>>> title="Start
>>>>> Date:
>>>>>>> {{last_run.start_date.strftime('%Y-%m-%d %H:%M')}}"></span>
>>>>>>> + </a> <span id="statuses_info"
>>>>>>> class="glyphicon glyphicon-info-sign" aria-hidden="true"></span>
>>>>>>>
>>>>>>> This is caused by assuming that all DAGs have start dates set, so a
>>>>> broken
>>>>>>> DAG will take down the whole UI. Not sure if we want to make this a
>>>>> blocker
>>>>>>> for the release or not, I'm guessing for most deployments this would
>>>>> occur
>>>>>>> pretty rarely. I'll submit a PR to fix it soon.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On Tue, Feb 21, 2017 at 9:49 AM, Chris Riccomini <
>>>> [email protected]
>>>>>>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> Ack that the vote has already passed, but belated +1 (binding)
>>>>>>>>
>>>>>>>> On Tue, Feb 21, 2017 at 7:42 AM, Bolke de Bruin <[email protected]
>>>>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> IPMC Voting can be found here:
>>>>>>>>>
>>>>>>>>> http://mail-archives.apache.org/mod_mbox/incubator-general/
>>>>>>>> 201702.mbox/%
>>>>>>>>> [email protected]%3e <
>>>>>>>>> http://mail-archives.apache.org/mod_mbox/incubator-general/
>>>>>>>> 201702.mbox/%
>>>>>>>>> [email protected]%3E>
>>>>>>>>>
>>>>>>>>> Kind regards,
>>>>>>>>> Bolke
>>>>>>>>>
>>>>>>>>>> On 21 Feb 2017, at 08:20, Bolke de Bruin <[email protected]>
>>>> wrote:
>>>>>>>>>>
>>>>>>>>>> Hello,
>>>>>>>>>>
>>>>>>>>>> Apache Airflow (incubating) 1.8.0 (based on RC4) has been
>>> accepted.
>>>>>>>>>>
>>>>>>>>>> 9 “+1” votes received:
>>>>>>>>>>
>>>>>>>>>> - Maxime Beauchemin (binding)
>>>>>>>>>> - Arthur Wiedmer (binding)
>>>>>>>>>> - Dan Davydov (binding)
>>>>>>>>>> - Jeremiah Lowin (binding)
>>>>>>>>>> - Siddharth Anand (binding)
>>>>>>>>>> - Alex van Boxel (binding)
>>>>>>>>>> - Bolke de Bruin (binding)
>>>>>>>>>>
>>>>>>>>>> - Jayesh Senjaliya (non-binding)
>>>>>>>>>> - Yi (non-binding)
>>>>>>>>>>
>>>>>>>>>> Vote thread (start):
>>>>>>>>>> http://mail-archives.apache.org/mod_mbox/incubator-
>>>>>>>>> airflow-dev/201702.mbox/%3cD360D9BE-C358-42A1-9188-
>>>>>>>>> [email protected]%3e <http://mail-archives.apache.
>>>>>>>>> org/mod_mbox/incubator-airflow-dev/201702.mbox/%3C7EB7B6D6-
>>>>>>>> 092E-48D2-AA0F-
>>>>>>>>> [email protected]%3E>
>>>>>>>>>>
>>>>>>>>>> Next steps:
>>>>>>>>>> 1) will start the voting process at the IPMC mailinglist. I do
>>>> expect
>>>>>>>>> some changes to be required mostly in documentation maybe a
>>> license
>>>>> here
>>>>>>>>> and there. So, we might end up with changes to stable. As long as
>>>>> these
>>>>>>>> are
>>>>>>>>> not (significant) code changes I will not re-raise the vote.
>>>>>>>>>> 2) Only after the positive voting on the IPMC and finalisation I
>>>> will
>>>>>>>>> rebrand the RC to Release.
>>>>>>>>>> 3) I will upload it to the incubator release page, then the tar
>>>> ball
>>>>>>>>> needs to propagate to the mirrors.
>>>>>>>>>> 4) Update the website (can someone volunteer please?)
>>>>>>>>>> 5) Finally, I will ask Maxime to upload it to pypi. It seems we
>>> can
>>>>>>>> keep
>>>>>>>>> the apache branding as lib cloud is doing this as well (
>>>>>>>>> https://libcloud.apache.org/downloads.html#pypi-package <
>>>>>>>>> https://libcloud.apache.org/downloads.html#pypi-package>).
>>>>>>>>>>
>>>>>>>>>> Jippie!
>>>>>>>>>>
>>>>>>>>>> Bolke
>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>
>>>>
>>>
>