Thanks for thorough consideration Jarek. I follow your concerns. The idea behind this AIP was to reduce the cognitive load on users by staying as much pythonic as we can and to be gentle with the Airflow-isms. So I hope to limit that "yet another abstraction". I do agree that having great examples and documentation are going to be important. As a random idea, this https://medium.com/@fninsiima/de-mini-series-part-two-57770ff7cdf9 , can now be significantly simplified.
One less worry I hope is that aiobotocore is actually starting to relax its botocore requirements bringing it much closer to latest release: https://github.com/aio-libs/aiobotocore/pull/1037 On the requirements side there are actually not that many additional dependencies being brought in. Core fsspec does not bring any requirements. s3fs brings in three which are all covered by current ones. adlfs brings in five, all already part of our current set. Of course it does bring some complexity, but I do hope you see that it is fairly limited and if it does bring in anything it is well supported. The reason for creating common.io as a provider was that it was suggested that we might want to move a bit faster than core on the very simple (yet powerful ;-) ) FileTransferOperator. Considering this I hope you would like to make your measly +1 into a strong +1 :-). Cheers Bolke On Thu, 19 Oct 2023 at 19:48, Jarek Potiuk <ja...@potiuk.com> wrote: > Finally caught up with this one, looked through code and discussions. I am > a little torn on that one but I did some more research and I think it's a > useful abstraction. > > +1(binding) > > The big + of using fsspec is that it is already supported by the most > important "consumers" that are likely to be used in Airflow. Pandas, > Pyarrow, Iceberg. The fact that you will be able to take an S3/GCS > ObjectStoragePath as an input directly and it will transparently use the > connection of Airflow is a big plus. > > I would just add that we should get real-life DAG examples on how this > might simplify code of their DAGs, it's cool. I think the quality and > clarity of the documentation that will come with it - clearly explaining > some cases and examples on how DAG authors can make use of it to make their > DAG authoring "better" - is a key to success of this one. If we fail to > explain it, it might become yet another rarely used feature of Airflow > > There is one worry I have - it adds "yet another abstraction" to learn and > "yet another set of dependencies" to Airflow. We have a new "common.io" > provider, we have many new dependencies, we have aiobotocore as a > requirement for AWS integration for example. I already looked at the PR and > attempted to help with some of the dependency questions and problems. but > we will have a few more of those to solve and some decisions to mke should > apache-airflow-provider-common-io be default? Should it be included in the > reference image? etc. etc. This will make Airflow and its dependencies more > complex than simpler. That's why I am not strong +1! just measly +1 - > because I see how it can make airflow even "heavier" than it is now. > > J. > > > > On Thu, Oct 19, 2023 at 4:34 PM Igor Kholopov <ikholo...@google.com.invalid > > > wrote: > > > Thanks for incorporating the feedback! > > > > +1 (non-binding) > > > > On Thu, Oct 19, 2023 at 1:55 PM Dennis Akpenyi <dennisakpe...@gmail.com> > > wrote: > > > > > +1 (non-binding) > > > > > > On Thu, Oct 19, 2023 at 12:24 PM Bolke de Bruin <bdbr...@gmail.com> > > wrote: > > > > > > > Dear Community, > > > > > > > > I would like to start a vote for "AIP-58 Add Airflow ObjectStore". > > > > > > > > You can find the AIP here: > > > > > > > > > > https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=263430565 > > > > > > > > Implementing PR (most of the discussion happened here): > > > > https://github.com/apache/airflow/pull/34729 > > > > > > > > Discussion Thread (not much has happened here :-) ): > > > > Note: the title has changed from its original. > > > > > > > > https://lists.apache.org/thread/l3fkr0h6j2g4tlmsov14fywmj58t3mtp > > > > > > > > This is my binding +1m the vote will last until 12:00 UTC on 26th > > > October, > > > > and until at least 3 binding votes have been cast. > > > > > > > > Please vote accordingly: > > > > > > > > [ ] + 1 approve > > > > [ ] + 0 no opinion > > > > [ ] - 1 disapprove with the reason > > > > > > > > Only votes from PMC members and committers are binding, but other > > members > > > > of the community are encouraged to check the AIP and vote with > > > > "(non-binding)". > > > > > > > > Cheers > > > > Bolke > > > > -- > > > > > > > > -- > > > > Bolke de Bruin > > > > bdbr...@gmail.com > > > > > > > > > > -- -- Bolke de Bruin bdbr...@gmail.com