GitHub user MaxGekk opened a pull request:

    https://github.com/apache/spark/pull/21769

    [SPARK-24805][SQL] Do not ignore avro files without extensions

    ## What changes were proposed in this pull request?
    
    In this PR, I propose changing the default behaviour of the AVRO datasource, which 
currently ignores files without the `.avro` extension when reading. This PR 
sets the default value of `avro.mapred.ignore.inputs.without.extension` to 
`false` when the parameter is not set by the user.
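    
    As a rough illustration of the proposed default, here is a simplified model in 
plain Python. It is not the actual Spark code: the function names and the use of a 
dict in place of the Hadoop configuration are assumptions made for the sketch; only 
the configuration key comes from this PR.

```python
# Hypothetical model of the option lookup; this is NOT Spark's implementation.
IGNORE_KEY = "avro.mapred.ignore.inputs.without.extension"


def ignores_files_without_extension(hadoop_conf):
    """Return True if files lacking the .avro suffix should be skipped."""
    # Proposed behaviour: when the key is unset, fall back to "false".
    raw = hadoop_conf.get(IGNORE_KEY, "false")
    return str(raw).strip().lower() == "true"


def select_input_files(paths, hadoop_conf):
    """Filter candidate input paths according to the (modelled) setting."""
    if ignores_files_without_extension(hadoop_conf):
        return [p for p in paths if p.endswith(".avro")]
    return list(paths)
```

    With an empty configuration, `select_input_files(["a.avro", "b"], {})` keeps both 
paths; setting the key to `"true"` explicitly restores the old filtering.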
    
    ## How was this patch tested?
    
    Added a test file in AVRO format without an extension, and a new test for 
reading the file with and without a specified schema.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/MaxGekk/spark-1 avro-without-extension

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/21769.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #21769
    
----
commit 35063ef8e734bdeb39316e02b2e8451d0d75d43a
Author: Maxim Gekk <maxim.gekk@...>
Date:   2018-07-14T09:49:30Z

    Test for reading files without avro extension

commit 760f98e7aecb5a4c267599b318479d7f2ade165a
Author: Maxim Gekk <maxim.gekk@...>
Date:   2018-07-14T10:38:33Z

    Fix tests

commit 8562a8d43868d551efa6a0e9a00d50cdc838a178
Author: Maxim Gekk <maxim.gekk@...>
Date:   2018-07-14T10:53:24Z

    Adding ticket number to test title

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org