I see the following error (Drill 1.5.0):
org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR:
SocketTimeoutException: connect timed out
However, I am able to connect to the Azure Data Lake from the server that
reported the error using curl, so it does not appear to be a standard
connectivity issue.
My Storage configuration is very basic:
{
"type": "file",
"enabled": true,
"connection": "swebhdfs://azuredatalakestoragename.azuredatalakestore.net",
"workspaces": {
"root": {
"location": "/",
"writable": true,
"defaultInputFormat": null
}
},
"formats": {
"psv": {
"type": "text",
"extensions": [
"tbl",
"psv"
],
"delimiter": "|"
},
"csv": {
"type": "text",
"extensions": [
"csv"
],
"delimiter": ","
},
"tsv": {
"type": "text",
"extensions": [
"tsv"
],
"delimiter": "\t"
},
"txt": {
"type": "text",
"extensions": [
"txt"
],
"delimiter": ","
},
"parquet": {
"type": "parquet"
},
"json": {
"type": "json"
},
"avro": {
"type": "avro"
}
}
}
-----Original Message-----
From: Sudheesh Katkam [mailto:[email protected]]
Sent: Monday, August 1, 2016 11:03 AM
To: [email protected]
Subject: Re: Connecting Drill to Azure Data Lake
What failure(s) do you see?
Thank you,
Sudheesh
> On Jul 29, 2016, at 4:07 PM, Kevin Verhoeven <[email protected]>
> wrote:
>
> Hi Drill Community,
>
> Has anyone attempted to connect Drill to the Azure Data Lake? Microsoft has
> implemented a WebHDFS API over Azure Data Lake, so Drill should be able to
> connect. I'm guessing this will be similar to s3. My initial attempts have
> failed, does anyone have any ideas or experience with this connection?
>
> Regards,
>
> Kevin
>