Hi Wouter

You may want to upgrade to jets3t 0.7.1 or higher which will likely solve your 
problem. The release notes of jets3t 0.7.1 do say :
REST implementation was mistakenly limited to 20 simultaneous connections.

Alternatively, you can avoid some of these types of problems by using Amazon 
Elastic MapReduce.

Thanks
Vaibhav

From: Wouter de Bie [mailto:wou...@spotify.com]
Sent: Wednesday, July 06, 2011 11:21 AM
To: user@hive.apache.org
Subject: Re: Hive session locking up after 4 queries using S3

Hi!

I'm using Hive 0.7.0 and Hadoop 0.20, both from Cloudera's cdh3u0. Jets3t is 
used from jets3t-0.6.1.jar. I've just found a post 
(https://forums.aws.amazon.com/thread.jspa?threadID=19076&tstart=0) that 
describes this issue and I'm trying to figure out if this bug is in this 
version.

// Wouter

--
Wouter de Bie
Developer Business Intelligence, Spotify
wou...@spotify.com<mailto:wou...@spotify.com>
+46 72 018 0777

This e-mail (including any attachments) may contain information that is 
confidential and/or privileged. It is intended only for the recipient(s). If 
you have reason to believe that you are not the intended recipient of this 
e-mail, please contact the sender immediately and delete the e-mail from your 
computer.


On Wednesday, July 6, 2011 at 7:39 PM, Aggarwal, Vaibhav wrote:

Could you please tell us which Hadoop and Hive version are you using?

Looks like you might be using an older version of Hadoop (more specifically one 
which ships with old version of jets3t).



Thanks

Vaibhav



From: Wouter de Bie [mailto:wou...@spotify.com]
Sent: Wednesday, July 06, 2011 9:07 AM
To: user@hive.apache.org<mailto:user@hive.apache.org>
Subject: Hive session locking up after 4 queries using S3



Hi all,



I'm using Hive with the s3native FS. Today, I noticed that hive locks up after 
4 queries that directly access S3 (select * from mytable limit 10). With debug 
logging on, I get the following output:



2011-07-06 15:54:31,459 DEBUG s3native.NativeS3FileSystem 
(NativeS3FileSystem.java:getFileStatus(393)) - getFileStatus retrieving 
metadata for key 
'tmp/hive-mapred/hive_2011-07-06_15-54-29_881_4253697128840334916/-mr-10000'

2011-07-06 15:54:31,459 DEBUG httpclient.RestS3Service 
(RestS3Service.java:getObjectImpl(1511)) - Retrieving Head information for 
bucket XXXXXXXX and object 
tmp/hive-mapred/hive_2011-07-06_15-54-29_881_4253697128840334916/-mr-10000

2011-07-06 15:54:31,460 DEBUG service.Jets3tProperties 
(Jets3tProperties.java:getBoolProperty(314)) - 
s3service.disable-dns-buckets=false

2011-07-06 15:54:31,460 DEBUG httpclient.RestS3Service 
(RestS3Service.java:setupConnection(811)) - S3 URL: 
https://XXXXXXXX:443/tmp%2Fhive-mapred%2Fhive_2011-07-06_15-54-29_881_4253697128840334916%2F-mr-10000

2011-07-06 15:54:31,460 DEBUG httpclient.RestS3Service 
(RestS3Service.java:performRequest(334)) - Performing HEAD request for 
'https://XXXXXXXXXXXXXX/tmp%2Fhive-mapred%2Fhive_2011-07-06_15-54-29_881_4253697128840334916%2F-mr-10000',
 expecting response code 200

2011-07-06 15:54:31,461 DEBUG httpclient.RestS3Service 
(RestS3Service.java:buildAuthorizationString(872)) - Adding authorization for 
AWS Access Key 'XXXXXXXXXXXXX'.

2011-07-06 15:54:31,461 DEBUG httpclient.RestS3Service 
(RestS3Service.java:buildAuthorizationString(922)) - Canonical string ('|' is a 
newline): HEAD|||Wed, 06 Jul 2011 15:54:31 
GMT|/XXXXXXXXX/tmp%2Fhive-mapred%2Fhive_2011-07-06_15-54-29_881_4253697128840334916%2F-mr-10000

2011-07-06 15:54:31,461 DEBUG httpclient.HttpClient 
(HttpClient.java:executeMethod(322)) - enter 
HttpClient.executeMethod(HttpMethod)

2011-07-06 15:54:31,462 DEBUG httpclient.HttpClient 
(HttpClient.java:executeMethod(373)) - enter 
HttpClient.executeMethod(HostConfiguration,HttpMethod,HttpState)

2011-07-06 15:54:31,462 DEBUG httpclient.MultiThreadedHttpConnectionManager 
(MultiThreadedHttpConnectionManager.java:getConnectionWithTimeout(383)) - enter 
HttpConnectionManager.getConnectionWithTimeout(HostConfiguration, long)

2011-07-06 15:54:31,462 DEBUG httpclient.MultiThreadedHttpConnectionManager 
(MultiThreadedHttpConnectionManager.java:getConnectionWithTimeout(390)) - 
HttpConnectionManager.getConnection:  config = 
HostConfiguration[host=https://XXXXXXXXX.s3.amazonaws.com], timeout = 0

2011-07-06 15:54:31,462 DEBUG httpclient.MultiThreadedHttpConnectionManager 
(MultiThreadedHttpConnectionManager.java:getHostPool(775)) - enter 
HttpConnectionManager.ConnectionPool.getHostPool(HostConfiguration)

2011-07-06 15:54:31,463 DEBUG httpclient.MultiThreadedHttpConnectionManager 
(MultiThreadedHttpConnectionManager.java:doGetConnection(494)) - Unable to get 
a connection, waiting..., 
hostConfig=HostConfiguration[host=https://XXXXXXXXXX.s3.amazonaws.com]



Does anyone know if I can do anything to prevent this? It looks like 
connections are not returned correctly to the pool..



// Wouter



Reply via email to