[jira] [Created] (HBASE-5846) HBase rpm packing is broken at multiple places

2012-04-20 Thread Shrijeet Paliwal (Created) (JIRA)
HBase rpm packing is broken at multiple places
--

 Key: HBASE-5846
 URL: https://issues.apache.org/jira/browse/HBASE-5846
 Project: HBase
  Issue Type: Bug
  Components: build
Affects Versions: 0.92.1
 Environment: CentOS release 5.7 (Final)
Reporter: Shrijeet Paliwal


Here is how I executed rpm build: 
{noformat}
MAVEN_OPTS=-Xmx2g mvn clean package assembly:single -Prpm -DskipTests
{noformat}

The issues with the rpm build are: 
* There is no clean (%clean) section in the hbase.spec file . Last run can 
leave stuff in RPM_BUILD_ROOT which in turn will fail build. As a fix I added 
'rm -rf $RPM_BUILD_ROOT' to %clean section

* The Buildroot is set to _build_dir . The build fails with this error. 
{noformat}
cp: cannot copy a directory, 
`/data/9adda425-1f1e-4fe5-8a53-83bd2ce5ad45/app/jenkins/workspace/hbase.92/target/rpm/hbase/BUILD',
 into itself, 
`/data/9adda425-1f1e-4fe5-8a53-83bd2ce5ad45/app/jenkins/workspace/hbase.92/target/rpm/hbase/BUILD/BUILD'
{noformat}
If we set it to ' %{_tmppath}/%{name}-%{version}-root' build passes

* The src/packages/update-hbase-env.sh script will leave inconsistent state if 
'yum update hbase' is executed. It deletes data from /etc/init.d/hbase* and 
does not put scripts back during update. 




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HBASE-5779) Master goes to infinite loop if AccessControlException occurs while setting cluster id during initialization

2012-04-12 Thread Shrijeet Paliwal (Created) (JIRA)
Master goes to infinite loop if AccessControlException occurs while setting 
cluster id during initialization 
-

 Key: HBASE-5779
 URL: https://issues.apache.org/jira/browse/HBASE-5779
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1
Reporter: Shrijeet Paliwal


Steps to reproduce: 
- change permission of /hbase to a user other than one running hbase
- delete hbase.id if already exists
- start master, it will try to create cluster ID file in /hbase and fail while 
doing so with org.apache.hadoop.security.AccessControlException

From this point it will go to infinite loop. 

Reason: org.apache.hadoop.hbase.util.FSUtils.setClusterId  has a wait  0 and 
no control over retries when called during master initialization. 

Quoting : checkRootDir in MasterFileSystem
{noformat}
// Make sure cluster ID exists
if (!FSUtils.checkClusterIdExists(fs, rd, c.getInt(
HConstants.THREAD_WAKE_FREQUENCY, 10 * 1000))) {
  FSUtils.setClusterId(fs, rd, UUID.randomUUID().toString(), c.getInt(
  HConstants.THREAD_WAKE_FREQUENCY, 10 * 1000));
}

{noformat}


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HBASE-5585) Add a feature in shell to allow adding entries into META easily

2012-03-15 Thread Shrijeet Paliwal (Created) (JIRA)
Add a feature in shell to allow adding entries into META easily
---

 Key: HBASE-5585
 URL: https://issues.apache.org/jira/browse/HBASE-5585
 Project: HBase
  Issue Type: Improvement
  Components: shell
Reporter: Shrijeet Paliwal
Priority: Minor


From IRC 

{noformat}
sulabhc St^Ack: that fixed most of the issue, just one last question, I removed 
some META entries how do I put it back ?
sulabhc St^Ack: putting from Hbase-shell does not seem to be possible
St^Ack  sulabhc: it is but the content in .META. is serialized Writable of 
HRegionInfo.
St^Ack  Can you find the region in the fs that you want to put back?
St^Ack  If so, cat its .regioninfo
St^Ack  it has binary format of the HRegionInfo that needs to be in .META. and 
a txt version.
St^Ack  You'll need to create an HRegionInfo, serialize it as bytes, and then 
put that into .META.
St^Ack  Look around in the catalog package, the MetaEditor class, for how to do 
this from java.
St^Ack  We should really make this easier to do in shell...
St^Ack  or look in bin to see how we do some of this via (j)ruby
{noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HBASE-5582) No HServerInfo found for should be a WARNING message

2012-03-14 Thread Shrijeet Paliwal (Created) (JIRA)
No HServerInfo found for should be a WARNING message
--

 Key: HBASE-5582
 URL: https://issues.apache.org/jira/browse/HBASE-5582
 Project: HBase
  Issue Type: Improvement
  Components: master
Affects Versions: 0.90.4
Reporter: Shrijeet Paliwal
Priority: Trivial


The message from RegionServerTracker No HServerInfo found for... is easy to 
miss. It should not be INFO. 


From irc chat 

{noformat}
jdcryans
JohnP789: can you grep for No HServerInfo found for in that log?
jdcryans
wait I see it
jdcryans
ok there's your problem
shrijeet_
Yes it is there
shrijeet_
jdcryans: it should be INFO, why?
jdcryans
it shouldn't be INFO, it's so easy to miss
jdcryans
it's not the first time we have to look super closely to figure this one out
shrijeet_
yes , I will file a jira
jdcryans
in any case it's a mismatch in that machine's DNS config
shrijeet_
anyways JohnP789 is waiting :) go on
JohnP789
haha!
JohnP789
yes...  ???  :-)
jdcryans
the master is expecting a RS called localhost.localdomain,53875,1328924863478
17:26 jdcryans
but the RS calls itself localhost,53875,1328924863478
{noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HBASE-5080) Make HConnectionManager's prefetchRegionCache error trace more helpful

2011-12-20 Thread Shrijeet Paliwal (Created) (JIRA)
Make HConnectionManager's prefetchRegionCache error trace more helpful
--

 Key: HBASE-5080
 URL: https://issues.apache.org/jira/browse/HBASE-5080
 Project: HBase
  Issue Type: Sub-task
  Components: client
Affects Versions: 0.90.3
Reporter: Shrijeet Paliwal
Priority: Minor


We catch RuntimeException in this HConnectionManager's  prefetchRegionCache 
method. The trace is not very helpful since it eats the information about the 
line where RuntimeException actually happened. 



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HBASE-5041) Major compaction on non existing table does not throw error

2011-12-15 Thread Shrijeet Paliwal (Created) (JIRA)
Major compaction on non existing table does not throw error 


 Key: HBASE-5041
 URL: https://issues.apache.org/jira/browse/HBASE-5041
 Project: HBase
  Issue Type: Bug
  Components: regionserver, shell
Affects Versions: 0.90.3
Reporter: Shrijeet Paliwal


Following will not complain even if fubar does not exist

{code}
echo major_compact 'fubar' | $HBASE_HOME/bin/hbase shell
{code}

The downside for this defect is that major compaction may be skipped due to
a typo by Ops.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HBASE-5035) Runtime exceptions during meta scan

2011-12-14 Thread Shrijeet Paliwal (Created) (JIRA)
Runtime exceptions during meta scan
---

 Key: HBASE-5035
 URL: https://issues.apache.org/jira/browse/HBASE-5035
 Project: HBase
  Issue Type: Bug
  Components: client
Affects Versions: 0.90.3
Reporter: Shrijeet Paliwal


Version: 0.90.3 + patches back ported 

The other day our client started spitting these two runtime exceptions. Not all 
clients connected to the cluster were under impact. Only 4 of them. While 3 of 
them were throwing NPE, one of them was throwing 
ArrayIndexOutOfBoundsException. The errors are : 

1. http://pastie.org/2987926
2. http://pastie.org/2987927

Clients did not recover from this and I had to restart them. 

Motive of this jira is to identify and put null checks at appropriate places. 
Also with the given stack trace I can not tell which line caused NPE of AIOBE, 
hence additional motive is to make the trace more helpful. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HBASE-4980) Null pointer exception in HBaseClient receiveResponse

2011-12-07 Thread Shrijeet Paliwal (Created) (JIRA)
Null pointer exception in HBaseClient receiveResponse
-

 Key: HBASE-4980
 URL: https://issues.apache.org/jira/browse/HBASE-4980
 Project: HBase
  Issue Type: Bug
  Components: client
Affects Versions: 0.92.0
Reporter: Shrijeet Paliwal


Relevant Stack trace: 

2011-11-30 13:10:26,557 [IPC Client (47) connection to 
xx.xx.xx/172.22.4.68:60020 from an unknown user] WARN  
org.apache.hadoop.ipc.HBaseClient - Unexpected exception receiving call 
responses
java.lang.NullPointerException
-at 
org.apache.hadoop.hbase.ipc.HBaseClient$Connection.receiveResponse(HBaseClient.java:583)
-at 
org.apache.hadoop.hbase.ipc.HBaseClient$Connection.run(HBaseClient.java:511)


{code}
  if (LOG.isDebugEnabled())
  LOG.debug(getName() +  got value # + id);
Call call = calls.remove(id);
// Read the flag byte
byte flag = in.readByte();
boolean isError = ResponseFlag.isError(flag);
if (ResponseFlag.isLength(flag)) {
  // Currently length if present is unused.
  in.readInt();
}
int state = in.readInt(); // Read the state.  Currently unused.
if (isError) {
  //noinspection ThrowableInstanceNeverThrown
  call.setException(new RemoteException( WritableUtils.readString(in),
  WritableUtils.readString(in)));
} else {
{code}

This line {code}Call call = calls.remove(id);{code}  may return a null 'call'. 
It is so because if you have rpc timeout enable, we proactively clean up other 
calls which have expired their lifetime along with the call for which socket 
timeout exception happend.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HBASE-4633) Potential memory leak in client RPC timeout mechanism

2011-10-19 Thread Shrijeet Paliwal (Created) (JIRA)
Potential memory leak in client RPC timeout mechanism
-

 Key: HBASE-4633
 URL: https://issues.apache.org/jira/browse/HBASE-4633
 Project: HBase
  Issue Type: Bug
  Components: client
Affects Versions: 0.90.3
 Environment: HBase version: 0.90.3 + Patches , Hadoop version: CDH3u0
Reporter: Shrijeet Paliwal


Relevant Jiras: https://issues.apache.org/jira/browse/HBASE-2937,
https://issues.apache.org/jira/browse/HBASE-4003

We have been using the 'hbase.client.operation.timeout' knob
introduced in 2937 for quite some time now. It helps us enforce SLA.
We have two HBase clusters and two HBase client clusters. One of them
is much busier than the other.

We have seen a deterministic behavior of clients running in busy
cluster. Their (client's) memory footprint increases consistently
after they have been up for roughly 24 hours.
This memory footprint almost doubles from its usual value (usual case
== RPC timeout disabled). After much investigation nothing concrete
came out and we had to put a hack
which keep heap size in control even when RPC timeout is enabled. Also
note , the same behavior is not observed in 'not so busy
cluster.

The patch is here : https://gist.github.com/1288023

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira