Shwetha G S created OOZIE-1456:
----------------------------------

             Summary: MemoryLocks.getLock() doesn't work
                 Key: OOZIE-1456
                 URL: https://issues.apache.org/jira/browse/OOZIE-1456
             Project: Oozie
          Issue Type: Bug
            Reporter: Shwetha G S
            Priority: Critical


    private LockToken getLock(String resource, Type type, long wait) throws 
InterruptedException {
        ReentrantReadWriteLock lockEntry;
        synchronized (locks) {
            if (locks.containsKey(resource)) {
                lockEntry = locks.get(resource);
            }
            else {
                lockEntry = new ReentrantReadWriteLock(true);
                locks.put(resource, lockEntry);
            }
        }

        Lock lock = (type.equals(Type.READ)) ? lockEntry.readLock() : 
lockEntry.writeLock();

        if (wait == -1) {
            lock.lock();
        }
        else {
            if (wait > 0) {
                if (!lock.tryLock(wait, TimeUnit.MILLISECONDS)) {
                    return null;
                }
            }
            else {
                if (!lock.tryLock()) {
                    return null;
                }
            }
        }
        synchronized (locks) {
            if (!locks.containsKey(resource)) {
                locks.put(resource, lockEntry);
            }
        }
        return new LockToken(lockEntry, lock, resource);
    }

If there are 2 concurrent threads trying to acquire lock on same key and lock 
is free, both will get lockEntry = new ReentrantReadWriteLock(true); and both 
will get the same lock!

Here is the log from oozie:
2013-07-12 14:48:32,503 DEBUG CoordActionInputCheckXCommand:545 - USER[-] 
GROUP[-] TOKEN[-] APP[-] JOB[0012816-130326154523555-oozie-oozi-C] 
ACTION[0012816-130326154523555-oozie-oozi-C@154290] Acquired lock for 
[0012816-130326154523555-oozie-oozi-C] in [coord_action_input]
2013-07-12 14:48:32,503 DEBUG CoordMaterializeTransitionXCommand:545 - 
USER[fetl] GROUP[-] TOKEN[] APP[<coord name>] 
JOB[0012816-130326154523555-oozie-oozi-C] ACTION[-] Acquired lock for 
[0055279-130607113249683-oozie-oozi-C] in [coord_mater]

After this the materialization was stuck and we had to bounce oozie. Still need 
to figure out why this paused materialization

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to