[jira] [Comment Edited] (FALCON-1068) When scheduling a process, Falcon throws "Bad Request;Could not commit transaction due to exception during persistence"

Sowmya Ramesh (JIRA) Thu, 05 Mar 2015 11:36:43 -0800

    [ 
https://issues.apache.org/jira/browse/FALCON-1068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14349334#comment-14349334
 ]


Sowmya Ramesh edited comment on FALCON-1068 at 3/5/15 7:34 PM:
---------------------------------------------------------------

From the log, a read lock is already held when the write is performed, and a 
LockTimeoutException occurs when trying to promote the lock. Are you 
performing any other operation before submitAndSchedule other than delete?

From the code walk, MetadataMappingService.onRemove doesn't have any 
implementation, and hence no graph operation is done on deletion of an entity. 
Also, can you confirm whether retries are being performed for this?

From https://github.com/thinkaurelius/titan/wiki/Graph-Configuration: the default 
storage.lock-retries is 3 and the default storage.write-attempts is 5.
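
To make the retry question concrete, here is a minimal sketch of what a bounded retry around a lock timeout looks like. It is not Titan's or Berkeley DB JE's implementation: `LockTimeout`, `with_retries`, and the wait time are hypothetical names and values chosen for illustration, and only the default of 3 attempts is taken from the configuration page cited above.

```python
import time


class LockTimeout(Exception):
    """Hypothetical stand-in for a lock timeout such as JE's LockTimeoutException."""


def with_retries(operation, attempts=3, wait_seconds=0.1):
    """Call operation(); on LockTimeout, try again up to `attempts` times in total."""
    for attempt in range(1, attempts + 1):
        try:
            return operation()
        except LockTimeout:
            if attempt == attempts:
                # Out of attempts: surface the last timeout to the caller.
                raise
            # Give the competing lock holder a chance to finish.
            time.sleep(wait_seconds)


if __name__ == "__main__":
    state = {"calls": 0}

    def commit():
        # Simulated commit that hits a lock timeout on the first two calls.
        state["calls"] += 1
        if state["calls"] < 3:
            raise LockTimeout("lock expired")
        return "committed"

    print(with_retries(commit, attempts=3, wait_seconds=0))
    print(state["calls"])
```

If retries of this kind were active, the server log would be expected to show several timeouts for the same transaction before the commit is rejected, which is what the question above is trying to establish.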

{code}
Caused by: com.sleepycat.je.LockTimeoutException: (JE 5.0.73) Lock expired. 
Locker 1719836551 9040_1915320434@qtp-1490009389-89 - 
723ce874-0b88-4087-a742-3ae233e05df0_Txn: waited for lock on database=edgestore 
LockAddr:864123167 LSN=0x3/0x4502f9 type=WRITE grant=WAIT_PROMOTION 
timeoutMillis=500 startTime=1425489845022 endTime=1425489845523
Owners: [<LockInfo locker="1719836551 9040_1915320434@qtp-1490009389-89 - 
723ce874-0b88-4087-a742-3ae233e05df0_Txn" type="READ"/>, <LockInfo 
locker="1687215918 9031_344550172@qtp-1490009389-77_Txn" type="READ"/>]
Waiters: []
Transaction 1719836551 9040_1915320434@qtp-1490009389-89 - 
723ce874-0b88-4087-a742-3ae233e05df0_Txn owns LockAddr:864123167 <LockInfo 
locker="1719836551 9040_1915320434@qtp-1490009389-89 - 
723ce874-0b88-4087-a742-3ae233e05df0_Txn" type="READ"/>
Transaction 1719836551 9040_1915320434@qtp-1490009389-89 - 
723ce874-0b88-4087-a742-3ae233e05df0_Txn waits for LockAddr:864123167
{code}
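
The trace above can be read mechanically. The following sketch is only a reading aid, assuming the message layout shown in this excerpt (it is not a Berkeley DB JE API, and `summarize` is a hypothetical helper). It extracts the waiting locker, the lock owners, and the wait duration, with the e-mail line wrapping removed.

```python
import re

# Excerpt of the LockTimeoutException message quoted above, unwrapped.
LOG = (
    'Locker 1719836551 9040_1915320434@qtp-1490009389-89 - '
    '723ce874-0b88-4087-a742-3ae233e05df0_Txn: waited for lock on database=edgestore '
    'LockAddr:864123167 LSN=0x3/0x4502f9 type=WRITE grant=WAIT_PROMOTION '
    'timeoutMillis=500 startTime=1425489845022 endTime=1425489845523\n'
    'Owners: [<LockInfo locker="1719836551 9040_1915320434@qtp-1490009389-89 - '
    '723ce874-0b88-4087-a742-3ae233e05df0_Txn" type="READ"/>, <LockInfo '
    'locker="1687215918 9031_344550172@qtp-1490009389-77_Txn" type="READ"/>]\n'
    'Waiters: []\n'
)


def summarize(log):
    """Return who waited, what it already holds, who else owns the lock, and the wait time."""
    waiter = re.search(r"Locker (\d+) ", log).group(1)
    # Each owner is rendered as <LockInfo locker="<id> <name>" type="<TYPE>"/>.
    owners = re.findall(r'<LockInfo locker="(\d+) [^"]*" type="(\w+)"/>', log)
    start = int(re.search(r"startTime=(\d+)", log).group(1))
    end = int(re.search(r"endTime=(\d+)", log).group(1))
    return {
        "waiter": waiter,
        "waiter_holds": [kind for locker, kind in owners if locker == waiter],
        "other_owners": [(locker, kind) for locker, kind in owners if locker != waiter],
        "waited_ms": end - start,
    }


summary = summarize(LOG)
print(summary)
```

For this excerpt the waiting locker 1719836551 already holds a READ lock on the same address, a second locker 1687215918 also holds READ, and the wait lasted 501 ms against a 500 ms timeout. That matches the description of a read-to-write promotion that could not complete while another reader was present.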


was (Author: sowmyaramesh):
From the log, a read lock is already held when the write is performed, and a 
LockTimeoutException occurs when trying to promote the lock. Are you 
performing any other operation before submitAndSchedule other than delete?

From the code walk, MetadataMappingService.onRemove doesn't have any 
implementation, and hence no graph operation is done on deletion of an entity. Also, 
can you confirm whether retries are performed for this?

From https://github.com/thinkaurelius/titan/wiki/Graph-Configuration: the default 
storage.lock-retries is 3 and the default storage.write-attempts is 5.

{code}
Caused by: com.sleepycat.je.LockTimeoutException: (JE 5.0.73) Lock expired. 
Locker 1719836551 9040_1915320434@qtp-1490009389-89 - 
723ce874-0b88-4087-a742-3ae233e05df0_Txn: waited for lock on database=edgestore 
LockAddr:864123167 LSN=0x3/0x4502f9 type=WRITE grant=WAIT_PROMOTION 
timeoutMillis=500 startTime=1425489845022 endTime=1425489845523
Owners: [<LockInfo locker="1719836551 9040_1915320434@qtp-1490009389-89 - 
723ce874-0b88-4087-a742-3ae233e05df0_Txn" type="READ"/>, <LockInfo 
locker="1687215918 9031_344550172@qtp-1490009389-77_Txn" type="READ"/>]
Waiters: []
Transaction 1719836551 9040_1915320434@qtp-1490009389-89 - 
723ce874-0b88-4087-a742-3ae233e05df0_Txn owns LockAddr:864123167 <LockInfo 
locker="1719836551 9040_1915320434@qtp-1490009389-89 - 
723ce874-0b88-4087-a742-3ae233e05df0_Txn" type="READ"/>
Transaction 1719836551 9040_1915320434@qtp-1490009389-89 - 
723ce874-0b88-4087-a742-3ae233e05df0_Txn waits for LockAddr:864123167
{code}

> When scheduling a process, Falcon throws "Bad Request;Could not commit 
> transaction due to exception during persistence"
> -----------------------------------------------------------------------------------------------------------------------
>
>                 Key: FALCON-1068
>                 URL: https://issues.apache.org/jira/browse/FALCON-1068
>             Project: Falcon
>          Issue Type: Bug
>            Reporter: Adam Kawa
>         Attachments: falcon.application.log.FALCON-1068.rtf
>
>
> I have a simple script "manage-entity.sh process dss" that deletes, submits, 
> and schedules a Falcon process. 
> A couple of times per week, I get the "FalconCLIException: Bad Request;Could 
> not commit transaction due to exception during persistence" when submitting 
> the process. 
> The workaround is to restart the Falcon server...
> e.g.:
> {code}
> $ ./manage-entity.sh process dss my-process.xml
> falcon/default/my-process(process) removed successfully (KILLED in ENGINE)
> Stacktrace:
> org.apache.falcon.client.FalconCLIException: Bad Request;Could not commit 
> transaction due to exception during persistence
>       at 
> org.apache.falcon.client.FalconCLIException.fromReponse(FalconCLIException.java:44)
>       at 
> org.apache.falcon.client.FalconClient.checkIfSuccessful(FalconClient.java:1162)
>       at 
> org.apache.falcon.client.FalconClient.sendEntityRequestWithObject(FalconClient.java:684)
>       at 
> org.apache.falcon.client.FalconClient.submitAndSchedule(FalconClient.java:347)
>       at org.apache.falcon.cli.FalconCLI.entityCommand(FalconCLI.java:371)
>       at org.apache.falcon.cli.FalconCLI.run(FalconCLI.java:182)
>       at org.apache.falcon.cli.FalconCLI.main(FalconCLI.java:132)
> $ ./falcon-restart.sh
> Hadoop is installed, adding hadoop classpath to falcon classpath
> Hadoop is installed, adding hadoop classpath to falcon classpath
> falcon started using hadoop version:  Hadoop 2.5.0
> $ ./manage-entity.sh process dss my-process.xml
> falcon/default/my-process(process) removed successfully (KILLED in ENGINE)
> schedule/default/my-process(process) scheduled successfully
> submit/falcon/default/Submit successful (process) my-process
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
