Hi,

my previous two mails somehow didn't make it to the list, so here is another attempt:

I am trying to use gridway(5.6.1) to schedule simple test jobs (/bin/ls)
across two machines, both with gt4.2.1 installed and running. One of the
machines is running Debian, the other is a VM running Ubuntu.
Communication and Authentification apparently works fine, the machines
see and trust each other, and the jobs gets scheduled. However, in *both
directions*, only those jobs running on the local machine (from where
they are started using gwsubmit) actually get "done" - the others remain
in "wrap pend" state. Apparently they are executed correctly on the
remote machine, since the result output is there, but somehow the
notification to the originating machine fails. Searching the list,
enabling debugging and digging in the logs I found this warning/exception at the
end of the globus container.log on the remote machine:

<...snip...>
2011-07-01T15:40:41.231+02:00 INFO  impl.DefaultIndexService
[ServiceThread-58,performDefaultRegistrations:261]
guid=b646c8e0-a3e7-11e0-b059-b76342becd29
event=org.globus.mds.index.performDefaultRegistrations.end status=0
2011-07-01T15:41:21.751+02:00 INFO
PersistentManagedExecutableJobResource.ce605ef0-a3e7-11e0-b059-b76342becd29
[ServiceThread-57,start:761] Job ce605ef0-a3e7-11e0-b059-b76342becd29
with client submission-id null accepted for local user 'the'
2011-07-01T15:41:22.032+02:00 INFO  handler.SubmitStateHandler
[pool-1-thread-7,process:172] Job ce605ef0-a3e7-11e0-b059-b76342becd29
submitted with local job ID 'ce9b08e8-a3e7-11e0-bcc8-b7ebd4913b23:17697'
2011-07-01T15:41:23.327+02:00 DEBUG impl.SimpleSubscriptionTopicListener
[pool-1-thread-1,setPort:287] Security properties not null: not secure conv
2011-07-01T15:41:23.327+02:00 DEBUG impl.SimpleSubscriptionTopicListener
[pool-1-thread-1,setPort:314] set port with false
2011-07-01T15:41:23.366+02:00 DEBUG impl.SimpleSubscriptionTopicListener
[pool-1-thread-1,setPort:290] Setting security properties
2011-07-01T15:41:23.482+02:00 DEBUG impl.SimpleSubscriptionTopicListener
[pool-1-thread-3,setPort:287] Security properties not null: not secure conv
2011-07-01T15:41:23.483+02:00 DEBUG impl.SimpleSubscriptionTopicListener
[pool-1-thread-3,setPort:314] set port with false
2011-07-01T15:41:23.483+02:00 DEBUG impl.SimpleSubscriptionTopicListener
[pool-1-thread-3,setPort:290] Setting security properties
2011-07-01T15:41:23.505+02:00 WARN  impl.SimpleSubscriptionTopicListener
[pool-1-thread-3,topicChanged:129] [JWSCORE-169] Failed to send
notification for subscription with key
'B26B14DD21498C52B1E38CC2F042B0AF0E65BAE6+ce647da0-a3e7-11e0-b059-b76342becd29':

java.net.ConnectException: Connection refused
2011-07-01T15:41:23.506+02:00 DEBUG impl.SimpleSubscriptionTopicListener
[pool-1-thread-3,topicChanged:132]
javax.xml.rpc.JAXRPCException: java.net.ConnectException: Connection refused
        at org.apache.axis.client.Call.invokeOneWay(Call.java:1871)
        at
org.oasis.wsn.NotificationConsumerSOAPBindingStub.notify(NotificationConsumerSOAPBindingStub.java:701)
        at
org.globus.wsrf.impl.SimpleSubscriptionTopicListener.notify(SimpleSubscriptionTopicListener.java:256)
        at
org.globus.wsrf.impl.SimpleSubscriptionTopicListener.topicChanged(SimpleSubscriptionTopicListener.java:123)
        at org.globus.wsrf.impl.SimpleTopic.topicChanged(SimpleTopic.java:205)
        at org.globus.wsrf.impl.SimpleTopic.notify(SimpleTopic.java:112)
        at
org.globus.exec.service.exec.ManagedExecutableJobResource.setState(ManagedExecutableJobResource.java:909)
        at
org.globus.exec.service.exec.processing.handler.CleanUpStateHandler.process(CleanUpStateHandler.java:56)
        at
org.globus.exec.service.exec.processing.handler.InternalStateHandler.processInternalState(InternalStateHandler.java:49)
        at
org.globus.exec.service.exec.processing.StateMachine.processInternalState(StateMachine.java:121)
        at
org.globus.exec.service.exec.processing.StateProcessingTask.run(StateProcessingTask.java:82)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
        at java.lang.Thread.run(Thread.java:662)
<...snip...>

The VM has a static IP. Have you got any clue for me on what could be
the problem? Anything else I should provide for analysis?

Thanks,
Timo

Reply via email to