Hi David,
The application is failing on Data torrent UI.

Application Overview Application Overview
smallWindow1  restart
id: 0009 user: hadoop uptime: 00:00:06
State

FAILED (FAILED)
state

-
current wID

-
recovery wID
Performance

-
latency (ms)

-
processed/s

-
emitted/s

-
total processed

-
total emitted
Physical

-
operators

- / -
planned/alloc. ctnrs

-
allocated mem.


Below is the Exception I am getting in .dt/logs/dt.log but  do not think, it is 
causing failures as I have seen this even when there is no failure.



java.io.FileNotFoundException: File does not exist: 
/user/eid/datatorrent/apps/application_1467820033799_0009/events/index.txt

        at 
org.apache.hadoop.hdfs.server.namenode.INodeFile.valueOf(INodeFile.java:71)

        at 
org.apache.hadoop.hdfs.server.namenode.INodeFile.valueOf(INodeFile.java:61)

        at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getBlockLocationsInt(FSNamesystem.java:1828)

        at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getBlockLocations(FSNamesystem.java:1799)

        at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getBlockLocations(FSNamesystem.java:1712)

        at 
org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.getBlockLocations(NameNodeRpcServer.java:587)

        at 
org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.getBlockLocations(ClientNamenodeProtocolServerSideTranslatorPB.java:365)

        at 
org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java)

        at 
org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:632)

        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969)

        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2049)

        at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2045)

        at java.security.AccessController.doPrivileged(Native Method)

        at javax.security.auth.Subject.doAs(Subject.java:422)

        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)

        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2043)


        at sun.reflect.GeneratedConstructorAccessor94.newInstance(Unknown 
Source)

        at 
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)

        at java.lang.reflect.Constructor.newInstance(Constructor.java:423)

        at 
org.apache.hadoop.ipc.RemoteException.instantiateException(RemoteException.java:106)

        at 
org.apache.hadoop.ipc.RemoteException.unwrapRemoteException(RemoteException.java:73)

        at 
org.apache.hadoop.hdfs.DFSClient.callGetBlockLocations(DFSClient.java:1231)

        at 
org.apache.hadoop.hdfs.DFSClient.getLocatedBlocks(DFSClient.java:1216)

        at 
org.apache.hadoop.hdfs.DFSClient.getLocatedBlocks(DFSClient.java:1204)

        at 
org.apache.hadoop.hdfs.DFSInputStream.fetchLocatedBlocksAndGetLastBlockLength(DFSInputStream.java:314)

        at 
org.apache.hadoop.hdfs.DFSInputStream.openInfo(DFSInputStream.java:280)

        at org.apache.hadoop.hdfs.DFSInputStream.<init>(DFSInputStream.java:272)

        at org.apache.hadoop.hdfs.DFSClient.open(DFSClient.java:1529)

        at 
org.apache.hadoop.hdfs.DistributedFileSystem$3.doCall(DistributedFileSystem.java:303)

        at 
org.apache.hadoop.hdfs.DistributedFileSystem$3.doCall(DistributedFileSystem.java:299)

        at 
org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)

        at 
org.apache.hadoop.hdfs.DistributedFileSystem.open(DistributedFileSystem.java:299)

        at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:771)

        at 
com.datatorrent.stram.client.EventsAgent.getLatestEvents(EventsAgent.java:118)

                                                                                
                                                                       4093,2-9 
     98%


On the Hadoop Yarn, I do see container failure exception but it does not allow 
me the capture the details:


Application application_1467820033799_0009 failed 2 times due to AM Container 
for appattempt_1467820033799_0009_000002 exited with exitCode: 1
For more detailed output, check application tracking 
page:http://ip-10-205-84-36.dqa.capitalone.com:8088/cluster/app/application_1467820033799_0009Then,
 click on links to logs of each attempt.
Diagnostics: Exception from container-launch.
Container id: container_1467820033799_0009_02_000001
Exit code: 1
Stack trace: ExitCodeException exitCode=1:
at org.apache.hadoop.util.Shell.runCommand(Shell.java:545)
at org.apache.hadoop.util.Shell.run(Shell.java:456)
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:722)
at 
org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:212)
at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:302)
at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:82)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
Container exited with a non-zero exit code 1
Failing this attempt. Failing the application.

If I go to node level logs,  it says —> Container is not started or Already 
completed or does to belong to this.


Sushil Chaudhary


From: David Yan <[email protected]<mailto:[email protected]>>
Reply-To: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>
Date: Thursday, July 7, 2016 at 3:27 PM
To: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>
Subject: Re: Support for Apex malhar 3.4.0 running on Data torrent

Hi Sushil,

Can you tell us more details on how it fails after upgrading to 3.4.0?
Any error messages? App master logs? Container logs?

David

On Thu, Jul 7, 2016 at 12:23 PM, Chaudhary, Sushil (CONT) 
<[email protected]<mailto:[email protected]>> wrote:
We already tried Apex Malhar  <malhar.version>3.4.0</malhar.version> with 
Datatorrent RTS 3.4.0.  As soon as we upgrade the Apex Malhar library to 3.4.0, 
out Data torrent application started failing.

However application is working good if we use malhar version 3.3.0-incubating.


Also, once I move to malhar.version>3.4.0</malhar.version>, I need to define 
below library dependencies separately. Is there any third party interdependency 
which could be causing Data torrent job getting failed.



 <dependency>

                        <groupId>org.apache.apex</groupId>

                        <artifactId>apex-shaded-ning19</artifactId>

                        <version>1.0.0</version>

                 </dependency>

                 <dependency>

                        <groupId>net.sf.supercsv</groupId>

                        <artifactId>super-csv</artifactId>

                        <version>2.3.1</version>

                 </dependency>


Sushil Chaudhary
Cell #: 408-466-2004<tel:408-466-2004>
Email: [email protected]<mailto:[email protected]>

From: David Yan <[email protected]<mailto:[email protected]>>
Reply-To: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>
Date: Wednesday, July 6, 2016 at 7:54 PM
To: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>
Subject: Re: Support for Apex malhar 3.4.0 running on Data torrent

Hi Sushil,

We do have one backward incompatibility issue between Apex 3.3.x and Apex 3.4.x 
because of a security problem with a 3rd party library we depend on. Details 
are in the announcement email here:

http://mail-archives.apache.org/mod_mbox/www-announce/201605.mbox/%3cca%2b5xao1zyso6azubjokpvtjqm%[email protected]%3E

Can you try Apex 3.4.0?

David

On Fri, Jul 1, 2016 at 6:30 AM, Chaudhary, Sushil (CONT) 
<[email protected]<mailto:[email protected]>> wrote:


Sushil Chaudhary
Cell #: 408-466-2004<tel:408-466-2004>
Email: [email protected]<mailto:[email protected]>

From: "Chaudhary , Sushil (CONT)" 
<[email protected]<mailto:[email protected]>>
Date: Friday, July 1, 2016 at 9:25 AM
To: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>
Subject: Support for Apex malhar 3.4.0 running on Data torrent

Hi,
I am trying to use Apache apex malhar version 3.4.0 with data torrent 
application and using IMDB Geode operator.  Currently using 3.3.1-incubating 
for the apache apex with Data torrent and it is working good. But as soon as I 
upgrade it to 3.4.0 Data torrent container start throwing exception :

Container already closed or not available, or not running…

Please let me know if you have any clue on the same.

Sushil Chaudhary
Email: [email protected]<mailto:[email protected]>

________________________________

The information contained in this e-mail is confidential and/or proprietary to 
Capital One and/or its affiliates and may only be used solely in performance of 
work or services for Capital One. The information transmitted herewith is 
intended only for use by the individual or entity to which it is addressed. If 
the reader of this message is not the intended recipient, you are hereby 
notified that any review, retransmission, dissemination, distribution, copying 
or other use of, or taking of any action in reliance upon this information is 
strictly prohibited. If you have received this communication in error, please 
contact the sender and delete the material from your computer.


________________________________

The information contained in this e-mail is confidential and/or proprietary to 
Capital One and/or its affiliates and may only be used solely in performance of 
work or services for Capital One. The information transmitted herewith is 
intended only for use by the individual or entity to which it is addressed. If 
the reader of this message is not the intended recipient, you are hereby 
notified that any review, retransmission, dissemination, distribution, copying 
or other use of, or taking of any action in reliance upon this information is 
strictly prohibited. If you have received this communication in error, please 
contact the sender and delete the material from your computer.

________________________________________________________

The information contained in this e-mail is confidential and/or proprietary to 
Capital One and/or its affiliates and may only be used solely in performance of 
work or services for Capital One. The information transmitted herewith is 
intended only for use by the individual or entity to which it is addressed. If 
the reader of this message is not the intended recipient, you are hereby 
notified that any review, retransmission, dissemination, distribution, copying 
or other use of, or taking of any action in reliance upon this information is 
strictly prohibited. If you have received this communication in error, please 
contact the sender and delete the material from your computer.

Reply via email to