Hi Vivek, I tried out the BoundedDedupOperator and processed ~ 50k records, seems to be running okay. Here is the example I tried out: https://github.com/apache /apex-malhar/tree/master/examples/dedup. Make sure to replace TimeBasedDedupOperator with BoundedDedupOperator. Also note that this uses the latest code 3.8.0-SNAPSHOT (master).
Can you check if this helps? ~ Bhupesh _______________________________________________________ Bhupesh Chawda E: bhup...@datatorrent.com | Twitter: @bhupeshsc www.datatorrent.com | apex.apache.org On Fri, Jun 9, 2017 at 11:01 PM, Vivek Bhide <bhide.vi...@gmail.com> wrote: > Hi Bhupesh, > > I even tried using the TimeBoundedDedupe instead of BoundedDedup and even > that one fails with exception. In this case, the container starts properly > but as soon as it tries to process the tuples it fails. > > Below are configurations > ================ > > <property> > > <name>dt.application.DataUsageIngest.operator.dedupeOperator.port.input. > attr.TUPLE_CLASS</name> > > <value>com.tgt.dqs.datausageingest.object.DataSetAttributeWithChecksum</ > value> > </property> > <property> > > <name>dt.application.DataUsageIngest.operator.dedupeOperator.prop. > keyExpression</name> > <value>checksum</value> > </property> > <property> > > <name>dt.application.DataUsageIngest.operator.dedupeOperator.prop. > timeExpression</name> > <value>date.getTime()</value> > </property> > <property> > > <name>dt.application.DataUsageIngest.operator.dedupeOperator.prop. > bucketSpan</name> > <value>1800</value> > </property> > <property> > > <name>dt.application.DataUsageIngest.operator.dedupeOperator.prop. > expireBefore</name> > <value>180000</value> > </property> > > Below are the container logs > =================== > > 2017-06-08 18:59:53,569 INFO util.LoggerUtil > (LoggerUtil.java:changeLoggersLevel(274)) - changing level of > com.datatorrent.stram.util.LoggerUtil to INFO > 2017-06-08 18:59:53,590 INFO engine.StreamingContainer > (StreamingContainer.java:main(291)) - Child starting with classpath: > ./kafka-clients-0.9.0.0.jar:./jetty-io-8.1.10.v20130312.jar: > ./jetty-http-8.1.10.v20130312.jar:./jetty-server-8.1.10. > v20130312.jar:./activemq-client-5.8.0.jar:./bval-core- > 0.5.jar:./joda-time-2.9.1.jar:./jetty-servlet-8.1.10. > v20130312.jar:./mbassador-1.1.9.jar:./jersey-core-1.9.jar:./ > httpcore-4.3.2.jar:./avro-1.7.4.jar:./validation-api-1.1.0. > Final.jar:./malhar-contrib-3.7.0.jar:./named-regexp-0.2.3. > jar:./jetty-security-8.1.10.v20130312.jar:./jackson-core- > asl-1.9.13.jar:./commons-lang3-3.1.jar:./httpcore-4.3. > 3.jar:./jcip-annotations-1.0.jar:./malhar-hive-3.7.0.jar:./ > rhino-1.7R4.jar:./httpclient-4.3.5.jar:./commons-compress- > 1.4.1.jar:./jersey-client-1.9.jar:./xbean-asm5-shaded-4.3. > jar:./jetty-websocket-8.1.10.v20130312.jar:./apex-common-3. > 7.0-SNAPSHOT.jar:./jackson-annotations-2.7.0.jar:./ > mailapi-1.4.3.jar:./kafka_2.12-0.10.2.0.jar:./bval-jsr303- > 0.5.jar:./jctools-core-1.1.jar:./netlet-1.3.0.jar:./ > commons-collections-3.2.1.jar:./libthrift-0.9.3.jar:./ > malhar-kafka-3.7.0.jar:./aws-java-sdk-s3-1.10.73.jar:./ > janino-3.0.7.jar:./malhar-library-3.7.0.jar:./minlog-1. > 2.jar:./hawtbuf-1.9.jar:./datausageingest-1.0-SNAPSHOT. > jar:./jetty-continuation-8.1.10.v20130312.jar:./jetty-util- > 8.1.10.v20130312.jar:./jsr305-1.3.9.jar:./lz4-1.2.0.jar:./ > httpclient-4.3.6.jar:./apex-api-3.7.0-SNAPSHOT.jar:./ > log4j-1.2.17.jar:./activation-1.1.jar:./json-schema-core-1. > 0.2.jar:./scala-library-2.12.1.jar:./jopt-simple-5.0.3.jar: > ./kryo-2.24.0.jar:./snappy-java-1.0.4.1.jar:./fastutil-7. > 0.6.jar:./apex-engine.jar:./guava-11.0.2.jar:./adaptor- > commons-0.0.2-SNAPSHOT.jar:./geronimo-j2ee-management_1.1_ > spec-1.0.1.jar:./jersey-apache-client4-1.9.jar:./ > zookeeper-3.4.9.jar:./jooq-3.6.4.jar:./slf4j-api-1.7.5.jar: > ./hive-jdbc-2.0.0.jar:./metrics-core-2.2.0.jar:./ > commons-beanutils-1.9.2.jar:./slf4j-log4j12-1.7.21.jar:./ > apex-shaded-ning19-1.0.0.jar:./libphonenumber-5.3.jar:./aws- > java-sdk-kms-1.10.73.jar:./aws-java-sdk-core-1.10.73.jar: > ./hive-service-2.0.0.jar:./commons-logging-1.1.1.jar:./ > zkclient-0.10.jar:./jackson-core-2.7.0.jar:./scala-parser- > combinators_2.12-1.0.4.jar:./jackson-databind-2.5.4.jar:./ > jms-api-1.1-rev-1.jar:./paranamer-2.3.jar:./apex- > bufferserver-3.7.0-SNAPSHOT.jar:./hive-exec-0.13.1.jar:./ > json-schema-validator-2.0.1.jar:./commons-compiler-3.0.7. > jar:./javax.mail-1.5.0.jar:./geronimo-jms_1.1_spec-1.1.1. > jar:./jackson-mapper-asl-1.9.13.jar:./jackson-dataformat- > cbor-2.5.3.jar:./xz-1.0.jar:/usr/hdp/current/hadoop-client/ > conf:/usr/hdp/current/hadoop-client/hadoop-azure.jar:/usr/ > hdp/current/hadoop-client/hadoop-annotations.jar:/usr/ > hdp/current/hadoop-client/hadoop-nfs-2.7.3.2.5.3.0-37. > jar:/usr/hdp/current/hadoop-client/hadoop-nfs.jar:/usr/ > hdp/current/hadoop-client/hadoop-annotations-2.7.3.2.5. > 3.0-37.jar:/usr/hdp/current/hadoop-client/hadoop-azure-2. > 7.3.2.5.3.0-37.jar:/usr/hdp/current/hadoop-client/hadoop- > auth.jar:/usr/hdp/current/hadoop-client/hadoop-auth-2.7. > 3.2.5.3.0-37.jar:/usr/hdp/current/hadoop-client/hadoop- > common-2.7.3.2.5.3.0-37.jar:/usr/hdp/current/hadoop-client/ > hadoop-aws-2.7.3.2.5.3.0-37.jar:/usr/hdp/current/hadoop- > client/hadoop-common-tests.jar:/usr/hdp/current/hadoop- > client/hadoop-aws.jar:/usr/hdp/current/hadoop-client/ > hadoop-common-2.7.3.2.5.3.0-37-tests.jar:/usr/hdp/current/ > hadoop-client/hadoop-common.jar:/usr/hdp/current/hadoop- > client/lib/aws-java-sdk-s3-1.10.6.jar:/usr/hdp/current/ > hadoop-client/lib/ojdbc6.jar:/usr/hdp/current/hadoop-client/ > lib/asm-3.2.jar:/usr/hdp/current/hadoop-client/lib/ > ranger-yarn-plugin-shim-0.6.0.2.5.3.0-37.jar:/usr/hdp/ > current/hadoop-client/lib/snappy-java-1.0.4.1.jar:/usr/ > hdp/current/hadoop-client/lib/nimbus-jose-jwt-3.9.jar:/usr/ > hdp/current/hadoop-client/lib/curator-framework-2.7.1.jar:/ > usr/hdp/current/hadoop-client/lib/jaxb-impl-2.2.3-1.jar:/ > usr/hdp/current/hadoop-client/lib/commons-codec-1.4.jar:/ > usr/hdp/current/hadoop-client/lib/jsp-api-2.1.jar:/usr/hdp/ > current/hadoop-client/lib/slf4j-log4j12-1.7.10.jar:/usr/ > hdp/current/hadoop-client/lib/stax-api-1.0-2.jar:/usr/hdp/ > current/hadoop-client/lib/jackson-jaxrs-1.9.13.jar:/usr/ > hdp/current/hadoop-client/lib/junit-4.11.jar:/usr/hdp/ > current/hadoop-client/lib/paranamer-2.3.jar:/usr/hdp/ > current/hadoop-client/lib/aws-java-sdk-core-1.10.6.jar:/usr/ > hdp/current/hadoop-client/lib/java-xmlbuilder-0.4.jar:/usr/ > hdp/current/hadoop-client/lib/commons-net-3.1.jar:/usr/hdp/ > current/hadoop-client/lib/jsch-0.1.42.jar:/usr/hdp/ > current/hadoop-client/lib/jackson-databind-2.2.3.jar:/ > usr/hdp/current/hadoop-client/lib/commons-lang-2.6.jar:/usr/ > hdp/current/hadoop-client/lib/apacheds-i18n-2.0.0-M15.jar:/ > usr/hdp/current/hadoop-client/lib/ranger-hdfs-plugin-shim-0. > 6.0.2.5.3.0-37.jar:/usr/hdp/current/hadoop-client/lib/ > commons-logging-1.1.3.jar:/usr/hdp/current/hadoop-client/ > lib/jackson-core-2.2.3.jar:/usr/hdp/current/hadoop-client/ > lib/jackson-mapper-asl-1.9.13.jar:/usr/hdp/current/hadoop- > client/lib/jackson-core-asl-1.9.13.jar:/usr/hdp/current/ > hadoop-client/lib/commons-configuration-1.6.jar:/usr/ > hdp/current/hadoop-client/lib/jackson-annotations-2.2.3.jar: > /usr/hdp/current/hadoop-client/lib/xz-1.0.jar:/usr/ > hdp/current/hadoop-client/lib/guava-11.0.2.jar:/usr/hdp/ > current/hadoop-client/lib/commons-beanutils-core-1.8.0. > jar:/usr/hdp/current/hadoop-client/lib/gson-2.2.4.jar:/ > usr/hdp/current/hadoop-client/lib/htrace-core-3.1.0- > incubating.jar:/usr/hdp/current/hadoop-client/lib/ > commons-beanutils-1.7.0.jar:/usr/hdp/current/hadoop-client/ > lib/jettison-1.1.jar:/usr/hdp/current/hadoop-client/lib/ > ranger-plugin-classloader-0.6.0.2.5.3.0-37.jar:/usr/hdp/ > current/hadoop-client/lib/jaxb-api-2.2.2.jar:/usr/hdp/ > current/hadoop-client/lib/aws-java-sdk-kms-1.10.6.jar:/usr/ > hdp/current/hadoop-client/lib/httpcore-4.4.4.jar:/usr/hdp/ > current/hadoop-client/lib/netty-3.6.2.Final.jar:/usr/ > hdp/current/hadoop-client/lib/jersey-json-1.9.jar:/usr/hdp/ > current/hadoop-client/lib/jcip-annotations-1.0.jar:/usr/ > hdp/current/hadoop-client/lib/httpclient-4.5.2.jar:/usr/hdp/ > current/hadoop-client/lib/hamcrest-core-1.3.jar:/usr/ > hdp/current/hadoop-client/lib/curator-recipes-2.7.1.jar:/ > usr/hdp/current/hadoop-client/lib/commons-io-2.4.jar:/usr/ > hdp/current/hadoop-client/lib/commons-compress-1.4.1.jar:/ > usr/hdp/current/hadoop-client/lib/apacheds-kerberos-codec-2. > 0.0-M15.jar:/usr/hdp/current/hadoop-client/lib/protobuf- > java-2.5.0.jar:/usr/hdp/current/hadoop-client/lib/ > jetty-6.1.26.hwx.jar:/usr/hdp/current/hadoop-client/lib/ > jersey-core-1.9.jar:/usr/hdp/current/hadoop-client/lib/api- > asn1-api-1.0.0-M20.jar:/usr/hdp/current/hadoop-client/lib/ > jsr305-3.0.0.jar:/usr/hdp/current/hadoop-client/lib/ > xmlenc-0.52.jar:/usr/hdp/current/hadoop-client/lib/ > curator-client-2.7.1.jar:/usr/hdp/current/hadoop-client/lib/ > commons-math3-3.1.1.jar:/usr/hdp/current/hadoop-client/lib/ > jets3t-0.9.0.jar:/usr/hdp/current/hadoop-client/lib/ > jackson-xc-1.9.13.jar:/usr/hdp/current/hadoop-client/lib/ > activation-1.1.jar:/usr/hdp/current/hadoop-client/lib/api- > util-1.0.0-M20.jar:/usr/hdp/current/hadoop-client/lib/ > azure-keyvault-core-0.8.0.jar:/usr/hdp/current/hadoop- > client/lib/slf4j-api-1.7.10.jar:/usr/hdp/current/hadoop- > client/lib/avro-1.7.4.jar:/usr/hdp/current/hadoop-client/ > lib/commons-lang3-3.4.jar:/usr/hdp/current/hadoop-client/ > lib/commons-cli-1.2.jar:/usr/hdp/current/hadoop-client/lib/ > joda-time-2.8.1.jar:/usr/hdp/current/hadoop-client/lib/ > jetty-util-6.1.26.hwx.jar:/usr/hdp/current/hadoop-client/ > lib/servlet-api-2.5.jar:/usr/hdp/current/hadoop-client/lib/ > log4j-1.2.17.jar:/usr/hdp/current/hadoop-client/lib/ > commons-digester-1.8.jar:/usr/hdp/current/hadoop-client/lib/ > jersey-server-1.9.jar:/usr/hdp/current/hadoop-client/lib/ > zookeeper-3.4.6.2.5.3.0-37.jar:/usr/hdp/current/hadoop- > client/lib/azure-storage-4.2.0.jar:/usr/hdp/current/hadoop- > client/lib/commons-collections-3.2.2.jar:/usr/ > hdp/current/hadoop-client/lib/mockito-all-1.8.5.jar:/usr/ > hdp/current/hadoop-client/lib/json-smart-1.1.1.jar:/usr/hdp/ > current/hadoop-hdfs-client/hadoop-hdfs.jar:/usr/hdp/ > current/hadoop-hdfs-client/hadoop-hdfs-2.7.3.2.5.3.0-37- > tests.jar:/usr/hdp/current/hadoop-hdfs-client/hadoop- > hdfs-tests.jar:/usr/hdp/current/hadoop-hdfs-client/ > hadoop-hdfs-2.7.3.2.5.3.0-37.jar:/usr/hdp/current/hadoop- > hdfs-client/hadoop-hdfs-nfs-2.7.3.2.5.3.0-37.jar:/usr/hdp/ > current/hadoop-hdfs-client/hadoop-hdfs-nfs.jar:/usr/hdp/ > current/hadoop-hdfs-client/lib/asm-3.2.jar:/usr/hdp/ > current/hadoop-hdfs-client/lib/commons-codec-1.4.jar:/ > usr/hdp/current/hadoop-hdfs-client/lib/commons-lang-2.6. > jar:/usr/hdp/current/hadoop-hdfs-client/lib/xercesImpl-2. > 9.1.jar:/usr/hdp/current/hadoop-hdfs-client/lib/ > commons-logging-1.1.3.jar:/usr/hdp/current/hadoop-hdfs- > client/lib/jackson-mapper-asl-1.9.13.jar:/usr/hdp/current/ > hadoop-hdfs-client/lib/jackson-core-asl-1.9.13.jar:/ > usr/hdp/current/hadoop-hdfs-client/lib/okio-1.4.0.jar:/ > usr/hdp/current/hadoop-hdfs-client/lib/netty-all-4.0.23. > Final.jar:/usr/hdp/current/hadoop-hdfs-client/lib/guava- > 11.0.2.jar:/usr/hdp/current/hadoop-hdfs-client/lib/htrace- > core-3.1.0-incubating.jar:/usr/hdp/current/hadoop-hdfs- > client/lib/netty-3.6.2.Final.jar:/usr/hdp/current/hadoop- > hdfs-client/lib/commons-io-2.4.jar:/usr/hdp/current/hadoop- > hdfs-client/lib/protobuf-java-2.5.0.jar:/usr/hdp/current/ > hadoop-hdfs-client/lib/leveldbjni-all-1.8.jar:/usr/ > hdp/current/hadoop-hdfs-client/lib/jetty-6.1.26.hwx. > jar:/usr/hdp/current/hadoop-hdfs-client/lib/jersey-core-1. > 9.jar:/usr/hdp/current/hadoop-hdfs-client/lib/jsr305-3.0.0. > jar:/usr/hdp/current/hadoop-hdfs-client/lib/xmlenc-0.52. > jar:/usr/hdp/current/hadoop-hdfs-client/lib/okhttp-2.4.0. > jar:/usr/hdp/current/hadoop-hdfs-client/lib/commons-cli-1. > 2.jar:/usr/hdp/current/hadoop-hdfs-client/lib/xml-apis-1.3. > 04.jar:/usr/hdp/current/hadoop-hdfs-client/lib/jetty- > util-6.1.26.hwx.jar:/usr/hdp/current/hadoop-hdfs-client/ > lib/servlet-api-2.5.jar:/usr/hdp/current/hadoop-hdfs- > client/lib/log4j-1.2.17.jar:/usr/hdp/current/hadoop-hdfs- > client/lib/jersey-server-1.9.jar:/usr/hdp/current/hadoop- > hdfs-client/lib/commons-daemon-1.0.13.jar:/usr/hdp/ > current/hadoop-yarn-client/hadoop-yarn-server- > nodemanager-2.7.3.2.5.3.0-37.jar:/usr/hdp/current/hadoop- > yarn-client/hadoop-yarn-server-tests.jar:/usr/hdp/ > current/hadoop-yarn-client/hadoop-yarn-common-2.7.3.2.5. > 3.0-37.jar:/usr/hdp/current/hadoop-yarn-client/hadoop-yarn-server- > applicationhistoryservice.jar:/usr/hdp/current/hadoop-yarn- > client/hadoop-yarn-applications-distributedshell. > jar:/usr/hdp/current/hadoop-yarn-client/hadoop-yarn- > server-sharedcachemanager-2.7.3.2.5.3.0-37.jar:/usr/hdp/ > current/hadoop-yarn-client/hadoop-yarn-client-2.7.3.2.5. > 3.0-37.jar:/usr/hdp/current/hadoop-yarn-client/hadoop- > yarn-server-resourcemanager-2.7.3.2.5.3.0-37.jar:/usr/hdp/ > current/hadoop-yarn-client/hadoop-yarn-registry-2.7.3.2. > 5.3.0-37.jar:/usr/hdp/current/hadoop-yarn-client/hadoop- > yarn-server-common-2.7.3.2.5.3.0-37.jar:/usr/hdp/current/ > hadoop-yarn-client/hadoop-yarn-server-web-proxy.jar:/ > usr/hdp/current/hadoop-yarn-client/hadoop-yarn-server- > timeline-pluginstorage-2.7.3.2.5.3.0-37.jar:/usr/hdp/ > current/hadoop-yarn-client/hadoop-yarn-api-2.7.3.2.5.3.0- > 37.jar:/usr/hdp/current/hadoop-yarn-client/hadoop- > yarn-common.jar:/usr/hdp/current/hadoop-yarn-client/ > hadoop-yarn-server-timeline-pluginstorage.jar:/usr/hdp/ > current/hadoop-yarn-client/hadoop-yarn-server-common.jar: > /usr/hdp/current/hadoop-yarn-client/hadoop-yarn-server- > nodemanager.jar:/usr/hdp/current/hadoop-yarn-client/ > hadoop-yarn-api.jar:/usr/hdp/current/hadoop-yarn-client/ > hadoop-yarn-server-sharedcachemanager.jar:/usr/hdp/current/hadoop-yarn- > client/hadoop-yarn-registry.jar:/usr/hdp/current/hadoop- > yarn-client/hadoop-yarn-server-tests-2.7.3.2.5.3.0-37. > jar:/usr/hdp/current/hadoop-yarn-client/hadoop-yarn- > applications-unmanaged-am-launcher.jar:/usr/hdp/current/ > hadoop-yarn-client/hadoop-yarn-client.jar:/usr/hdp/ > current/hadoop-yarn-client/hadoop-yarn-server-web-proxy- > 2.7.3.2.5.3.0-37.jar:/usr/hdp/current/hadoop-yarn-client/ > hadoop-yarn-applications-distributedshell-2.7.3.2.5.3. > 0-37.jar:/usr/hdp/current/hadoop-yarn-client/hadoop- > yarn-server-resourcemanager.jar:/usr/hdp/current/hadoop- > yarn-client/hadoop-yarn-server-applicationhistoryservice-2.7. > 3.2.5.3.0-37.jar:/usr/hdp/current/hadoop-yarn-client/ > hadoop-yarn-applications-unmanaged-am-launcher-2.7.3.2. > 5.3.0-37.jar:/usr/hdp/current/hadoop-yarn-client/lib/asm-3. > 2.jar:/usr/hdp/current/hadoop-yarn-client/lib/snappy-java-1. > 0.4.1.jar:/usr/hdp/current/hadoop-yarn-client/lib/nimbus- > jose-jwt-3.9.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/guice-3.0.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/curator-framework-2.7.1.jar:/usr/hdp/current/hadoop- > yarn-client/lib/jersey-client-1.9.jar:/usr/hdp/current/ > hadoop-yarn-client/lib/jaxb-impl-2.2.3-1.jar:/usr/hdp/ > current/hadoop-yarn-client/lib/commons-codec-1.4.jar:/ > usr/hdp/current/hadoop-yarn-client/lib/jsp-api-2.1.jar:/ > usr/hdp/current/hadoop-yarn-client/lib/stax-api-1.0-2.jar: > /usr/hdp/current/hadoop-yarn-client/lib/jackson-jaxrs-1.9. > 13.jar:/usr/hdp/current/hadoop-yarn-client/lib/paranamer-2.3.jar:/usr/hdp/ > current/hadoop-yarn-client/lib/java-xmlbuilder-0.4.jar:/ > usr/hdp/current/hadoop-yarn-client/lib/commons-net-3.1. > jar:/usr/hdp/current/hadoop-yarn-client/lib/jsch-0.1.42. > jar:/usr/hdp/current/hadoop-yarn-client/lib/jackson- > databind-2.2.3.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/commons-lang-2.6.jar:/usr/hdp/current/hadoop-yarn- > client/lib/apacheds-i18n-2.0.0-M15.jar:/usr/hdp/current/ > hadoop-yarn-client/lib/commons-logging-1.1.3.jar:/ > usr/hdp/current/hadoop-yarn-client/lib/jackson-core-2.2.3. > jar:/usr/hdp/current/hadoop-yarn-client/lib/jackson- > mapper-asl-1.9.13.jar:/usr/hdp/current/hadoop-yarn- > client/lib/jackson-core-asl-1.9.13.jar:/usr/hdp/current/ > hadoop-yarn-client/lib/metrics-core-3.0.1.jar:/usr/ > hdp/current/hadoop-yarn-client/lib/commons-configuration-1.6.jar:/usr/ > hdp/current/hadoop-yarn-client/lib/jackson-annotations-2.2.3.jar:/usr/ > hdp/current/hadoop-yarn-client/lib/xz-1.0.jar:/usr/ > hdp/current/hadoop-yarn-client/lib/guava-11.0.2.jar:/ > usr/hdp/current/hadoop-yarn-client/lib/commons-beanutils- > core-1.8.0.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/gson-2.2.4.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/htrace-core-3.1.0-incubating.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/commons-beanutils-1.7.0.jar:/usr/hdp/current/hadoop- > yarn-client/lib/jettison-1.1.jar:/usr/hdp/current/hadoop- > yarn-client/lib/jaxb-api-2.2.2.jar:/usr/hdp/current/hadoop- > yarn-client/lib/httpcore-4.4.4.jar:/usr/hdp/current/hadoop- > yarn-client/lib/netty-3.6.2.Final.jar:/usr/hdp/current/ > hadoop-yarn-client/lib/jersey-json-1.9.jar:/usr/hdp/current/ > hadoop-yarn-client/lib/jcip-annotations-1.0.jar:/usr/hdp/ > current/hadoop-yarn-client/lib/javax.inject-1.jar:/usr/ > hdp/current/hadoop-yarn-client/lib/httpclient-4.5.2. > jar:/usr/hdp/current/hadoop-yarn-client/lib/curator- > recipes-2.7.1.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/commons-io-2.4.jar:/usr/hdp/current/hadoop-yarn- > client/lib/objenesis-2.1.jar:/usr/hdp/current/hadoop-yarn- > client/lib/commons-compress-1.4.1.jar:/usr/hdp/current/ > hadoop-yarn-client/lib/apacheds-kerberos-codec-2.0.0- > M15.jar:/usr/hdp/current/hadoop-yarn-client/lib/ > protobuf-java-2.5.0.jar:/usr/hdp/current/hadoop-yarn- > client/lib/leveldbjni-all-1.8.jar:/usr/hdp/current/hadoop- > yarn-client/lib/jetty-6.1.26.hwx.jar:/usr/hdp/current/ > hadoop-yarn-client/lib/javassist-3.18.1-GA.jar:/usr/ > hdp/current/hadoop-yarn-client/lib/jersey-core-1.9. > jar:/usr/hdp/current/hadoop-yarn-client/lib/api-asn1-api- > 1.0.0-M20.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/jsr305-3.0.0.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/xmlenc-0.52.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/curator-client-2.7.1.jar:/usr/hdp/current/hadoop-yarn- > client/lib/commons-math3-3.1.1.jar:/usr/hdp/current/hadoop- > yarn-client/lib/jets3t-0.9.0.jar:/usr/hdp/current/hadoop- > yarn-client/lib/jackson-xc-1.9.13.jar:/usr/hdp/current/ > hadoop-yarn-client/lib/guice-servlet-3.0.jar:/usr/hdp/ > current/hadoop-yarn-client/lib/activation-1.1.jar:/usr/ > hdp/current/hadoop-yarn-client/lib/api-util-1.0.0-M20. > jar:/usr/hdp/current/hadoop-yarn-client/lib/azure- > keyvault-core-0.8.0.jar:/usr/hdp/current/hadoop-yarn- > client/lib/avro-1.7.4.jar:/usr/hdp/current/hadoop-yarn- > client/lib/jersey-guice-1.9.jar:/usr/hdp/current/hadoop- > yarn-client/lib/commons-lang3-3.4.jar:/usr/hdp/current/ > hadoop-yarn-client/lib/commons-cli-1.2.jar:/usr/hdp/ > current/hadoop-yarn-client/lib/fst-2.24.jar:/usr/hdp/ > current/hadoop-yarn-client/lib/jetty-util-6.1.26.hwx.jar: > /usr/hdp/current/hadoop-yarn-client/lib/servlet-api-2.5. > jar:/usr/hdp/current/hadoop-yarn-client/lib/log4j-1.2.17. > jar:/usr/hdp/current/hadoop-yarn-client/lib/commons- > digester-1.8.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/jersey-server-1.9.jar:/usr/hdp/current/hadoop-yarn- > client/lib/aopalliance-1.0.jar:/usr/hdp/current/hadoop- > yarn-client/lib/zookeeper-3.4.6.2.5.3.0-37-tests.jar:/usr/ > hdp/current/hadoop-yarn-client/lib/zookeeper-3.4.6.2. > 5.3.0-37.jar:/usr/hdp/current/hadoop-yarn-client/lib/azure- > storage-4.2.0.jar:/usr/hdp/current/hadoop-yarn-client/ > lib/commons-collections-3.2.2.jar:/usr/hdp/current/hadoop- > yarn-client/lib/json-smart-1.1.1.jar:. > 2017-06-08 18:59:54,005 WARN util.NativeCodeLoader > (NativeCodeLoader.java:<clinit>(62)) - Unable to load native-hadoop > library > for your platform... using builtin-java classes where applicable > 2017-06-08 18:59:55,009 WARN shortcircuit.DomainSocketFactory > (DomainSocketFactory.java:<init>(117)) - The short-circuit local reads > feature cannot be used because libhadoop cannot be loaded. > 2017-06-08 18:59:55,435 INFO storage.DiskStorage > (DiskStorage.java:<init>(53)) - using > /grid/10/hadoop/yarn/local/usercache/SVDATHDP/appcache/ > application_1496931225841_2096/container_e3093_ > 1496931225841_2096_01_000002/tmp > as the basepath for spooling. > 2017-06-08 18:59:55,438 INFO server.Server (Server.java:registered(112)) - > Server started listening at /0.0.0.0:41023 > 2017-06-08 18:59:56,471 INFO engine.StreamingContainer > (StreamingContainer.java:heartbeatLoop(711)) - Waiting for pending > request. > 2017-06-08 18:59:56,976 INFO engine.StreamingContainer > (StreamingContainer.java:heartbeatLoop(711)) - Waiting for pending > request. > 2017-06-08 18:59:57,482 INFO engine.StreamingContainer > (StreamingContainer.java:heartbeatLoop(711)) - Waiting for pending > request. > 2017-06-08 18:59:57,987 INFO engine.StreamingContainer > (StreamingContainer.java:heartbeatLoop(711)) - Waiting for pending > request. > 2017-06-08 18:59:58,491 INFO engine.StreamingContainer > (StreamingContainer.java:heartbeatLoop(711)) - Waiting for pending > request. > 2017-06-08 18:59:58,996 INFO engine.StreamingContainer > (StreamingContainer.java:heartbeatLoop(711)) - Waiting for pending > request. > 2017-06-08 18:59:59,500 INFO engine.StreamingContainer > (StreamingContainer.java:heartbeatLoop(711)) - Waiting for pending > request. > 2017-06-08 19:00:00,004 INFO engine.StreamingContainer > (StreamingContainer.java:heartbeatLoop(711)) - Waiting for pending > request. > 2017-06-08 19:00:00,414 INFO server.Server (Server.java:onMessage(599)) - > Received subscriber request: SubscribeRequestTuple{version=1.0, > identifier=tcp://d-2vwlw12.target.com:41023/5.unique.1, > windowId=ffffffffffffffff, type=uniquMessages/6.inputPort, > upstreamIdentifier=5.unique.1, mask=0, partitions=null, bufferSize=1024} > 2017-06-08 19:00:00,528 INFO engine.StreamingContainer > (StreamingContainer.java:processHeartbeatResponse(825)) - Deploy request: > [OperatorDeployInfo[id=5,name=dedupeOperator,type=GENERIC, > checkpoint={ffffffffffffffff, > 0, > 0},inputs=[OperatorDeployInfo.InputDeployInfo[portName=input,streamId= > checkDuplicates,sourceNodeId=3,sourcePortName=dedupePort, > locality=<null>,partitionMask=0,partitionKeys=<null>]], > outputs=[OperatorDeployInfo.OutputDeployInfo[portName= > unique,streamId=uniquMessages,bufferServer=d-2vwlw12.target.com]]]] > 2017-06-08 19:00:00,664 INFO server.Server (Server.java:onMessage(555)) - > Received publisher request: PublishRequestTuple{version=1.0, > identifier=5.unique.1, windowId=ffffffffffffffff} > 2017-06-08 19:00:03,105 INFO util.AsyncFSStorageAgent > (AsyncFSStorageAgent.java:save(91)) - using > /grid/10/hadoop/yarn/local/usercache/SVDATHDP/appcache/ > application_1496931225841_2096/container_e3093_ > 1496931225841_2096_01_000002/tmp/chkp4165334308239559126 > as the basepath for checkpointing. > 2017-06-08 19:05:57,806 ERROR engine.StreamingContainer > (StreamingContainer.java:run(1456)) - Operator set > [OperatorDeployInfo[id=5,name=dedupeOperator,type=GENERIC, > checkpoint={ffffffffffffffff, > 0, > 0},inputs=[OperatorDeployInfo.InputDeployInfo[portName=input,streamId= > checkDuplicates,sourceNodeId=3,sourcePortName=dedupePort, > locality=<null>,partitionMask=0,partitionKeys=<null>]], > outputs=[OperatorDeployInfo.OutputDeployInfo[portName= > unique,streamId=uniquMessages,bufferServer=d-2vwlw12.target.com]]]] > stopped running due to an exception. > java.lang.IllegalArgumentException: Invalid slice: offset=0, length=0 > array.length=0 > at com.datatorrent.netlet.util.Slice.<init>(Slice.java:43) > at > org.apache.apex.malhar.lib.utils.serde.BufferSlice.<init> > (BufferSlice.java:48) > at > org.apache.apex.malhar.lib.utils.serde.BufferSlice.<init> > (BufferSlice.java:58) > at > org.apache.apex.malhar.lib.utils.serde.SliceUtils. > toBufferSlice(SliceUtils.java:111) > at > org.apache.apex.malhar.lib.state.managed.Bucket$ > DefaultBucket.put(Bucket.java:421) > at > org.apache.apex.malhar.lib.state.managed.AbstractManagedStateImpl. > putInBucket(AbstractManagedStateImpl.java:286) > at > org.apache.apex.malhar.lib.state.managed.ManagedTimeUnifiedStateImpl.put( > ManagedTimeUnifiedStateImpl.java:72) > at > org.apache.apex.malhar.lib.dedup.TimeBasedDedupOperator.putManagedState( > TimeBasedDedupOperator.java:189) > at > org.apache.apex.malhar.lib.dedup.AbstractDeduper.processAuxiliary( > AbstractDeduper.java:316) > at > org.apache.apex.malhar.lib.dedup.AbstractDeduper. > endWindow(AbstractDeduper.java:337) > at > com.datatorrent.stram.engine.GenericNode.processEndWindow( > GenericNode.java:153) > at com.datatorrent.stram.engine.GenericNode.run(GenericNode. > java:397) > at > com.datatorrent.stram.engine.StreamingContainer$2.run( > StreamingContainer.java:1428) > 2017-06-08 19:05:58,082 INFO engine.StreamingContainer > (StreamingContainer.java:processHeartbeatResponse(808)) - Undeploy > request: > [5] > 2017-06-08 19:05:58,084 INFO engine.StreamingContainer > (StreamingContainer.java:undeploy(561)) - Undeploy complete. > 2017-06-08 19:05:58,085 INFO server.Server (Server.java:run(414)) - > Removing ln > LogicalNode@411ca26bidentifier=tcp://d-2vwlw12.target.com:41023/5.unique.1 > , > upstream=5.unique.1, group=uniquMessages/6.inputPort, partitions=[], > iterator=com.datatorrent.bufferserver.internal.DataList$DataListIterator@ > 64b843d2{da=com.datatorrent.bufferserver.internal.DataList$Block@2c9a38f1{ > identifier=5.unique.1, > data=67108864, readingOffset=5661, writingOffset=6062, > starting_window=5939e4e6000002cf, ending_window=5939e4e6000002ff, > refCount=2, uniqueIdentifier=0, next=null, future=null}}} from dl > com.datatorrent.bufferserver.internal.DataList@e9ca021 {5.unique.1} > > Regards > Vivek > > > > -- > View this message in context: http://apache-apex-users-list. > 78494.x6.nabble.com/BoundedDedupOperator-failing-with-java-lang- > IllegalArgumentException-bucket-conflict-tp1698p1703.html > Sent from the Apache Apex Users list mailing list archive at Nabble.com. >