[jira] [Created] (FLINK-8102) Formatting issues in Mesos documentation.

2017-11-17 Thread Joerg Schad (JIRA)
Joerg Schad created FLINK-8102:
--

 Summary: Formatting issues in Mesos documentation. 
 Key: FLINK-8102
 URL: https://issues.apache.org/jira/browse/FLINK-8102
 Project: Flink
  Issue Type: Bug
Reporter: Joerg Schad
Assignee: Joerg Schad
Priority: Minor


The Flink documentation renders incorrectly as some characters are not probably 
escaped.

https://ci.apache.org/projects/flink/flink-docs-release-1.3/setup/mesos.html
https://ci.apache.org/projects/flink/flink-docs-release-1.3/setup/config.html#mesos



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (FLINK-8101) Elasticsearch 6.x support

2017-11-17 Thread Hai Zhou UTC+8 (JIRA)
Hai Zhou UTC+8 created FLINK-8101:
-

 Summary: Elasticsearch 6.x support
 Key: FLINK-8101
 URL: https://issues.apache.org/jira/browse/FLINK-8101
 Project: Flink
  Issue Type: New Feature
  Components: ElasticSearch Connector
Affects Versions: 1.4.0
Reporter: Hai Zhou UTC+8
 Fix For: 1.5.0


Recently, elasticsearch 6.0.0 was released: 
https://www.elastic.co/blog/elasticsearch-6-0-0-released  
The minimum version of ES6 compatible Elasticsearch Java Client is 5.6.0




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (FLINK-8100) Consider introducing log4j-extras

2017-11-17 Thread Ted Yu (JIRA)
Ted Yu created FLINK-8100:
-

 Summary: Consider introducing log4j-extras 
 Key: FLINK-8100
 URL: https://issues.apache.org/jira/browse/FLINK-8100
 Project: Flink
  Issue Type: Improvement
Reporter: Ted Yu


log4j-extras allows log rotation as well as compression.

https://logging.apache.org/log4j/extras/download.html

We should consider using log4j-extras.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (FLINK-8099) Reduce default restart delay to 1 second

2017-11-17 Thread Aljoscha Krettek (JIRA)
Aljoscha Krettek created FLINK-8099:
---

 Summary: Reduce default restart delay to 1 second
 Key: FLINK-8099
 URL: https://issues.apache.org/jira/browse/FLINK-8099
 Project: Flink
  Issue Type: Bug
  Components: State Backends, Checkpointing
Reporter: Aljoscha Krettek
Assignee: Aljoscha Krettek
Priority: Blocker
 Fix For: 1.4.0


Currently, when a job fails Flink will wait for 10 seconds until restarting the 
job. Even zero delay is a reasonable setting but will result in "flooding" the 
logs and quickly increasing the restart counter because at zero delay you will 
always see failures when no standby resources are available.

Reducing this to 1 second should make for a nicer out-of-box experience and not 
flood too much.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (FLINK-8098) LeaseExpiredException when using FsStateBackend for checkpointing due to multiple mappers tries to access the same file.

2017-11-17 Thread Shashank Agarwal (JIRA)
Shashank Agarwal created FLINK-8098:
---

 Summary: LeaseExpiredException when using FsStateBackend for 
checkpointing due to multiple mappers tries to access the same file.
 Key: FLINK-8098
 URL: https://issues.apache.org/jira/browse/FLINK-8098
 Project: Flink
  Issue Type: Bug
  Components: State Backends, Checkpointing
Affects Versions: 1.3.2
 Environment: Yarn, HDFS 2.7.3, Kafka, scala streaming API, CEP
Reporter: Shashank Agarwal


I am running streaming application with parallelism 6. I have enabled 
checkpointing(1000). But application gets the crash after 1-2 days. After 
analysing logs i found following trace. 

{code}
2017-11-17 11:19:06,696 WARN  
org.apache.flink.streaming.runtime.tasks.StreamTask   - Could not 
properly clean up the async checkpoint runnable.
java.lang.Exception: Could not properly cancel managed keyed state future.
at 
org.apache.flink.streaming.api.operators.OperatorSnapshotResult.cancel(OperatorSnapshotResult.java:90)
at 
org.apache.flink.streaming.runtime.tasks.StreamTask$AsyncCheckpointRunnable.cleanup(StreamTask.java:1023)
at 
org.apache.flink.streaming.runtime.tasks.StreamTask$AsyncCheckpointRunnable.close(StreamTask.java:983)
at org.apache.flink.util.IOUtils.closeQuietly(IOUtils.java:262)
at org.apache.flink.util.IOUtils.closeAllQuietly(IOUtils.java:251)
at 
org.apache.flink.util.AbstractCloseableRegistry.close(AbstractCloseableRegistry.java:97)
at 
org.apache.flink.streaming.runtime.tasks.StreamTask.cancel(StreamTask.java:355)
at 
org.apache.flink.runtime.taskmanager.Task$TaskCanceler.run(Task.java:1463)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.util.concurrent.ExecutionException: java.io.IOException: Could 
not flush and close the file system output stream to 
hdfs://xyz.com:8020/flink/sux/54944cea1f566ee801656e06cdeeabbc/chk-40191/cf145018-0599-4281-b254-96600a4e4965
 in order to obtain the stream state handle
at java.util.concurrent.FutureTask.report(FutureTask.java:122)
at java.util.concurrent.FutureTask.get(FutureTask.java:192)
at 
org.apache.flink.util.FutureUtil.runIfNotDoneAndGet(FutureUtil.java:43)
at 
org.apache.flink.runtime.state.StateUtil.discardStateFuture(StateUtil.java:85)
at 
org.apache.flink.streaming.api.operators.OperatorSnapshotResult.cancel(OperatorSnapshotResult.java:88)
... 8 more
Caused by: java.io.IOException: Could not flush and close the file system 
output stream to 
hdfs://xyz.com:8020/flink/sux/54944cea1f566ee801656e06cdeeabbc/chk-40191/cf145018-0599-4281-b254-96600a4e4965
 in order to obtain the stream state handle
at 
org.apache.flink.runtime.state.filesystem.FsCheckpointStreamFactory$FsCheckpointStateOutputStream.closeAndGetHandle(FsCheckpointStreamFactory.java:336)
at 
org.apache.flink.runtime.checkpoint.AbstractAsyncSnapshotIOCallable.closeStreamAndGetStateHandle(AbstractAsyncSnapshotIOCallable.java:100)
at 
org.apache.flink.runtime.state.heap.HeapKeyedStateBackend$1.performOperation(HeapKeyedStateBackend.java:351)
at 
org.apache.flink.runtime.state.heap.HeapKeyedStateBackend$1.performOperation(HeapKeyedStateBackend.java:329)
at 
org.apache.flink.runtime.io.async.AbstractAsyncIOCallable.call(AbstractAsyncIOCallable.java:72)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at 
org.apache.flink.util.FutureUtil.runIfNotDoneAndGet(FutureUtil.java:40)
at 
org.apache.flink.streaming.runtime.tasks.StreamTask$AsyncCheckpointRunnable.run(StreamTask.java:897)
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
... 1 more
Caused by: 
org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.hdfs.server.namenode.LeaseExpiredException):
 No lease 
flink/sux/54944cea1f566ee801656e06cdeeabbc/chk-40191/cf145018-0599-4281-b254-96600a4e4965
 (inode 812148671): File does not exist. [Lease.  Holder: 
DFSClient_NONMAPREDUCE_1721510813_94, pendingcreates: 161]
at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.checkLease(FSNamesystem.java:3659)
at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.completeFileInternal(FSNamesystem.java:3749)
at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.completeFile(FSNamesystem.java:3716)
at 
org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.complete(NameNodeRpcServer.java:911)
at 
org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.complete(ClientNamenodeProtocolServerSideTranslatorPB.java:547)