Subject: Why did JM fail on K8s (see original thread below)


This is strange: the retry strategy was set to 20 attempts with a 4-minute
delay. This job tried once (we had a Hadoop NameNode hiccup), but I think it
could not even reach the NN and gave up, i.e. it did not retry the remaining
19 times ...
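
For reference, a fixed-delay restart strategy of that shape would look
roughly like the following in flink-conf.yaml. This is a sketch assuming the
strategy is set cluster-wide through the config file rather than in the job
code; the values simply mirror the 20 attempts / 4 minutes mentioned above
and are not copied from the actual deployment.

```
# Assumed settings, for illustration only
restart-strategy: fixed-delay
restart-strategy.fixed-delay.attempts: 20
restart-strategy.fixed-delay.delay: 4 min
```

If the job sets its own restart strategy in code, that takes precedence over
these cluster defaults, so it is worth checking which one was in effect.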

*2019-06-29 00:33:13,680 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Could not restart
the job Kafka-to-HDFS (00000000000000000000000000000005) because the
restart strategy prevented it.*

On Sat, Jun 29, 2019 at 10:03 AM Vishal Santoshi <[EMAIL PROTECTED]>
wrote: