Search in sources :

Example 16 with FailureHandlingResult

use of org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult in project flink-mirror by flink-ci.

the class FailureHandlingResultSnapshotTest method testRootCauseVertexNotFailed.

@Test(expected = IllegalArgumentException.class)
public void testRootCauseVertexNotFailed() {
    final ExecutionVertex rootCauseExecutionVertex = extractExecutionVertex(0);
    final FailureHandlingResult failureHandlingResult = FailureHandlingResult.restartable(rootCauseExecutionVertex.getID(), new RuntimeException("Expected exception: root cause"), System.currentTimeMillis(), StreamSupport.stream(executionGraph.getAllExecutionVertices().spliterator(), false).map(ExecutionVertex::getID).collect(Collectors.toSet()), 0L, false);
    FailureHandlingResultSnapshot.create(failureHandlingResult, this::getLatestExecution);
}
Also used : FailureHandlingResult(org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult) ExecutionVertex(org.apache.flink.runtime.executiongraph.ExecutionVertex) Test(org.junit.Test)

Example 17 with FailureHandlingResult

use of org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult in project flink by splunk.

the class FailureHandlingResultSnapshot method create.

/**
 * Creates a {@code FailureHandlingResultSnapshot} based on the passed {@link
 * FailureHandlingResult} and {@link ExecutionVertex ExecutionVertices}.
 *
 * @param failureHandlingResult The {@code FailureHandlingResult} that is used for extracting
 *     the failure information.
 * @param latestExecutionLookup The look-up function for retrieving the latest {@link Execution}
 *     instance for a given {@link ExecutionVertexID}.
 * @return The {@code FailureHandlingResultSnapshot}.
 */
public static FailureHandlingResultSnapshot create(FailureHandlingResult failureHandlingResult, Function<ExecutionVertexID, Execution> latestExecutionLookup) {
    final Execution rootCauseExecution = failureHandlingResult.getExecutionVertexIdOfFailedTask().map(latestExecutionLookup).orElse(null);
    Preconditions.checkArgument(rootCauseExecution == null || rootCauseExecution.getFailureInfo().isPresent(), String.format("The execution %s didn't provide a failure info even though the corresponding ExecutionVertex %s is marked as having handled the root cause of this failure.", // added to make the compiler happy
    rootCauseExecution != null ? rootCauseExecution.getAttemptId() : "(null)", failureHandlingResult.getExecutionVertexIdOfFailedTask().map(Objects::toString).orElse("(null)")));
    final ExecutionVertexID rootCauseExecutionVertexId = failureHandlingResult.getExecutionVertexIdOfFailedTask().orElse(null);
    final Set<Execution> concurrentlyFailedExecutions = failureHandlingResult.getVerticesToRestart().stream().filter(executionVertexId -> !executionVertexId.equals(rootCauseExecutionVertexId)).map(latestExecutionLookup).filter(execution -> execution.getFailureInfo().isPresent()).collect(Collectors.toSet());
    return new FailureHandlingResultSnapshot(rootCauseExecution, ErrorInfo.handleMissingThrowable(failureHandlingResult.getError()), failureHandlingResult.getTimestamp(), concurrentlyFailedExecutions);
}
Also used : ErrorInfo(org.apache.flink.runtime.executiongraph.ErrorInfo) Set(java.util.Set) ExecutionVertexID(org.apache.flink.runtime.scheduler.strategy.ExecutionVertexID) Preconditions(org.apache.flink.util.Preconditions) Function(java.util.function.Function) Collectors(java.util.stream.Collectors) VisibleForTesting(org.apache.flink.annotation.VisibleForTesting) Execution(org.apache.flink.runtime.executiongraph.Execution) FailureHandlingResult(org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult) Objects(java.util.Objects) Optional(java.util.Optional) ExecutionVertex(org.apache.flink.runtime.executiongraph.ExecutionVertex) Collections(java.util.Collections) Nullable(javax.annotation.Nullable) Execution(org.apache.flink.runtime.executiongraph.Execution) ExecutionVertexID(org.apache.flink.runtime.scheduler.strategy.ExecutionVertexID)

Example 18 with FailureHandlingResult

use of org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult in project flink by splunk.

the class FailureHandlingResultSnapshotTest method testRootCauseVertexNotFailed.

@Test(expected = IllegalArgumentException.class)
public void testRootCauseVertexNotFailed() {
    final ExecutionVertex rootCauseExecutionVertex = extractExecutionVertex(0);
    final FailureHandlingResult failureHandlingResult = FailureHandlingResult.restartable(rootCauseExecutionVertex.getID(), new RuntimeException("Expected exception: root cause"), System.currentTimeMillis(), StreamSupport.stream(executionGraph.getAllExecutionVertices().spliterator(), false).map(ExecutionVertex::getID).collect(Collectors.toSet()), 0L, false);
    FailureHandlingResultSnapshot.create(failureHandlingResult, this::getLatestExecution);
}
Also used : FailureHandlingResult(org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult) ExecutionVertex(org.apache.flink.runtime.executiongraph.ExecutionVertex) Test(org.junit.Test)

Example 19 with FailureHandlingResult

use of org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult in project flink by splunk.

the class FailureHandlingResultSnapshotTest method testMissingThrowableHandling.

// see FLINK-22060/FLINK-21376
@Test
public void testMissingThrowableHandling() {
    final ExecutionVertex rootCauseExecutionVertex = extractExecutionVertex(0);
    final long rootCauseTimestamp = triggerFailure(rootCauseExecutionVertex, null);
    final FailureHandlingResult failureHandlingResult = FailureHandlingResult.restartable(rootCauseExecutionVertex.getID(), null, rootCauseTimestamp, StreamSupport.stream(executionGraph.getAllExecutionVertices().spliterator(), false).map(ExecutionVertex::getID).collect(Collectors.toSet()), 0L, false);
    final FailureHandlingResultSnapshot testInstance = FailureHandlingResultSnapshot.create(failureHandlingResult, this::getLatestExecution);
    final Throwable actualException = new SerializedThrowable(testInstance.getRootCause()).deserializeError(ClassLoader.getSystemClassLoader());
    assertThat(actualException, IsInstanceOf.instanceOf(FlinkException.class));
    assertThat(actualException, FlinkMatchers.containsMessage(ErrorInfo.handleMissingThrowable(null).getMessage()));
    assertThat(testInstance.getTimestamp(), is(rootCauseTimestamp));
    assertThat(testInstance.getRootCauseExecution().isPresent(), is(true));
    assertThat(testInstance.getRootCauseExecution().get(), is(rootCauseExecutionVertex.getCurrentExecutionAttempt()));
}
Also used : FailureHandlingResult(org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult) SerializedThrowable(org.apache.flink.util.SerializedThrowable) ExecutionVertex(org.apache.flink.runtime.executiongraph.ExecutionVertex) FlinkException(org.apache.flink.util.FlinkException) SerializedThrowable(org.apache.flink.util.SerializedThrowable) Test(org.junit.Test)

Example 20 with FailureHandlingResult

use of org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult in project flink by splunk.

the class FailureHandlingResultSnapshotTest method testLocalFailureHandlingResultSnapshotCreation.

@Test
public void testLocalFailureHandlingResultSnapshotCreation() {
    final ExecutionVertex rootCauseExecutionVertex = extractExecutionVertex(0);
    final Throwable rootCause = new RuntimeException("Expected exception: root cause");
    final ExecutionVertex otherFailedExecutionVertex = extractExecutionVertex(1);
    final Throwable otherFailure = new IllegalStateException("Expected exception: other failure");
    final long rootCauseTimestamp = triggerFailure(rootCauseExecutionVertex, rootCause);
    triggerFailure(otherFailedExecutionVertex, otherFailure);
    final FailureHandlingResult failureHandlingResult = FailureHandlingResult.restartable(rootCauseExecutionVertex.getID(), rootCause, rootCauseTimestamp, StreamSupport.stream(executionGraph.getAllExecutionVertices().spliterator(), false).map(ExecutionVertex::getID).collect(Collectors.toSet()), 0L, false);
    final FailureHandlingResultSnapshot testInstance = FailureHandlingResultSnapshot.create(failureHandlingResult, this::getLatestExecution);
    assertThat(testInstance.getRootCause(), is(rootCause));
    assertThat(testInstance.getTimestamp(), is(rootCauseTimestamp));
    assertThat(testInstance.getRootCauseExecution().isPresent(), is(true));
    assertThat(testInstance.getRootCauseExecution().get(), is(rootCauseExecutionVertex.getCurrentExecutionAttempt()));
    assertThat(testInstance.getConcurrentlyFailedExecution(), IsIterableContainingInOrder.contains(otherFailedExecutionVertex.getCurrentExecutionAttempt()));
}
Also used : FailureHandlingResult(org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult) SerializedThrowable(org.apache.flink.util.SerializedThrowable) ExecutionVertex(org.apache.flink.runtime.executiongraph.ExecutionVertex) Test(org.junit.Test)

Aggregations

FailureHandlingResult (org.apache.flink.runtime.executiongraph.failover.flip1.FailureHandlingResult)21 ExecutionVertex (org.apache.flink.runtime.executiongraph.ExecutionVertex)15 Test (org.junit.Test)12 SerializedThrowable (org.apache.flink.util.SerializedThrowable)9 FlinkException (org.apache.flink.util.FlinkException)6 Collections (java.util.Collections)3 Objects (java.util.Objects)3 Optional (java.util.Optional)3 Set (java.util.Set)3 Function (java.util.function.Function)3 Collectors (java.util.stream.Collectors)3 Nullable (javax.annotation.Nullable)3 VisibleForTesting (org.apache.flink.annotation.VisibleForTesting)3 ErrorInfo (org.apache.flink.runtime.executiongraph.ErrorInfo)3 Execution (org.apache.flink.runtime.executiongraph.Execution)3 ExecutionVertexID (org.apache.flink.runtime.scheduler.strategy.ExecutionVertexID)3 Preconditions (org.apache.flink.util.Preconditions)3