Example 1 with ChangeEventSequence

Use of com.google.cloud.teleport.v2.templates.datastream.ChangeEventSequence in project DataflowTemplates by GoogleCloudPlatform.

From the class SpannerTransactionWriterDoFn, method processElement.

@ProcessElement
public void processElement(ProcessContext c) {
    FailsafeElement<String, String> msg = c.element();
    Ddl ddl = c.sideInput(ddlView);
    processedEvents.inc();
    /*
     * Try Catch block to capture any exceptions that might occur while processing
     * DataStream events while writing to Cloud Spanner. All Exceptions that are caught
     * can be retried based on the exception type.
     */
    try {
        JsonNode changeEvent = mapper.readTree(msg.getPayload());
        ChangeEventContext changeEventContext = ChangeEventContextFactory.createChangeEventContext(changeEvent, ddl, shadowTablePrefix, sourceType);
        // Sequence information for the current change event.
        ChangeEventSequence currentChangeEventSequence = ChangeEventSequenceFactory.createChangeEventSequenceFromChangeEventContext(changeEventContext);
        // Start transaction
        spannerAccessor.getDatabaseClient().readWriteTransaction().run((TransactionCallable<Void>) transaction -> {
            ChangeEventSequence previousChangeEventSequence = ChangeEventSequenceFactory.createChangeEventSequenceFromShadowTable(transaction, changeEventContext);
            if (previousChangeEventSequence != null && previousChangeEventSequence.compareTo(currentChangeEventSequence) >= 0) {
                return null;
            }
            transaction.buffer(changeEventContext.getMutations());
            return null;
        });
        com.google.cloud.Timestamp timestamp = com.google.cloud.Timestamp.now();
        c.output(timestamp);
        sucessfulEvents.inc();
    } catch (InvalidChangeEventException e) {
        // Errors that result from invalid change events.
        outputWithErrorTag(c, msg, e, SpannerTransactionWriter.PERMANENT_ERROR_TAG);
        skippedEvents.inc();
    } catch (ChangeEventConvertorException e) {
        // Errors that result during Event conversions are not retryable.
        outputWithErrorTag(c, msg, e, SpannerTransactionWriter.PERMANENT_ERROR_TAG);
        conversionErrors.inc();
    } catch (SpannerException se) {
        /* Errors that happen when writing to Cloud Spanner are considered retryable.
         * Since all event conversion errors are caught beforehand as permanent errors,
         * any other errors encountered while writing to Cloud Spanner can be retried.
         * Examples include:
         * 1. Deadline exceeded errors from Cloud Spanner.
         * 2. Failures due to foreign key/interleaved table constraints.
         * 3. Any transient errors in Cloud Spanner.
         */
        outputWithErrorTag(c, msg, se, SpannerTransactionWriter.RETRYABLE_ERROR_TAG);
        retryableErrors.inc();
    } catch (Exception e) {
        // Any other errors are considered severe and not retryable.
        outputWithErrorTag(c, msg, e, SpannerTransactionWriter.PERMANENT_ERROR_TAG);
        failedEvents.inc();
    }
}
Also used: TransactionCallable(com.google.cloud.spanner.TransactionRunner.TransactionCallable) InvalidChangeEventException(com.google.cloud.teleport.v2.templates.datastream.InvalidChangeEventException) Ddl(com.google.cloud.teleport.v2.templates.spanner.ddl.Ddl) LoggerFactory(org.slf4j.LoggerFactory) Timestamp(com.google.cloud.Timestamp) DeserializationFeature(com.fasterxml.jackson.databind.DeserializationFeature) Metrics(org.apache.beam.sdk.metrics.Metrics) ChangeEventConvertorException(com.google.cloud.teleport.v2.templates.datastream.ChangeEventConvertorException) TupleTag(org.apache.beam.sdk.values.TupleTag) ChangeEventSequenceFactory(com.google.cloud.teleport.v2.templates.datastream.ChangeEventSequenceFactory) JsonNode(com.fasterxml.jackson.databind.JsonNode) ExposedSpannerAccessor(org.apache.beam.sdk.io.gcp.spanner.ExposedSpannerAccessor) ChangeEventContext(com.google.cloud.teleport.v2.templates.datastream.ChangeEventContext) ChangeEventSequence(com.google.cloud.teleport.v2.templates.datastream.ChangeEventSequence) PrintWriter(java.io.PrintWriter) DoFn(org.apache.beam.sdk.transforms.DoFn) Logger(org.slf4j.Logger) StringWriter(java.io.StringWriter) Counter(org.apache.beam.sdk.metrics.Counter) ObjectMapper(com.fasterxml.jackson.databind.ObjectMapper) Serializable(java.io.Serializable) SpannerConfig(org.apache.beam.sdk.io.gcp.spanner.SpannerConfig) SpannerException(com.google.cloud.spanner.SpannerException) PCollectionView(org.apache.beam.sdk.values.PCollectionView) FailsafeElement(com.google.cloud.teleport.v2.values.FailsafeElement) Preconditions(com.google.common.base.Preconditions) ChangeEventContextFactory(com.google.cloud.teleport.v2.templates.datastream.ChangeEventContextFactory)
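The transaction in Example 1 applies a change event only when its sequence is strictly newer than the one recorded for the same row. The following is a minimal sketch of that ordering check, not the template's implementation: `SequenceSketch`, `Seq`, its `timestamp` and `logPosition` fields, and the in-memory map standing in for the shadow table are all hypothetical names chosen for illustration.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch: drop stale or duplicate change events by comparing sequences. */
final class SequenceSketch {

    /** Hypothetical sequence, ordered by timestamp first and log position second. */
    static final class Seq implements Comparable<Seq> {
        final long timestamp;
        final long logPosition;

        Seq(long timestamp, long logPosition) {
            this.timestamp = timestamp;
            this.logPosition = logPosition;
        }

        @Override
        public int compareTo(Seq other) {
            int byTimestamp = Long.compare(timestamp, other.timestamp);
            return byTimestamp != 0 ? byTimestamp : Long.compare(logPosition, other.logPosition);
        }
    }

    // In-memory stand-in for the shadow table: row key -> last applied sequence.
    private final Map<String, Seq> shadow = new HashMap<>();

    /** Returns true if the event was applied, false if it was skipped as stale or duplicate. */
    boolean apply(String rowKey, Seq current) {
        Seq previous = shadow.get(rowKey);
        // Same condition as in Example 1: skip when the stored sequence is equal or newer.
        if (previous != null && previous.compareTo(current) >= 0) {
            return false;
        }
        shadow.put(rowKey, current);
        return true;
    }

    public static void main(String[] args) {
        SequenceSketch writer = new SequenceSketch();
        System.out.println(writer.apply("row-1", new Seq(100, 5))); // true: first event for the row
        System.out.println(writer.apply("row-1", new Seq(100, 5))); // false: duplicate
        System.out.println(writer.apply("row-1", new Seq(90, 9)));  // false: older timestamp
        System.out.println(writer.apply("row-1", new Seq(100, 6))); // true: later log position
    }
}
```

In the real DoFn the read of the previous sequence and the buffered mutations happen inside one Spanner read-write transaction, so the check and the write are atomic; the map above does not model that.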

Example 2 with ChangeEventSequence

Use of com.google.cloud.teleport.v2.templates.datastream.ChangeEventSequence in project DataflowTemplates by GoogleCloudPlatform.

From the class ShadowTableCreator, method constructShadowTable.

/*
 * Constructs a shadow table for a data table in the information schema.
 * Note: Shadow tables for interleaved tables are not interleaved to
 * their shadow parent table.
 */
Table constructShadowTable(Ddl informationSchema, String dataTableName) {
    // Create a new shadow table with the given prefix.
    Table.Builder shadowTableBuilder = Table.builder();
    String shadowTableName = shadowTablePrefix + dataTableName;
    shadowTableBuilder.name(shadowTableName);
    // Add key columns from the data table to the shadow table builder.
    Table dataTable = informationSchema.table(dataTableName);
    Set<String> primaryKeyColNames = dataTable.primaryKeys().stream().map(k -> k.name()).collect(Collectors.toSet());
    List<Column> primaryKeyCols = dataTable.columns().stream().filter(col -> primaryKeyColNames.contains(col.name())).collect(Collectors.toList());
    for (Column col : primaryKeyCols) {
        shadowTableBuilder.addColumn(col);
    }
    // Add primary key constraints.
    for (IndexColumn keyColumn : dataTable.primaryKeys()) {
        if (keyColumn.order() == IndexColumn.Order.ASC) {
            shadowTableBuilder.primaryKey().asc(keyColumn.name()).end();
        } else if (keyColumn.order() == IndexColumn.Order.DESC) {
            shadowTableBuilder.primaryKey().desc(keyColumn.name()).end();
        }
    }
    // Add extra column to track ChangeEventSequence information
    addChangeEventSequenceColumns(shadowTableBuilder);
    return shadowTableBuilder.build();
}
Also used: List(java.util.List) Pair(org.apache.commons.lang3.tuple.Pair) DatastreamConstants(com.google.cloud.teleport.v2.templates.datastream.DatastreamConstants) Ddl(com.google.cloud.teleport.v2.templates.spanner.ddl.Ddl) IndexColumn(com.google.cloud.teleport.v2.templates.spanner.ddl.IndexColumn) Column(com.google.cloud.teleport.v2.templates.spanner.ddl.Column) Map(java.util.Map) Table(com.google.cloud.teleport.v2.templates.spanner.ddl.Table) Set(java.util.Set) Collectors(java.util.stream.Collectors)
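Example 2 builds the shadow table from the data table's primary-key columns plus extra sequence-tracking columns. The sketch below shows only the column-selection pattern (collect key names into a set, then filter the column list), using plain strings instead of the template's `Table`, `Column`, and `IndexColumn` types. The class `ShadowColumnsSketch`, the `shadow_` prefix, and the tracking column name `seq_timestamp` are assumptions made for illustration, not values taken from the template.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

/** Hypothetical sketch of choosing shadow-table columns from a data table definition. */
final class ShadowColumnsSketch {

    /** Shadow table name is the prefix followed by the data table name, as in Example 2. */
    static String shadowTableName(String prefix, String dataTableName) {
        return prefix + dataTableName;
    }

    /**
     * Keeps only the primary-key columns, in the order the data table declares them,
     * then appends the tracking columns.
     */
    static List<String> shadowColumns(
            List<String> dataColumns, List<String> primaryKeys, List<String> trackingColumns) {
        Set<String> keyNames = new HashSet<>(primaryKeys);
        List<String> result = dataColumns.stream()
                .filter(keyNames::contains)
                .collect(Collectors.toCollection(ArrayList::new));
        result.addAll(trackingColumns);
        return result;
    }

    public static void main(String[] args) {
        List<String> columns = Arrays.asList("id", "name", "region", "email");
        List<String> keys = Arrays.asList("region", "id");
        // "seq_timestamp" is a placeholder tracking column, not the template's actual name.
        List<String> tracking = Arrays.asList("seq_timestamp");

        System.out.println(shadowTableName("shadow_", "Users"));     // shadow_Users
        System.out.println(shadowColumns(columns, keys, tracking));  // [id, region, seq_timestamp]
    }
}
```

Because the columns are filtered from the data table's column list, their order follows the column declaration order rather than the key order; in Example 2 the key order is applied separately, when the primary key parts are added.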

Aggregations

Ddl (com.google.cloud.teleport.v2.templates.spanner.ddl.Ddl): 2
DeserializationFeature (com.fasterxml.jackson.databind.DeserializationFeature): 1
JsonNode (com.fasterxml.jackson.databind.JsonNode): 1
ObjectMapper (com.fasterxml.jackson.databind.ObjectMapper): 1
Timestamp (com.google.cloud.Timestamp): 1
SpannerException (com.google.cloud.spanner.SpannerException): 1
TransactionCallable (com.google.cloud.spanner.TransactionRunner.TransactionCallable): 1
ChangeEventContext (com.google.cloud.teleport.v2.templates.datastream.ChangeEventContext): 1
ChangeEventContextFactory (com.google.cloud.teleport.v2.templates.datastream.ChangeEventContextFactory): 1
ChangeEventConvertorException (com.google.cloud.teleport.v2.templates.datastream.ChangeEventConvertorException): 1
ChangeEventSequence (com.google.cloud.teleport.v2.templates.datastream.ChangeEventSequence): 1
ChangeEventSequenceFactory (com.google.cloud.teleport.v2.templates.datastream.ChangeEventSequenceFactory): 1
DatastreamConstants (com.google.cloud.teleport.v2.templates.datastream.DatastreamConstants): 1
InvalidChangeEventException (com.google.cloud.teleport.v2.templates.datastream.InvalidChangeEventException): 1
Column (com.google.cloud.teleport.v2.templates.spanner.ddl.Column): 1
IndexColumn (com.google.cloud.teleport.v2.templates.spanner.ddl.IndexColumn): 1
Table (com.google.cloud.teleport.v2.templates.spanner.ddl.Table): 1
FailsafeElement (com.google.cloud.teleport.v2.values.FailsafeElement): 1
Preconditions (com.google.common.base.Preconditions): 1
PrintWriter (java.io.PrintWriter): 1