use of io.opencensus.common.Scope in project instrumentation-java by census-instrumentation.
the class OpenCensusTraceLoggingEnhancerTest method getEnhancedLogEntry.
private static LogEntry getEnhancedLogEntry(LoggingEnhancer loggingEnhancer, Span span) {
Scope scope = tracer.withSpan(span);
try {
LogEntry.Builder builder = LogEntry.newBuilder(null);
loggingEnhancer.enhanceLogEntry(builder);
return builder.build();
} finally {
scope.close();
}
}
use of io.opencensus.common.Scope in project beam by apache.
the class QueryChangeStreamAction method run.
/**
* This method will dispatch a change stream query for the given partition, it delegate the
* processing of the records to one of the corresponding action classes registered and it will
* keep the state of the partition up to date in the Connector's metadata table.
*
* <p>The algorithm is as follows:
*
* <ol>
* <li>A change stream query for the partition is performed.
* <li>For each record, we check the type of the record and dispatch the processing to one of
* the actions registered.
* <li>If an {@link Optional} with a {@link ProcessContinuation#stop()} is returned from the
* actions, we stop processing and return.
* <li>Before returning we register a bundle finalizer callback to update the watermark of the
* partition in the metadata tables to the latest processed timestamp.
* <li>When a change stream query finishes successfully (no more records) we update the
* partition state to FINISHED.
* </ol>
*
* There might be cases where due to a split at the exact end timestamp of a partition's change
* stream query, this function could process a residual with an invalid timestamp. In this case,
* the error is ignored and no work is done for the residual.
*
* @param partition the current partition being processed
* @param tracker the restriction tracker of the {@link
* org.apache.beam.sdk.io.gcp.spanner.changestreams.dofn.ReadChangeStreamPartitionDoFn} SDF
* @param receiver the output receiver of the {@link
* org.apache.beam.sdk.io.gcp.spanner.changestreams.dofn.ReadChangeStreamPartitionDoFn} SDF
* @param watermarkEstimator the watermark estimator of the {@link
* org.apache.beam.sdk.io.gcp.spanner.changestreams.dofn.ReadChangeStreamPartitionDoFn} SDF
* @param bundleFinalizer the bundle finalizer for {@link
* org.apache.beam.sdk.io.gcp.spanner.changestreams.dofn.ReadChangeStreamPartitionDoFn} SDF
* bundles
* @return a {@link ProcessContinuation#stop()} if a record timestamp could not be claimed or if
* the partition processing has finished
*/
@SuppressWarnings("nullness")
@VisibleForTesting
public ProcessContinuation run(PartitionMetadata partition, RestrictionTracker<OffsetRange, Long> tracker, OutputReceiver<DataChangeRecord> receiver, ManualWatermarkEstimator<Instant> watermarkEstimator, BundleFinalizer bundleFinalizer) {
final String token = partition.getPartitionToken();
final Timestamp endTimestamp = partition.getEndTimestamp();
/*
* FIXME(b/202802422): Workaround until the backend is fixed.
* The change stream API returns invalid argument if we try to use a child partition start
* timestamp for a previously returned query. If we split at that exact time, we won't be able
* to obtain the child partition on the residual restriction, since it will start at the child
* partition start time.
* To circumvent this, we always start querying one microsecond before the restriction start
* time, and ignore any records that are before the restriction start time. This way the child
* partition should be returned within the query.
*/
final Timestamp restrictionStartTimestamp = Timestamp.ofTimeMicroseconds(tracker.currentRestriction().getFrom());
final Timestamp previousStartTimestamp = Timestamp.ofTimeMicroseconds(TimestampConverter.timestampToMicros(restrictionStartTimestamp) - 1);
final boolean isFirstRun = restrictionStartTimestamp.compareTo(partition.getStartTimestamp()) == 0;
final Timestamp startTimestamp = isFirstRun ? restrictionStartTimestamp : previousStartTimestamp;
try (Scope scope = TRACER.spanBuilder("QueryChangeStreamAction").setRecordEvents(true).startScopedSpan()) {
TRACER.getCurrentSpan().putAttribute(PARTITION_ID_ATTRIBUTE_LABEL, AttributeValue.stringAttributeValue(token));
// TODO: Potentially we can avoid this fetch, by enriching the runningAt timestamp when the
// ReadChangeStreamPartitionDoFn#processElement is called
final PartitionMetadata updatedPartition = Optional.ofNullable(partitionMetadataDao.getPartition(token)).map(partitionMetadataMapper::from).orElseThrow(() -> new IllegalStateException("Partition " + token + " not found in metadata table"));
try (ChangeStreamResultSet resultSet = changeStreamDao.changeStreamQuery(token, startTimestamp, endTimestamp, partition.getHeartbeatMillis())) {
while (resultSet.next()) {
final List<ChangeStreamRecord> records = changeStreamRecordMapper.toChangeStreamRecords(updatedPartition, resultSet.getCurrentRowAsStruct(), resultSet.getMetadata());
Optional<ProcessContinuation> maybeContinuation;
for (final ChangeStreamRecord record : records) {
if (record.getRecordTimestamp().compareTo(restrictionStartTimestamp) < 0) {
continue;
}
if (record instanceof DataChangeRecord) {
maybeContinuation = dataChangeRecordAction.run(updatedPartition, (DataChangeRecord) record, tracker, receiver, watermarkEstimator);
} else if (record instanceof HeartbeatRecord) {
maybeContinuation = heartbeatRecordAction.run(updatedPartition, (HeartbeatRecord) record, tracker, watermarkEstimator);
} else if (record instanceof ChildPartitionsRecord) {
maybeContinuation = childPartitionsRecordAction.run(updatedPartition, (ChildPartitionsRecord) record, tracker, watermarkEstimator);
} else {
LOG.error("[" + token + "] Unknown record type " + record.getClass());
throw new IllegalArgumentException("Unknown record type " + record.getClass());
}
if (maybeContinuation.isPresent()) {
LOG.debug("[" + token + "] Continuation present, returning " + maybeContinuation);
bundleFinalizer.afterBundleCommit(Instant.now().plus(BUNDLE_FINALIZER_TIMEOUT), updateWatermarkCallback(token, watermarkEstimator));
return maybeContinuation.get();
}
}
}
bundleFinalizer.afterBundleCommit(Instant.now().plus(BUNDLE_FINALIZER_TIMEOUT), updateWatermarkCallback(token, watermarkEstimator));
} catch (SpannerException e) {
if (isTimestampOutOfRange(e)) {
LOG.debug("[" + token + "] query change stream is out of range for " + startTimestamp + " to " + endTimestamp + ", finishing stream");
} else {
throw e;
}
}
}
final long endMicros = TimestampConverter.timestampToMicros(endTimestamp);
LOG.debug("[" + token + "] change stream completed successfully");
if (tracker.tryClaim(endMicros)) {
LOG.debug("[" + token + "] Finishing partition");
partitionMetadataDao.updateToFinished(token);
LOG.info("[" + token + "] Partition finished");
}
return ProcessContinuation.stop();
}
use of io.opencensus.common.Scope in project beam by apache.
the class ChildPartitionsRecordAction method run.
/**
* This is the main processing function for a {@link ChildPartitionsRecord}. It returns an {@link
* Optional} of {@link ProcessContinuation} to indicate if the calling function should stop or
* not. If the {@link Optional} returned is empty, it means that the calling function can continue
* with the processing. If an {@link Optional} of {@link ProcessContinuation#stop()} is returned,
* it means that this function was unable to claim the timestamp of the {@link
* ChildPartitionsRecord}, so the caller should stop.
*
* <p>When processing the {@link ChildPartitionsRecord} the following procedure is applied:
*
* <ol>
* <li>We try to claim the child partition record timestamp. If it is not possible, we stop here
* and return.
* <li>We update the watermark to the child partition record timestamp.
* <li>For each child partition, we try to insert them in the metadata tables if they do not
* exist.
* <li>For each child partition, we check if they originate from a split or a merge and
* increment the corresponding metric.
* </ol>
*
* Dealing with partition splits and merge cases is detailed below:
*
* <ul>
* <li>Partition Splits: child partition tokens should not exist in the partition metadata
* table, so new rows are just added to such table. In case of a bundle retry, we silently
* ignore duplicate entries.
* <li>Partition Merges: the first parent partition that receives the child token should succeed
* in inserting it. The remaining parents will silently ignore and skip the insertion.
* </ul>
*
* @param partition the current partition being processed
* @param record the change stream child partition record received
* @param tracker the restriction tracker of the {@link
* org.apache.beam.sdk.io.gcp.spanner.changestreams.dofn.ReadChangeStreamPartitionDoFn} SDF
* @param watermarkEstimator the watermark estimator of the {@link
* org.apache.beam.sdk.io.gcp.spanner.changestreams.dofn.ReadChangeStreamPartitionDoFn} SDF
* @return {@link Optional#empty()} if the caller can continue processing more records. A non
* empty {@link Optional} with {@link ProcessContinuation#stop()} if this function was unable
* to claim the {@link ChildPartitionsRecord} timestamp
*/
@VisibleForTesting
public Optional<ProcessContinuation> run(PartitionMetadata partition, ChildPartitionsRecord record, RestrictionTracker<OffsetRange, Long> tracker, ManualWatermarkEstimator<Instant> watermarkEstimator) {
final String token = partition.getPartitionToken();
try (Scope scope = TRACER.spanBuilder("ChildPartitionsRecordAction").setRecordEvents(true).startScopedSpan()) {
TRACER.getCurrentSpan().putAttribute(PARTITION_ID_ATTRIBUTE_LABEL, AttributeValue.stringAttributeValue(token));
LOG.debug("[" + token + "] Processing child partition record " + record);
final Timestamp startTimestamp = record.getStartTimestamp();
final Instant startInstant = new Instant(startTimestamp.toSqlTimestamp().getTime());
final long startMicros = TimestampConverter.timestampToMicros(startTimestamp);
if (!tracker.tryClaim(startMicros)) {
LOG.debug("[" + token + "] Could not claim queryChangeStream(" + startTimestamp + "), stopping");
return Optional.of(ProcessContinuation.stop());
}
watermarkEstimator.setWatermark(startInstant);
for (ChildPartition childPartition : record.getChildPartitions()) {
processChildPartition(partition, record, childPartition);
}
LOG.debug("[" + token + "] Child partitions action completed successfully");
return Optional.empty();
}
}
use of io.opencensus.common.Scope in project beam by apache.
the class ChildPartitionsRecordAction method processChildPartition.
// Unboxing of runInTransaction result will not produce a null value, we can ignore it
@SuppressWarnings("nullness")
private void processChildPartition(PartitionMetadata partition, ChildPartitionsRecord record, ChildPartition childPartition) {
try (Scope scope = TRACER.spanBuilder("ChildPartitionsRecordAction.processChildPartition").setRecordEvents(true).startScopedSpan()) {
TRACER.getCurrentSpan().putAttribute(PARTITION_ID_ATTRIBUTE_LABEL, AttributeValue.stringAttributeValue(partition.getPartitionToken()));
final String partitionToken = partition.getPartitionToken();
final String childPartitionToken = childPartition.getToken();
final boolean isSplit = isSplit(childPartition);
LOG.debug("[" + partitionToken + "] Processing child partition" + (isSplit ? " split" : " merge") + " event");
final PartitionMetadata row = toPartitionMetadata(record.getStartTimestamp(), partition.getEndTimestamp(), partition.getHeartbeatMillis(), childPartition);
LOG.debug("[" + partitionToken + "] Inserting child partition token " + childPartitionToken);
final Boolean insertedRow = partitionMetadataDao.runInTransaction(transaction -> {
if (transaction.getPartition(childPartitionToken) == null) {
transaction.insert(row);
return true;
} else {
return false;
}
}).getResult();
if (insertedRow && isSplit) {
metrics.incPartitionRecordSplitCount();
} else if (insertedRow) {
metrics.incPartitionRecordMergeCount();
} else {
LOG.debug("[" + partitionToken + "] Child token " + childPartitionToken + " already exists, skipping...");
}
}
}
use of io.opencensus.common.Scope in project beam by apache.
the class ReadChangeStreamPartitionDoFn method processElement.
/**
* Performs a change stream query for a given partition. A different action will be taken
* depending on the type of record received from the query. This component will also reflect the
* partition state in the partition metadata tables.
*
* <p>The processing of a partition is delegated to the {@link QueryChangeStreamAction}.
*
* @param partition the partition to be queried
* @param tracker an instance of {@link ReadChangeStreamPartitionRangeTracker}
* @param receiver a {@link DataChangeRecord} {@link OutputReceiver}
* @param watermarkEstimator a {@link ManualWatermarkEstimator} of {@link Instant}
* @param bundleFinalizer the bundle finalizer
* @return a {@link ProcessContinuation#stop()} if a record timestamp could not be claimed or if
* the partition processing has finished
*/
@ProcessElement
public ProcessContinuation processElement(@Element PartitionMetadata partition, RestrictionTracker<OffsetRange, Long> tracker, OutputReceiver<DataChangeRecord> receiver, ManualWatermarkEstimator<Instant> watermarkEstimator, BundleFinalizer bundleFinalizer) {
final String token = partition.getPartitionToken();
try (Scope scope = TRACER.spanBuilder("ReadChangeStreamPartitionDoFn.processElement").setRecordEvents(true).startScopedSpan()) {
TRACER.getCurrentSpan().putAttribute(PARTITION_ID_ATTRIBUTE_LABEL, AttributeValue.stringAttributeValue(token));
LOG.debug("[" + token + "] Processing element with restriction " + tracker.currentRestriction());
return queryChangeStreamAction.run(partition, tracker, receiver, watermarkEstimator, bundleFinalizer);
}
}
Aggregations