Search in sources :

Example 1 with DataCatalogClient

use of com.google.cloud.datacatalog.v1beta1.DataCatalogClient in project DataflowTemplates by GoogleCloudPlatform.

the class DataCatalogSchemaUtils method getSchemasForEntryGroup.

/**
 * Retrieve all of the {@link Schema}s associated to {@link Entry}s in an {@link EntryGroup}.
 */
public static Map<String, Schema> getSchemasForEntryGroup(String gcpProject, String entryGroupId) {
    DataCatalogClient client = null;
    try {
        client = DataCatalogClient.create();
    } catch (IOException e) {
        throw new RuntimeException("Unable to create a DataCatalogClient", e);
    }
    if (client == null) {
        return null;
    }
    String formattedParent = DataCatalogClient.formatEntryGroupName(gcpProject, DEFAULT_LOCATION, entryGroupId);
    List<Entry> entries = new ArrayList<>();
    ListEntriesRequest request = ListEntriesRequest.newBuilder().setParent(formattedParent).build();
    while (true) {
        ListEntriesResponse response = client.listEntriesCallable().call(request);
        entries.addAll(response.getEntriesList());
        String nextPageToken = response.getNextPageToken();
        if (!Strings.isNullOrEmpty(nextPageToken)) {
            request = request.toBuilder().setPageToken(nextPageToken).build();
        } else {
            break;
        }
    }
    LOG.debug("Fetched entries: {}", entries);
    return entries.stream().collect(Collectors.toMap(Entry::getDescription, e -> SchemaUtils.toBeamSchema(e.getSchema())));
}
Also used : ListEntriesResponse(com.google.cloud.datacatalog.v1beta1.ListEntriesResponse) Logger(org.slf4j.Logger) DataCatalogClient(com.google.cloud.datacatalog.v1beta1.DataCatalogClient) CreateEntryRequest(com.google.cloud.datacatalog.v1beta1.CreateEntryRequest) LocationName(com.google.cloud.datacatalog.v1beta1.LocationName) LoggerFactory(org.slf4j.LoggerFactory) IOException(java.io.IOException) UpdateEntryRequest(com.google.cloud.datacatalog.v1beta1.UpdateEntryRequest) EntryGroupName(com.google.cloud.datacatalog.v1.EntryGroupName) Collectors(java.util.stream.Collectors) Schema(org.apache.beam.sdk.schemas.Schema) AlreadyExistsException(com.google.api.gax.rpc.AlreadyExistsException) ApiException(com.google.api.gax.rpc.ApiException) LookupEntryRequest(com.google.cloud.datacatalog.v1beta1.LookupEntryRequest) ArrayList(java.util.ArrayList) EntryGroup(com.google.cloud.datacatalog.v1beta1.EntryGroup) ListEntriesResponse(com.google.cloud.datacatalog.v1beta1.ListEntriesResponse) Strings(com.google.common.base.Strings) List(java.util.List) Entry(com.google.cloud.datacatalog.v1beta1.Entry) Map(java.util.Map) ListEntriesRequest(com.google.cloud.datacatalog.v1beta1.ListEntriesRequest) CreateEntryGroupRequest(com.google.cloud.datacatalog.v1beta1.CreateEntryGroupRequest) Entry(com.google.cloud.datacatalog.v1beta1.Entry) ArrayList(java.util.ArrayList) DataCatalogClient(com.google.cloud.datacatalog.v1beta1.DataCatalogClient) IOException(java.io.IOException) ListEntriesRequest(com.google.cloud.datacatalog.v1beta1.ListEntriesRequest)

Example 2 with DataCatalogClient

use of com.google.cloud.datacatalog.v1beta1.DataCatalogClient in project DataflowTemplates by GoogleCloudPlatform.

the class DataCatalogSchemaUtils method getSchemaFromPubSubTopic.

/**
 * Retrieve the {@link Schema} associated to a Pub/Sub topics in an.
 *
 * <p>This method is to be used in multi-topic mode, where a single {@link Schema} is associated
 * to a single Pub/Sub topic.
 */
public static Schema getSchemaFromPubSubTopic(String gcpProject, String pubsubTopic) {
    DataCatalogClient client = null;
    try {
        client = DataCatalogClient.create();
    } catch (IOException e) {
        throw new RuntimeException("Unable to create a DataCatalogClient", e);
    }
    if (client == null) {
        return null;
    }
    Entry entry = lookupPubSubEntry(client, pubsubTopic, gcpProject);
    if (entry == null) {
        // TODO(pabloem) Handle a failed entry lookup
        return null;
    }
    return SchemaUtils.toBeamSchema(entry.getSchema());
}
Also used : Entry(com.google.cloud.datacatalog.v1beta1.Entry) DataCatalogClient(com.google.cloud.datacatalog.v1beta1.DataCatalogClient) IOException(java.io.IOException)

Example 3 with DataCatalogClient

use of com.google.cloud.datacatalog.v1beta1.DataCatalogClient in project java-docs-samples by GoogleCloudPlatform.

the class CreateFilesetEntry method createEntry.

// Create Fileset Entry.
public static void createEntry(String projectId, String entryGroupId, String entryId) {
    // Currently, Data Catalog stores metadata in the us-central1 region.
    String location = "us-central1";
    // the "close" method on the client to safely clean up any remaining background resources.
    try (DataCatalogClient dataCatalogClient = DataCatalogClient.create()) {
        // that will be used in step 3.
        try {
            dataCatalogClient.deleteEntry(EntryName.of(projectId, location, entryGroupId, entryId).toString());
        } catch (PermissionDeniedException | NotFoundException e) {
            // PermissionDeniedException or NotFoundException are thrown if
            // Entry does not exist.
            System.out.println("Entry does not exist.");
        }
        // that will be used in step 2.
        try {
            dataCatalogClient.deleteEntryGroup(EntryGroupName.of(projectId, location, entryGroupId).toString());
        } catch (PermissionDeniedException | NotFoundException e) {
            // PermissionDeniedException or NotFoundException are thrown if
            // Entry Group does not exist.
            System.out.println("Entry Group does not exist.");
        }
        // 2. Create an Entry Group.
        // Construct the EntryGroup for the EntryGroup request.
        EntryGroup entryGroup = EntryGroup.newBuilder().setDisplayName("My Fileset Entry Group").setDescription("This Entry Group consists of ....").build();
        // Construct the EntryGroup request to be sent by the client.
        CreateEntryGroupRequest entryGroupRequest = CreateEntryGroupRequest.newBuilder().setParent(LocationName.of(projectId, location).toString()).setEntryGroupId(entryGroupId).setEntryGroup(entryGroup).build();
        // Use the client to send the API request.
        EntryGroup entryGroupResponse = dataCatalogClient.createEntryGroup(entryGroupRequest);
        System.out.printf("\nEntry Group created with name: %s\n", entryGroupResponse.getName());
        // 3. Create a Fileset Entry.
        // Construct the Entry for the Entry request.
        Entry entry = Entry.newBuilder().setDisplayName("My Fileset").setDescription("This fileset consists of ....").setGcsFilesetSpec(GcsFilesetSpec.newBuilder().addFilePatterns("gs://cloud-samples-data/*").build()).setSchema(Schema.newBuilder().addColumns(ColumnSchema.newBuilder().setColumn("first_name").setDescription("First name").setMode("REQUIRED").setType("STRING").build()).addColumns(ColumnSchema.newBuilder().setColumn("last_name").setDescription("Last name").setMode("REQUIRED").setType("STRING").build()).addColumns(ColumnSchema.newBuilder().setColumn("addresses").setDescription("Addresses").setMode("REPEATED").setType("RECORD").addSubcolumns(ColumnSchema.newBuilder().setColumn("city").setDescription("City").setMode("NULLABLE").setType("STRING").build()).addSubcolumns(ColumnSchema.newBuilder().setColumn("state").setDescription("State").setMode("NULLABLE").setType("STRING").build()).build()).build()).setType(EntryType.FILESET).build();
        // Construct the Entry request to be sent by the client.
        CreateEntryRequest entryRequest = CreateEntryRequest.newBuilder().setParent(entryGroupResponse.getName()).setEntryId(entryId).setEntry(entry).build();
        // Use the client to send the API request.
        Entry entryResponse = dataCatalogClient.createEntry(entryRequest);
        System.out.printf("\nEntry created with name: %s\n", entryResponse.getName());
    } catch (AlreadyExistsException | IOException e) {
        // AlreadyExistsException's are thrown if EntryGroup or Entry already exists.
        // IOException's are thrown when unable to create the DataCatalogClient,
        // for example an invalid Service Account path.
        System.out.println("Error in create entry process:\n" + e.toString());
    }
}
Also used : Entry(com.google.cloud.datacatalog.v1.Entry) AlreadyExistsException(com.google.api.gax.rpc.AlreadyExistsException) CreateEntryGroupRequest(com.google.cloud.datacatalog.v1.CreateEntryGroupRequest) EntryGroup(com.google.cloud.datacatalog.v1.EntryGroup) DataCatalogClient(com.google.cloud.datacatalog.v1.DataCatalogClient) NotFoundException(com.google.api.gax.rpc.NotFoundException) CreateEntryRequest(com.google.cloud.datacatalog.v1.CreateEntryRequest) PermissionDeniedException(com.google.api.gax.rpc.PermissionDeniedException) IOException(java.io.IOException)

Example 4 with DataCatalogClient

use of com.google.cloud.datacatalog.v1beta1.DataCatalogClient in project DataflowTemplates by GoogleCloudPlatform.

the class DataCatalogSchemaUtils method lookupPubSubEntry.

static Entry lookupPubSubEntry(DataCatalogClient client, String pubsubTopic, String gcpProject) {
    String linkedResource = String.format(DATA_CATALOG_PUBSUB_URI_TEMPLATE, gcpProject, pubsubTopic);
    LOG.info("Looking up LinkedResource {}", linkedResource);
    LookupEntryRequest request = LookupEntryRequest.newBuilder().setLinkedResource(linkedResource).build();
    try {
        Entry entry = client.lookupEntry(request);
        return entry;
    } catch (ApiException e) {
        System.out.println("CANT LOOKUP ENTRY" + e.toString());
        e.printStackTrace();
        LOG.error("ApiException thrown by Data Catalog API:", e);
        return null;
    }
}
Also used : LookupEntryRequest(com.google.cloud.datacatalog.v1beta1.LookupEntryRequest) Entry(com.google.cloud.datacatalog.v1beta1.Entry) ApiException(com.google.api.gax.rpc.ApiException)

Aggregations

Entry (com.google.cloud.datacatalog.v1beta1.Entry)3 IOException (java.io.IOException)3 AlreadyExistsException (com.google.api.gax.rpc.AlreadyExistsException)2 ApiException (com.google.api.gax.rpc.ApiException)2 DataCatalogClient (com.google.cloud.datacatalog.v1beta1.DataCatalogClient)2 LookupEntryRequest (com.google.cloud.datacatalog.v1beta1.LookupEntryRequest)2 NotFoundException (com.google.api.gax.rpc.NotFoundException)1 PermissionDeniedException (com.google.api.gax.rpc.PermissionDeniedException)1 CreateEntryGroupRequest (com.google.cloud.datacatalog.v1.CreateEntryGroupRequest)1 CreateEntryRequest (com.google.cloud.datacatalog.v1.CreateEntryRequest)1 DataCatalogClient (com.google.cloud.datacatalog.v1.DataCatalogClient)1 Entry (com.google.cloud.datacatalog.v1.Entry)1 EntryGroup (com.google.cloud.datacatalog.v1.EntryGroup)1 EntryGroupName (com.google.cloud.datacatalog.v1.EntryGroupName)1 CreateEntryGroupRequest (com.google.cloud.datacatalog.v1beta1.CreateEntryGroupRequest)1 CreateEntryRequest (com.google.cloud.datacatalog.v1beta1.CreateEntryRequest)1 EntryGroup (com.google.cloud.datacatalog.v1beta1.EntryGroup)1 ListEntriesRequest (com.google.cloud.datacatalog.v1beta1.ListEntriesRequest)1 ListEntriesResponse (com.google.cloud.datacatalog.v1beta1.ListEntriesResponse)1 LocationName (com.google.cloud.datacatalog.v1beta1.LocationName)1