Questions tagged [google-cloud-dlp]

Questions related to the Google Cloud Data Loss Prevention API. Classification and de-identification of sensitive data like PII. Works on text and images.

With over 120 built-in infoTypes, Cloud Data Loss Prevention API gives you the power to scan, discover, classify, and report on data from virtually anywhere. Cloud DLP has native support for scanning and classifying sensitive data in Cloud Storage, BigQuery, and Datastore and a streaming content API to enable support for additional data sources, custom workloads, and applications.

89 questions
3
votes
1 answer

Can we save wrapped keys generated with cloud KMS keys in DLP deidentification templates(using Python Api)?

I am working on a PII de-identification project and using google cloud's data loss prevention api. Use case: To encrypt a field with cloud KMS key. Created a dlp-deidentification template, here is the snippet: { "deidentify_template":{ …
3
votes
2 answers

How to scan BigQuery table with DLP looking for sensitive data?

I would like to analyze my tables in BigQuery using DLP. It is possible ? How to do that ?
hkanjih
  • 817
  • 1
  • 8
  • 21
3
votes
1 answer

Google Cloud Data Loss Prevention (DLP) scanning .parquet files in GCS

I am new to Google Cloud DLP and I ran a POST https://dlp.googleapis.com/v2beta1/inspect/operations to scan a .parquet file within a Google Cloud Storage directory and also using cloudStorageOptions to save the .csv output. The .parquet file is…
2
votes
1 answer

Can Google DLP used without processing Data in other regions Outside AUstralia

We are trying to create a DLP platform for our company and need to know if we can call DLP API and get the results without actually saving the Document outside of Australia.
Tony
  • 21
  • 1
2
votes
2 answers

How to use custom infotypes in data leak prevention's(Google cloud platform) De-identify Template?

I am working on developing PII de identification application using data leak prevention(GCP). I am using de identification template for the de-identification rules. Issue: I am not able to figure out about using custom info types in the…
Arnab Mukherjee
  • 148
  • 1
  • 13
2
votes
0 answers

Content Compliance for Google Groups?

My company is migrating to GSuite and we have some Content Compliance rules that we need to enforce. We previously enforced these with an email security appliance using Exchange as the platform. The DLP rules for normal Gmail work just fine, but we…
2
votes
3 answers

Dynamic data masking BigQuery

Is there any feature in the BigQuery roadmap to support dynamic data masking? For example, displaying masked data based on the user's roles. I have explored DLP which helps in storing masked data in BigQuery, but with that approach, one will have to…
2
votes
1 answer

How to mask a specified field in a table when I query it using big query?

I want to mask a specified field in a table when I query it using big query. For example: a table named employee has email-id column, so when I query the table all the email-id field should be masked with ****. How can I do it using DLP in google?
2
votes
1 answer

Is it possible to inspect all tables in a BigQuery dataset with one dlpJob?

I'm using Google Cloud DLP to inspect sensitive data in BigQuery. I wonder is it possible to inspect all tables within a dataset with one dlpJob? If so, how should I set the configs? I tried to omit the BQ tableId field in config. But it will return…
2
votes
1 answer

Google cloud DLP API Image redaction in Python error with info_types

Hello I'm trying google's sample code for redacting information from an image in python, I had success with inspecting the info_types in a string, but when I try with the sample code for image inspection in Anaconda distribution it gives me the…
2
votes
1 answer

Processing a Table using Google Cloud DLP API is too slow

Recently, I have been trying to use the Google DLP API in Python 3 to classify the content of tables. I first started by testing the API on small examples, which all worked perfectly. However, as I attempted to send larger tables (1000 rows x 18…
2
votes
1 answer

Google DLP: "ValueError: Protocol message Value has no "stringValue" field."

I have a method where I build a table for multiple items for Google's DLP inspect API which can take either a ContentItem, or a table of values Here is how the request is constructed: def redact_text(text_list): dlp =…
CBredlow
  • 2,603
  • 2
  • 26
  • 46
2
votes
1 answer

Authenticating Google DLP API from Java without setting GOOGLE_APPLICATION_CREDENTIALS?

I am playing around with the Google Cloud DLP Java library. I've set up my service credentials and saved them in a JSON file as per the instructions here: https://cloud.google.com/dlp/docs/libraries. The documentation states that the preferred way…
user2337270
  • 1,073
  • 2
  • 10
  • 26
1
vote
1 answer

DLP data scan from bigquery table showing row_index as null

I have scanned a Bigquery table from Google DLP Console. The scan results are saved back into a big query table. DLP has identified sensitive information, but the row_index is shown as null…
1
vote
1 answer

Image Type PNG not allowed for de-identification with GCP DLP

I got this error: google.api_core.exceptions.InvalidArgument: 400 BytesType.IMAGE_PNG is not supported for use with de-identification. I have this code, and it should work with images, but then I get this error stating it isn't supported. Did I do…
James Black
  • 40,548
  • 9
  • 79
  • 153
1
2 3 4 5 6