Automated Masking Sensitive Data from Google BigQuery to Tableau with IBM WKC — Part 1 of 3

Son Le Thanh (Son)
4 min readMay 10, 2021

TLDR: If you are working in an enterprise context and protecting sensitive data from unauthorized users is a concern, I will show you a demo of datasets hosted on Google Big Query governed by IBM Watson Knowledge Catalog and the data masking still preserved when exposed to Tableau Desktop.

Overall Solution Architecture Diagram

Assuming you are the data architect working at a bank, one of the datasets under your manage is customer information who wish to take a mortgage loan. There is a Business Analyst who just joined the bank last week and she is tasked to analyse the historical dataset of mortgage applications that were default to present a report to the head of mortgage loan department. You wish to share the dataset with her but she is not authorized to view customers’ sensitive information such as social security number, phone number, credit card number, email etc.

The dataset is hosted on Google BigQuery and comprises of 10 tables which comprises of customer profile, mortgage applicants, property and record of mortgage applications that have been default etc.

The mortgage application dataset hosted on Google BigQuery — customer sensitive information not masked yet

--

--

Son Le Thanh (Son)

I am a geek, my background was in software development but I enjoyed building community and creating useful content. I am the father of two boys.