Dynamic Data Masking in Azure SQL Datawarehouse

I’m leading a project which is using Azure SQL Datawarehouse, and I’m pretty excited to be involved.  I love watching the data take shape, and, for the customer requirements, Azure SQL Datawarehouse is perfect.

secret-3037639_640 Note that my customer details are confidential and that’s why I never give details away such as the customer name and so on. I gain – and retain – my customers based on trust, and, by giving me their data, they are entrusting me with detailed information about their business.

One question they raised was in respect to dynamic data masking, which is present in Azure SQL Database. How does it manifest itself in Azure SQL Datawarehouse? What are the options regarding the management of personally identifiable information?

sasint

As we move ever closer to the implementation of GDPR, more and more people will be asking these questions. With that in mind, I did some research and found there are a number of options, which are listed here. Thank you to the Microsoft people who helped me to come up with some options.

1. Create an Azure SQL Database spoke as part of a hub and spoke architecture.

The Azure SQL Database spoke can create external tables over Azure SQL Datawarehouse tables for moving data into Azure SQL Database to move data into the spoke. One note of warning: It isn’t possible to use DDM over an external table, so the data would have to move into Azure SQL Database.
2. Embed masking logic in views and restrict access.

This is achievable but it is a manual process.
3. Mask the data through the ETL processes creating a second, masked, column.

This depends on the need to query the data. Here, you may need to limit access through stored procs.
On balance, the simplest method overall is to use views to restrict access to certain columns. That said, I an holding a workshop with the customer in the near future in order to see their preferred options. However, I thought that this might help someone else, in the meantime. I hope that you find something that will help you to manage your particular scenario.

Share the Post:

Related Posts

Decision Velocity: Why the Dashboard Era is Evolving into Action

This week’s discussion focuses on the changing role of dashboards in the context of AI and decision velocity. While traditional dashboards often promote passive data observation, organizations must shift towards action-oriented decision-making. High decision velocity is essential for effective strategy, requiring real-time data and confidence in data utility for decision velocity leading to actions.

Read More

Data Resiliency: The Strategic Foundation for AI After Commvault SHIFT

The Commvault SHIFT London event highlighted that data resiliency is essential for effective AI and BI strategies. We need to consider what we do with the data that AI produces, not just the data sources. Without a strong data foundation, businesses risk poor decisions and operational failures. The event advocated for integrating security and data management, deploying tools like Commvault Cloud Unity, and continuously testing recovery plans to ensure organisational resilience.

Read More

Discover more from Jennifer Stirrup: AI Strategy, Data Consulting & BI Expert | Keynote Speaker

Subscribe now to keep reading and get access to the full archive.

Continue reading