I was working for a nonprofit here in Kansas called Children International. I was a global data engineering manager, and I was tasked specifically with developing a data governance program. And at the same time, that's right when Purview came out. So I spent a year in that realm. I've been very plugged into Purview ever since.
The main use is data governance. Now, not every firm is necessarily at that maturity level when it comes to understanding data as an asset. So, a lot of companies use it just as a data dictionary or a business glossary, which is nice, but it doesn't give the entire picture of what Purview can actually do. I have been a part of about a dozen implementations of Purview, which I've led as a consultant and an architect. I'd say I have used maybe twenty-five percent of the features it has to offer. I am still catching up with the rest.
Microsoft Purview breaks down barriers to sharing information, accessing data, and understanding what a piece of data means: how it is defined, how it's interpreted, and how the end user uses it. Purview is at its best when the end users are very involved, because they're the ones who are going to be using it. Say a user sees a report that shows growth year over year, a kind of revenue measure, and thinks: I don't know exactly how this is created, I don't know what fields it uses, I don't know exactly what this means, and a different department in the company may own it. Instead of feeling the need to go out and ask around, they can hop into Purview. So it definitely knocks down a lot of barriers. It eliminates a lot of needless communication between different departments, and it centralizes all information about an organization's data assets.
Speaking of the valuable features, I would speak on behalf of my clients, based on how it looked once it was implemented. The sensitivity labels are big. The insights are also very nice, as they give a visual representation of how the data stewards are performing, especially whether they are tagging technical assets or adding business glossary terms. So, that's big, along with the basic features of Microsoft Purview. The last one that is very big with most clients is data lineage and being able to see the end-to-end flow. They can see that this is data from a Power BI dataset, and then they can follow where that dataset is coming from: a SQL query in the Power BI report itself, data coming from a SQL Server, and so on.
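To make the lineage idea concrete, here is a minimal sketch of following an asset back to its sources. It is not Purview's API or data model; the asset names and the `UPSTREAM` mapping are hypothetical and only mirror the Power BI to SQL Server chain described above.

```python
# Hypothetical lineage edges: each asset maps to the assets it reads from.
# These names are illustrative only and do not come from Purview.
UPSTREAM = {
    "powerbi_report": ["powerbi_dataset"],
    "powerbi_dataset": ["sql_query"],
    "sql_query": ["sql_server_table"],
    "sql_server_table": [],
}


def trace_upstream(asset: str, upstream: dict[str, list[str]]) -> list[str]:
    """Return the asset followed by everything it depends on, nearest first."""
    path: list[str] = []
    seen: set[str] = set()
    stack = [asset]
    while stack:
        current = stack.pop()
        if current in seen:
            continue  # guard against cycles and repeated sources
        seen.add(current)
        path.append(current)
        # Reverse so the first listed source is visited first.
        stack.extend(reversed(upstream.get(current, [])))
    return path


lineage = trace_upstream("powerbi_report", UPSTREAM)
```

Calling `trace_upstream` on the report walks the chain one hop at a time, which is the same "follow it back" experience a lineage view gives an end user.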
In terms of stability, a lot of people think Purview itself is data governance. Data governance is really more like how we now view, let's say, Toyota, Six Sigma, or Agile, where it's embedded in an organization culturally and throughout the work that's being done. It first needs to be adopted and agreed upon by those who will be using it. Purview is the tool, the actual interface that allows it to be done. So in terms of stability, I think Purview does as good a job as it can of providing a foundation that makes stable data governance as easy as possible for those who manage it, but it does come down to those in charge of Purview to make sure that it stays stable. Well, I won't just say stable. I'll say that it is as stable as the data governance maturity that exists within an organization. It can't be more stable than that.
Scalability is fantastic, especially considering the pricing, because you can run incremental scans, which cut down on your computing costs. So, if you have more data and you need to scan more and more, you don't need to scan the exact same data again and again. You may be okay scanning the entire database maybe once a week and then, let's say daily, running incremental scans, and that helps a lot with scalability. I think the more an organization scales and the more it uses Purview with that scaling, the more it gets out of it. The more the stewards are in there tagging things, the more data is ingested, and the more triggers are set up for ingestion runs, the more it ends up helping the organization. The more it scales, the better it is.
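As a rough illustration of why the weekly-full plus daily-incremental pattern helps, here is a small arithmetic sketch. The asset count and the daily change rate are assumptions chosen for the example, not measured figures, and the function says nothing about actual Purview pricing.

```python
def weekly_scan_volume(total_assets: int, daily_change_rate: float, incremental: bool) -> int:
    """Return how many assets get scanned over a 7-day week.

    incremental=False: a full scan every day.
    incremental=True:  one full scan, then six daily scans of changed assets only.
    """
    if not incremental:
        return total_assets * 7
    changed_per_day = round(total_assets * daily_change_rate)
    return total_assets + changed_per_day * 6


# Hypothetical estate: 10,000 assets, 2% of them changing per day.
full_every_day = weekly_scan_volume(10_000, 0.02, incremental=False)
weekly_plus_incremental = weekly_scan_volume(10_000, 0.02, incremental=True)
```

Under these assumed numbers the second schedule scans a small fraction of what daily full scans would, and the gap widens as the estate grows while the change rate stays low.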
The deployment of the solution was done on Azure. A majority of our deployments for our clients are through a private cloud, and they're locked down with private endpoints, private DNS zones, and custom DNS records on the back end if they're on their own servers. So it is fairly locked down, especially since, as we talked about, a lot of the data is sensitive and needs to be protected and monitored. If data needs to be protected and monitored, then it likely needs a secure environment. So it's all been on a private cloud so far.
I was involved in the initial deployment of Purview. The initial deployments were actually straightforward because Purview was a little less feature-rich back then, so there were fewer open questions. We were still figuring out the features, where it works well and where it doesn't, and whether we needed to bring in consultants or could do this in-house. As Microsoft Purview got a little more mature, the deployments got a little more complex, as we saw that there are a hundred ways to do one thing, more or less. I think that applies to anything in software, but it applies to Purview as well. For example, when we control access to a given data source, should we use the policies and governance features within Purview, do it in the Azure portal using RBAC, or do it elsewhere? I think we're past that part, and now we're getting back to some smooth deployments, and a lot of them will be templatized, whether in ARM templates, Bicep, or Terraform. Overall, at first, the setup process was fairly smooth to moderate.
I have not been able to conduct any actual ROI analysis on Purview, just because it is hard to measure those kinds of intangibles. But the results absolutely show up if we consider data to be an asset. Data can also be a liability, just like a can of soup on the shelf in a restaurant. That's an asset as it sits there, but there can also be hazards: if it's a large pallet, it could fall and cause damage. So data could be a liability too. It is not something that currently appears as a clear line on the balance sheet, but I think it'll be made much clearer in the near future, especially as we become more and more data-saturated and more conscious of the value of data. Regarding ROI, I have not done any analysis in that realm, but I have full confidence that there are real returns that will show up one way or another in the future.
Price-wise, I think it's very generous and accessible to not just enterprises but small to medium-sized companies as well. I think it's very fair in terms of how they break apart the storage and the actual computing, and that makes it very accessible. So, that is a very big plus for Purview. I think a lot of clients and organizations would feel that way as well.
On the time front, I can absolutely speak about the cost of the solution, since time is money. Saving analysts, developers, or engineers time can translate into actual savings if you do the right calculation. I've seen it cut down, for a data governance team, I'd say fifty to sixty percent of the time that would be spent answering questions from other parts of the business about what certain data means or where it lives. They can just point people to Purview and have those questions answered there. So that's a big part. And then there's also being able to tie the business to IT and bridge that gap: a technical asset gets ingested from the technical side, and then the business can interact with it as well. In terms of total savings on a team-to-team basis, I could see anywhere from a fifty to seventy-five percent reduction in time spent on those activities just by having Purview.
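The "right calculation" mentioned above could be as simple as the following sketch. Every input here is a hypothetical placeholder (hours spent answering data questions, loaded hourly cost, working weeks); only the fifty to seventy-five percent reduction range comes from the estimate above.

```python
def annual_savings(hours_per_week: float, reduction: float,
                   hourly_cost: float, weeks_per_year: int = 48) -> float:
    """Estimate yearly savings from time no longer spent answering data questions.

    hours_per_week: time a team currently spends on those questions (assumed).
    reduction:      fraction of that time saved, e.g. 0.5 for fifty percent.
    hourly_cost:    loaded cost of one hour of that team's time (assumed).
    """
    return hours_per_week * reduction * hourly_cost * weeks_per_year


# Hypothetical team: 20 hours a week on such questions at 50 per hour.
low_estimate = annual_savings(20, 0.50, 50)
high_estimate = annual_savings(20, 0.75, 50)
```

The point is only that the savings scale linearly with the time recovered, so even a rough estimate of hours spent gives a usable range.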
Speaking about the importance of Purview delivering data protection across multi-cloud and multi-platform environments, including AWS and Google Cloud Platform, I would say that it's very important. I am going to lean on the side that has to do with international data protection and the various laws that force data to stay in certain places. For example, one of my clients is a Fortune 100 company with three main regions around the world: Asia Pacific, the EU, and the US. They had a lot of financial data in these three regions, and even though it was the same kind of data being used in the same applications, regulations and data control barriers meant it couldn't pass between regions. So that was an issue. We had to deploy multiple instances of Purview, with a dev and a prod instance in each of the three regions. That was definitely very important, not only because of regulations but because of their desire to keep their data private, secure, and locked down.
Speaking of how important it is for me that Purview can connect to iOS, Mac, and Android devices and data in other SaaS apps, it is not very important, as my work is typically with legacy databases and data warehouses.
Regarding Purview's data connector platform for supporting ingestion from non-Microsoft data sources, I think it's very robust. Out of the gate, they included data connectors that you can plug straight into Amazon and SAP. I believe they now have connectors for Salesforce, Snowflake, and things like that. So they are absolutely staying right on or ahead of the curve when it comes to making sure that interoperability with other vendors is taken into account, which is awesome.
Speaking of how important Purview's natively integrated compliance across Azure, Dynamics, and Office is, I think it's very important, and it does it very well. I mean, literally without a problem. It's completely seamless.
Speaking of how important it is to me that Purview was built taking into account critical regulations from around the world, I think that it is very important. That's what gave birth to the 200 or so out-of-the-box classifications for things like possible credit card numbers, passport data, and ZIP codes in different countries. They were definitely aware of that, and you can tell from their design process that they keep updating those. So it was definitely built with that in mind.
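For readers who have not worked with classifications, the general technique behind flagging "possible credit card numbers" is usually a pattern match followed by a checksum. The sketch below shows that generic technique with a regular expression and the Luhn check. It is an illustration only and makes no claim about how Purview's built-in classifiers are implemented; `CARD_PATTERN` and both function names are made up for this example.

```python
import re

# Candidate runs of 13 to 19 digits, optionally separated by spaces or hyphens.
CARD_PATTERN = re.compile(r"\b(?:\d[ -]?){13,19}\b")


def luhn_valid(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def find_card_numbers(text: str) -> list[str]:
    """Return digit strings in the text that look like valid card numbers."""
    hits = []
    for match in CARD_PATTERN.finditer(text):
        digits = re.sub(r"\D", "", match.group())
        if 13 <= len(digits) <= 19 and luhn_valid(digits):
            hits.append(digits)
    return hits


# 4111 1111 1111 1111 is a widely used test number that passes the Luhn check.
flagged = find_card_numbers("card 4111 1111 1111 1111 on file")
ignored = find_card_numbers("order 1234567890123456")
```

The checksum step is what keeps an ordinary 16-digit order number from being flagged, which is the same precision trade-off stewards tune when they adjust classification thresholds.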
I have never used Purview for data loss prevention. Typically, when we're deploying Purview, clients are at a stage in their data platform maturity where they already have that taken care of elsewhere.
Speaking of whether Purview helped reduce the number of solutions I need to interact with, I would say that I don't think so because I think data governance exists kind of outside the solution's architecture for these kinds of things. So, I'll say no, but that's not in a bad way. It's just not its job, in my opinion.
In terms of Purview's visibility, I think visibility is the most significant part, since that's what it does, as long as it's set up correctly and maintained by the data stewards and the business experts. When the domain experts who actually know how to interpret the business glossary terms link them to data assets, it is the most useful tool out there.
I haven't used Microsoft Purview's AI. However, I have used Purview's automation, since we have had a few situations that called for it. While not many organizations have had to do this, we had two instances of Purview in a given tenant, dev and prod, so we used some APIs and the Purview CLI and automated a lot of those processes. There is also its workflow automation for approvals and sign-offs when a definition is changed or a term is added, which sends the request to the right people, for example by emailing the right group for approval. So the automation piece is huge. I have not yet used any of the AI-related features.
Speaking of how automation affected my speed and accuracy of risk detection, I would say it went very well, especially when dealing with financial data, possible credit card data, personal information, and health data. With PII or PHI, being able to have those classifications flag the data if it comes through, and then being able to mask or identify it, is huge. That's probably one of my favorite features, considering how it's integrated into the various protection services that Microsoft and Azure have to offer.
Speaking of how automation affected the quality of the insights I have, I would say it has more to do with speed on that front. It's going to be as good as the developers, data stewards, and experts working on it. It'll be as good as the effort they put in to constantly go through and curate or update a certain classification, including what the threshold may be, to make sure it's catching everything it needs to catch for risk detection or risk mitigation. There's definitely a very linear relationship there. The more end users and stewards are involved, the more you get out of every feature that it has to offer.
Regarding Purview enabling me to show my compliance in real time, I believe there are some integrations that I have not personally used, which I think came out after I was last very heavily involved with Purview, maybe four or five months back. So, in terms of real time, I don't think we have used it yet. But I'm sure we'll do that eventually.
Regarding whether Purview helped to reduce the time to action needed for insider threats, I would say it has, especially in the context of early detection of possibly sensitive data, like health data, credit card information, and things like that, and being able to alert the right parties. They can then decide whether they need to mask it or not, or whether there was a problem with the actual data source, where the data slipped through and should have been taken out. So it definitely helps with that kind of early detection and with the speed of making those decisions.
I would say that Purview has helped save time and money for my clients. It's definitely a long road for data governance, and a lot of the benefits aren't necessarily very tangible, so they might not show up on the balance sheet. However, data is an asset and needs to be treated as such. Those who implement these data governance programs and use Purview to do that are absolutely going to be ahead of the curve and will save countless dollars and time. The savings start immediately, but down the road is when you really see those effects come up.
I would agree that Purview has affected my ability to stay on top of compliance.
Regarding others who are looking into evaluating the solution, I think they should prepare by making sure that the individuals or teams who are in charge of, or have a very large say in, how data governance functions within the organization are on board. They need to be willing to go through the motions and be okay with having an iterative process once Purview is deployed. It's not all going to be solved on day one, and they're not going to have everything they need right away. Again, it's a tool that their data stewards and domain experts can use.
Trying to be as unbiased as possible, I would suggest that those planning to use the solution contact Microsoft, since there are a lot of financial incentives, reimbursements, and discounts for Purview demos and PoC engagements. They exist right now, they existed in the past, and I would assume they will in the future. So the first part is the internal understanding of what data governance is and what they want out of it. The second part is something Microsoft does very well, which is bringing in partners that can show them the ropes.
Since there were some features recently that maybe I haven't been able to test, I rate the overall solution a nine out of ten.