We are working on PIA data, which is in encrypted format. We get a set of records for each individual. This record size could be 10 to 50 for each individual. We need to identify the best email and phone number out of these set of records. We are developing a sense data model for that and along with the PIA attributes for each individual, we also take help from different sources – like AT&T and T-Mobile – that are going to provide telecom services. We take data from these providers for validation purposes.
We don't consume their data, but we use their data for validation. We bounce our data into their records and they say, "This this is correct," or "This is not correct."
On GCP, we are exposing our API services to our clients so that they send us their information. It can be single individual records or it can be a batch of their clients.
The clients, in a sense, want to validate their data if there could be chances that it's stale in nature. They want to make sure that their records of individuals are up-to-date.
We are using the latest version on the public cloud and deploying it on GCP.
We are trying our best to improve our existing models and privacy and to keep on updating it, and also we are trying to use reinforcement learning and separate APIs so that if a user wants to update their data, they can do so.