Part 3: Machine Learning Ways to De-Identify Personal Data (Homomorphic Encryption)

Homomorphic Encryption: The main idea behind homomorphic encryption is that the inferences we make from computations on encrypted data should be as accurate as if we had computed on the decrypted data. Homomorphic encryption is an evolving field and, at this point in time, it has certain limitations. For example, only polynomial functions can be computed, and only additions and multiplications of integers modulo n are allowed. Many mathematical operations used in even the simplest neural networks are therefore unavailable when training a model on homomorphically encrypted data. As you can see, the final concepts of this methodology are still being developed.
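
To make the idea concrete, here is a minimal, self-contained sketch of the Paillier cryptosystem, a classic partially homomorphic scheme: multiplying two ciphertexts yields a ciphertext of the sum of the underlying plaintexts, so sums can be computed without ever decrypting the inputs. The primes are tiny and hard-coded, so this is an illustration of the concept only, not a secure or production-grade implementation.

```python
# Toy Paillier cryptosystem: additively homomorphic encryption.
# NOT secure -- tiny hard-coded primes, for illustration only.
import math
import random

def generate_keys(p=1789, q=1861):
    """Key generation with small primes (insecure, demo only)."""
    n = p * q
    n_sq = n * n
    lam = math.lcm(p - 1, q - 1)        # Carmichael's function for n = p*q
    g = n + 1                           # standard simplified choice of g
    # mu = (L(g^lam mod n^2))^-1 mod n, where L(x) = (x - 1) // n
    l_value = (pow(g, lam, n_sq) - 1) // n
    mu = pow(l_value, -1, n)
    return (n, g), (lam, mu, n)

def encrypt(pub, m):
    """Encrypt an integer m with 0 <= m < n."""
    n, g = pub
    n_sq = n * n
    r = random.randrange(1, n)
    while math.gcd(r, n) != 1:
        r = random.randrange(1, n)
    return (pow(g, m, n_sq) * pow(r, n, n_sq)) % n_sq

def decrypt(priv, c):
    lam, mu, n = priv
    n_sq = n * n
    l_value = (pow(c, lam, n_sq) - 1) // n
    return (l_value * mu) % n

def add_encrypted(pub, c1, c2):
    """Homomorphic addition: the product of ciphertexts decrypts to the sum."""
    n, _ = pub
    return (c1 * c2) % (n * n)

pub, priv = generate_keys()
c1, c2 = encrypt(pub, 42), encrypt(pub, 58)
c_sum = add_encrypted(pub, c1, c2)
print(decrypt(priv, c_sum))   # -> 100, computed without decrypting c1 or c2
```

Running the script prints 100, the sum of 42 and 58, even though the two inputs were only ever handled in encrypted form. Fully homomorphic schemes extend this idea to richer combinations of additions and multiplications on ciphertexts, at the cost of the limitations described above.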

The appeal of homomorphic encryption is that we don't need to remove any values from the dataset, or mask/anonymize personal data in any way. However, at this point in time, there is not enough practical evidence that it can be used in production-level methodologies; furthermore, there are not many functional homomorphic encryption pipelines available.

Let's imagine a situation where we've removed all personal data from the dataset (or anonymized it and stored it separately from the other values). Most likely, even after removal of the personal data, quasi-identifiers (QIs) are still left in the database.

The biggest problem with keeping quasi-identifiers is that, if the database is attacked, it isn't all that difficult to combine QI values with other open data sources and reveal a person's identity together with their personal/sensitive information. A well-known example is the Netflix Prize competition, whose open dataset was combined with IMDb's movie ratings dataset, compromising the movie-watching history of individuals.
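
To see how little is needed for such a linkage, here is a toy sketch (with made-up column names and records, not the actual Netflix/IMDb data or method) of joining a de-identified table with a public auxiliary source on shared quasi-identifiers:

```python
# Toy linkage attack: joining on quasi-identifiers re-identifies "anonymous" records.
# All column names and rows are made up for illustration.
import pandas as pd

# "De-identified" dataset: direct identifiers removed, QIs and sensitive values kept.
deidentified = pd.DataFrame({
    "zip_code":   ["30301", "30301", "94110"],
    "birth_year": [1984, 1991, 1984],
    "gender":     ["F", "M", "F"],
    "diagnosis":  ["diabetes", "asthma", "hypertension"],   # sensitive attribute
})

# Public auxiliary dataset (e.g. a voter roll) containing names plus the same QIs.
public_records = pd.DataFrame({
    "name":       ["Alice Smith", "Bob Jones", "Carol Lee"],
    "zip_code":   ["30301", "30301", "94110"],
    "birth_year": [1984, 1991, 1984],
    "gender":     ["F", "M", "F"],
})

# A simple join on the quasi-identifiers links names back to sensitive values.
reidentified = deidentified.merge(
    public_records, on=["zip_code", "birth_year", "gender"]
)
print(reidentified[["name", "diagnosis"]])
```

In practice, a handful of such ordinary-looking attributes is often enough to make a record unique, which is why simply dropping names and IDs is not sufficient.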

As a result, insecure data science pipelines that make predictions using such datasets and their QIs can also reveal potentially sensitive/personal information, even after the personal/sensitive data itself has been removed. We need to make sure that no queries with the potential to reveal an individual's personal information can be run against the data. Furthermore, we must make sure that no inference about a data subject can be made by running multiple predictions with machine learning algorithms.
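
As a toy illustration of why unconstrained queries are dangerous, the sketch below (with made-up names and salaries) shows a classic differencing attack: two aggregate queries that each look harmless on their own, but whose difference reveals one person's exact value. Repeated model predictions can be abused in an analogous way.

```python
# Toy differencing attack: two "safe-looking" aggregate queries leak one person's value.
# Data and names are made up for illustration.
salaries = {
    "Alice": 82000,
    "Bob": 61000,
    "Carol": 95000,
}

def query_total(exclude=None):
    """An aggregate query the analyst is allowed to run: a sum over the dataset."""
    return sum(v for k, v in salaries.items() if k != exclude)

# Each query on its own reveals only an aggregate...
total_all = query_total()
total_without_bob = query_total(exclude="Bob")

# ...but their difference is exactly Bob's salary.
print(total_all - total_without_bob)   # -> 61000
```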

