In PPDP (Privacy-Preserving Data Publishing), the table to be published generally has the following form:
T (Explicit_identifier, Quasi_identifier,
Sensitive_attributes, Non-sensitive_attributes)
where Explicit_identifier is a list of attributes that directly identify a person, such
as name, person code, address, social security number and so on. Quasi_identifier
(QID) is a list of attributes whose combination could potentially identify a person,
such as sex, age, job, zip-code and so on. A combination of quasi-identifier values can
match either a unique record in a second table or a small set of records.
Sensitive_attributes are person-specific attributes whose values are sensitive, such
as disease or salary. Non-sensitive_attributes are all other attributes that do not fit
into the previous categories.
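As a minimal sketch of the linking risk described above, the following Python snippet removes the explicit identifiers from a toy table and counts how many records share each combination of QID values. The records, the attribute names and the helpers `drop_explicit_ids` and `qid_group_sizes` are hypothetical and serve only as an illustration.

```python
from collections import Counter

# Hypothetical attribute roles, following the form of T given above.
EXPLICIT_ID = ["name"]
QID = ["sex", "age", "zip"]
SENSITIVE = ["disease"]

# Hypothetical toy records (not real data).
records = [
    {"name": "Alice", "sex": "F", "age": 34, "zip": "1001", "disease": "flu"},
    {"name": "Bob",   "sex": "M", "age": 34, "zip": "1001", "disease": "asthma"},
    {"name": "Carol", "sex": "F", "age": 34, "zip": "1001", "disease": "diabetes"},
    {"name": "Dave",  "sex": "M", "age": 51, "zip": "1002", "disease": "flu"},
]

def drop_explicit_ids(rows, id_attrs):
    """Return copies of the rows without the explicit identifier attributes."""
    return [{k: v for k, v in row.items() if k not in id_attrs} for row in rows]

def qid_group_sizes(rows, qid_attrs):
    """Count how many rows share each combination of QID values."""
    return Counter(tuple(row[a] for a in qid_attrs) for row in rows)

published = drop_explicit_ids(records, EXPLICIT_ID)
sizes = qid_group_sizes(published, QID)

# QID combinations that match exactly one record.
unique_combos = [combo for combo, n in sizes.items() if n == 1]
print(sizes)
print(unique_combos)
```

In this toy table, two QID combinations still match a single record each after the names are removed, so anyone holding a second table with the same QID attributes could link those records back to a person.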
Including too many attributes in the QID causes excessive distortion of the data, while including too few attributes offers insufficient privacy protection.
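The trade-off can be illustrated with a small Python sketch that computes, for several candidate QID choices, the size of the smallest group of records sharing the same QID values. The rows, the column names and the helper `smallest_group` are hypothetical, and the smallest group size is used here only as a rough indicator of how much the data would have to be altered to hide individuals.

```python
from collections import Counter

# Hypothetical records with explicit identifiers already removed.
COLUMNS = ("sex", "age", "job", "zip")
rows = [
    ("F", 34, "clerk", "1001"),
    ("F", 34, "nurse", "1001"),
    ("M", 34, "clerk", "1001"),
    ("M", 34, "clerk", "1001"),
    ("F", 51, "nurse", "1002"),
    ("F", 51, "nurse", "1002"),
]

def smallest_group(rows, qid):
    """Size of the smallest set of rows that agree on all attributes in qid."""
    idx = [COLUMNS.index(a) for a in qid]
    counts = Counter(tuple(r[i] for i in idx) for r in rows)
    return min(counts.values())

# Compare a small, a medium and a large QID.
for qid in [("zip",), ("sex", "zip"), ("sex", "age", "job", "zip")]:
    print(qid, smallest_group(rows, qid))
```

With all four attributes in the QID, some records are alone in their group, so more values would have to be altered before publishing. With only the zip-code in the QID, every group has at least two records, but sex, age and job are left unprotected and remain available for linking.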
Each attribute of T must therefore be classified as Explicit_identifier (ID), Quasi_identifier (QID), Sensitive_attributes (Sensitive) or Non-sensitive_attributes (Nonsensitive) before proceeding to the next step.