Choose attribute types

Introduction

Generally in PPDP (Privacy-Preserving Data Publishing), the table that is to be published is in the following form:

T (Explicit_identifier, Quasi_identifier, Sensitive_attributes, Non-sensitive_attributes)

Where Explicit_identifier is a list of attributes that directly identify person such as name, person code, address, social security number and so on. Quasi_identifier (QID) is a list of attributes whose combination could possibly identify person such as sex, age, job, zip-code and so on. Combination of quasi identifiers can either specify the unique record from second table or small set of records. Sensitive_attributes are sensitive attributes that are specific to person such as diseases or salary. Non-sensitive_attributes are all other attributes that do not fit into previous categories.

Having too many attributes in QID will cause too much distortion in the data while choosing to few attributes in QID will offer insufficient protection for privacy.

Getting started

In this view you need to specify column types for each column. You can get hints from Introduction above on what these should be. To complete this step, follow these steps:
  1. For each attribute choose it's type from combobox. Type should be one of the following: Explicit_identifier (ID), Quasi_identifier (QID), Sensitive_attributes (Sensitive) or Non-sensitive_attributes (Nonsensitive).
  2. When you are done, click next to proceed.
Step 3