TY - GEN
T1 - Towards privacy aware data analysis workflows for e-Science
AU - Cheung, Kwok Wai
AU - Gil, Yolanda
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2007/7
Y1 - 2007/7
N2 - e-Science is getting more distributed and collaborative and data privacy quickly becomes a major concern, especially when the data contain sensitive information. Existing data access policies for privacy management are too restrictive for supporting the large variety of data analysis needs in e-Science. In this paper, we argue the need of a new type of policies that govern data privacy based on the type of processing done on the data. A semantic workflow approach is proposed to address the challenge. Data analysis processes are described as workflows. Ontologies for data analysis and privacy preservation describe the functionalities and the privacy attributes of the processes, as well as process-constraining privacy policies. We give some examples of related policies with their potential fields for application explained. Also, we present via a case study on distributed data clustering to illustrate how the approach could be integrated with a workflow system to make it privacy aware.
AB - e-Science is getting more distributed and collaborative and data privacy quickly becomes a major concern, especially when the data contain sensitive information. Existing data access policies for privacy management are too restrictive for supporting the large variety of data analysis needs in e-Science. In this paper, we argue the need of a new type of policies that govern data privacy based on the type of processing done on the data. A semantic workflow approach is proposed to address the challenge. Data analysis processes are described as workflows. Ontologies for data analysis and privacy preservation describe the functionalities and the privacy attributes of the processes, as well as process-constraining privacy policies. We give some examples of related policies with their potential fields for application explained. Also, we present via a case study on distributed data clustering to illustrate how the approach could be integrated with a workflow system to make it privacy aware.
UR - http://www.scopus.com/inward/record.url?scp=51849085568&partnerID=8YFLogxK
M3 - Conference proceeding
AN - SCOPUS:51849085568
SN - 9781577353386
T3 - AAAI Workshop - Technical Report
SP - 17
EP - 25
BT - Semantic e-Science - Papers from the 2007 AAAI Workshop, Technical Report
PB - Association for the Advancement of Artificial Intelligence
T2 - 2007 Workshop on Semantic e-Science (SeS2007), in conjunction with the Twenty-Second Conference of the Association for the Advancement of Artificial Intelligence (AAAI)
Y2 - 22 July 2007 through 26 July 2007
ER -