Data protection laws that exempt data that is not individually identifiable have led to an explosion in anonymization research. Unfortunately, it is not well understood how well current de-identification and anonymization techniques control risks to privacy and confidentiality, nor how useful anonymized data is for real-world applications. The project addresses anonymization on three fronts:
Through these tasks, the project is advancing our ability to utilize the wealth of data we now collect for the benefit of society, while ensuring individual privacy is protected.
t-Plausibility: Generalizing words to desensitize text, Transactions on Data Privacy 5(3):505-534, December 2012.
From Disambiguation Failures to Common-Sense Knowledge Acquisition: A Day in the Life of an Ontological Semantic System, Proc. of WI-IAT 2011, Lyon, France, August, 2011.
Significance of Term Relationships on Anonymization, International Workshop on Web Intelligence for Information Security at WI-IAT, Lyon, France, August 22, 2011.
Privacy and Uniqueness in High-Dimensional Social Science and Sex Research Datasets (Poster), The International Academy of Sex Research, Los Angeles, California, August 10-13, 2011.
Graph Decomposition and Its Use for Ontology Verification and Semantic Representation, Proc. of ICAI 2011, Las Vegas, July, 2011.
Post-Logical Verification of Ontology and Lexicons: The Ontological Semantic Technology Approach, Proc. of ICAI 2011, Las Vegas, July, 2011.
Document Clustering with Universum, International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Beijing, July 24-28, 2011.
Understanding the Unknown: Unattested Input Processing in Natural Language, Proc. of FUZZ-IEEE, Taipei, Taiwan, June, 2011.
Privacy and De-Identification in High Dimensional Social Science Data Sets (Poster), The 32nd Annual IEEE Symposium on Security and Privacy, Oakland, California, May 22-25, 2011.
Purdue at TREC 2010 Entity Track: a Probabilistic Framework for Matching Types between Candidate and Target Entities, Proceedings of the 18th Text REtrieval Conference (TREC), Gaithersburg, MD, 2010.
Ontological Semantic Technology for Detecting Insider Threat and Social Engineering, New Security Paradigms Workshop, Concord, Massachusetts, September 21-23, 2010.
t-Plausibility: Semantic Preserving Text Sanitization, The 2009 IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT-09), Vancouver, Canada, August 29-31, 2009.
This material is based upon work supported by the National Science Foundation under Grant No. 1012208. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation (NSF).
We wish to thank SmartyStreets for use of their address validation server in the conduct of this research.