Principal Investigator, Andria and Paul Heafy Family Fellow, MIT Whitehead Institute for Biomedical Research
BIOGRAPHY
Sharing sequencing datasets without identifiers has become a common practice in genomics. We recently showed that some datasets can be fully re-identified by using entirely free, publicly accessible Internet resources. I will present quantitative analysis about the probability of identifying US individuals by this technique on hundreds of genetic datasets. In addition, I will demonstrate the power of our approach by tracing back the identities of multiple whole genome datasets in public sequencing repositories.