Needles in the Haystack: Identifying Individuals Present in Pooled Genomic Data
Figure 2
Distributions of T for out-of-group samples who are related (red line) and unrelated (blue line) to individuals in G for HapMap YRI (A) and HapMap CEPH (B) populations. (C) and (D) show the same distributions as (A) and (B) respectively, with the addition (green line) of individuals who are in G and unrelated to F (i.e., true positives). Dashed black lines indicate the T significance thresholds of ±1.64 at nominal .