Curated CNVs, Sequence Variants, Simplified Access.
The Cancer Genome Atlas (TCGA) is an amazing resource, growing to contain genomic profiles of approaching 12,000 tumors across more than thirty cancer types. By re-analyzing the data and manually curating the results of each sample by trained scientists, the TCGA Premier database has the highest fidelity copy number information from this rich resource.
Features And Benefits
Curated by Experts
Beginning with the raw data available through special access, we have paired the tumors with their matched normal to remove germline polymorphisms. Expert scientists then review the genomic profiles of each sample, correct for baseline ploidy and segmentation, and manually call copy-neutral homozygous segments (LOH) and events on the mosaic threshold boundary.
Integrated Sequence Variants
Sequence variants obtained by whole-exome sequencing are integrated with CNVs, providing a more comprehensive genomic picture.
Fully Annotated Data Set
The data set is fully annotated with reviewer comments for specific samples, calling parameter settings that were used, other sample-specific information along with detailed clinical annotations, such as survival, tumor grade, etc., that foster further analyses.
Why TCGA Premier?
Useful for a wider range of studies
Processing GISTIC samples produced results suited for population-wide profiles, but left individual samples over-segmented and many having incorrect ploidy. By correcting these, and integrating sequence variants, we have enabled the data set to become more useful for studies such as survival analysis or tumor sub-type profiling.