CTCF Quantitative Trait Loci
This page organises data from the paper Quantitative Genetics of CTCF Binding Reveal Local Sequence Effects and Different Modes of X-Chromosome Association. It complements the supplementary information published with the paper.
Raw Data
The raw data for this project are available from ENA.
Binding Region Phenotypes
This is the phenotype file: a normalised phenotype matrix with each row a binding region and each column a sample.
QTLs
These are the QTLs called at a 1% FDR threshold (q value <= 0.01), keeping only cluster variants, defined as those with a P value within one order of magnitude of the P value of the lead variant for the same binding region.
Allele Specific Summary
This is the summary data for the allele-specific SNPs (i.e., the data behind Figures 4a and 4b).
Chromosome X binding
This is the summary data for the CTCF X chromosome information.
- CTCFsites_chromX_classification.txt: All sites.
Additional Data Files
Here are some additional data files that people might find interesting.
Genotypes: genotype data in VCF format, stored separately for each chromosome. Starting from the 1000 Genomes Phase 1 release, only variants within 50kb of a CTCF binding region, as defined in the phenotype file, are included. Variants with allele frequency less than 5% in the 51 samples in the study are excluded.
- 10.51.maf005hwe1e-4.aa.vcf.gz
- 11.51.maf005hwe1e-4.aa.vcf.gz
- 12.51.maf005hwe1e-4.aa.vcf.gz
- 13.51.maf005hwe1e-4.aa.vcf.gz
- 14.51.maf005hwe1e-4.aa.vcf.gz
- 1.51.maf005hwe1e-4.aa.vcf.gz
- 15.51.maf005hwe1e-4.aa.vcf.gz
- 16.51.maf005hwe1e-4.aa.vcf.gz
- 17.51.maf005hwe1e-4.aa.vcf.gz
- 18.51.maf005hwe1e-4.aa.vcf.gz
- 19.51.maf005hwe1e-4.aa.vcf.gz
- 20.51.maf005hwe1e-4.aa.vcf.gz
- 21.51.maf005hwe1e-4.aa.vcf.gz
- 22.51.maf005hwe1e-4.aa.vcf.gz
- 2.51.maf005hwe1e-4.aa.vcf.gz
- 3.51.maf005hwe1e-4.aa.vcf.gz
- 4.51.maf005hwe1e-4.aa.vcf.gz
- 5.51.maf005hwe1e-4.aa.vcf.gz
- 6.51.maf005hwe1e-4.aa.vcf.gz
- 7.51.maf005hwe1e-4.aa.vcf.gz
- 8.51.maf005hwe1e-4.aa.vcf.gz
- 9.51.maf005hwe1e-4.aa.vcf.gz
- X.51.maf005hwe1e-4.vcf.gz
Allele specific sites per cell line. The columns are:
- chr: Chromosome of SNP
- position: Location of SNP
- ref: Reference allele
- alt: Alternative allele
- ref_count: reference count
- alt_count: alternative count
- percent_ref: percent reference allele (0-1)
- dbSNP: dbSNP ID
- genotype: heterozygous(1|0) only in these files
- low_count: minimum of ref_count and alt_count
- pVal: Binomial P value of allele bias
- Pval.adj: FDR (BH method) corrected binomial P value
- coordinate: chr|position
- gm06986.P.het.txt
- gm06994.P.het.txt
- gm07037.P.het.txt
- gm07051.P.het.txt
- gm07346.P.het.txt
- gm07357.P.het.txt
- gm11829.P.het.txt
- gm11830.P.het.txt
- gm11831.P.het.txt
- gm11832.P.het.txt
- gm11840.P.het.txt
- gm11881.P.het.txt
- gm11894.P.het.txt
- gm11918.P.het.txt
- gm11920.P.het.txt
- gm11931.P.het.txt
- gm11992.P.het.txt
- gm11993.P.het.txt
- gm11994.P.het.txt
- gm11995.P.het.txt
- gm12003.P.het.txt
- gm12005.P.het.txt
- gm12006.P.het.txt
- gm12043.P.het.txt
- gm12045.P.het.txt
- gm12144.P.het.txt
- gm12154.P.het.txt
- gm12155.P.het.txt
- gm12156.P.het.txt
- gm12234.P.het.txt
- gm12249.P.het.txt
- gm12287.P.het.txt
- gm12489.P.het.txt
- gm12749.P.het.txt
- gm12750.P.het.txt
- gm12751.P.het.txt
- gm12760.P.het.txt
- gm12761.P.het.txt
- gm12762.P.het.txt
- gm12763.P.het.txt
- gm12776.P.het.txt
- gm12812.P.het.txt
- gm12813.P.het.txt
- gm12814.P.het.txt
- gm12815.P.het.txt
- gm12828.P.het.txt
- gm12872.P.het.txt
- gm12873.P.het.txt
- gm12874.P.het.txt
- gm12891.P.het.txt
- gm12892.P.het.txt
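To make the derived columns in these files concrete, the following sketch shows how percent_ref, low_count, a binomial P value, and a Benjamini-Hochberg adjusted P value could be recomputed from ref_count and alt_count. It assumes a two-sided exact binomial test against an expected reference fraction of 0.5; the source only states that the test is binomial, so the exact variant used by the authors may differ. The function names are hypothetical.

```python
from math import comb


def percent_ref(ref_count, alt_count):
    """Fraction of reads carrying the reference allele (0-1)."""
    return ref_count / (ref_count + alt_count)


def binomial_two_sided(ref_count, alt_count):
    """Exact two-sided binomial P value under a null reference fraction of 0.5 (assumed)."""
    n = ref_count + alt_count
    if n == 0:
        return 1.0
    low = min(ref_count, alt_count)  # corresponds to the low_count column
    # Lower tail P(X <= low); the null is symmetric, so double it and cap at 1.
    tail = sum(comb(n, k) for k in range(low + 1)) / 2 ** n
    return min(1.0, 2.0 * tail)


def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted P values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


if __name__ == "__main__":
    counts = [(10, 0), (5, 5), (12, 3)]  # hypothetical (ref_count, alt_count) pairs
    p_vals = [binomial_two_sided(r, a) for r, a in counts]
    for (r, a), p, q in zip(counts, p_vals, benjamini_hochberg(p_vals)):
        print(r, a, round(percent_ref(r, a), 3), p, q)
```

The adjustment here is applied across whatever set of P values is passed in; whether the published Pval.adj values were corrected per cell line or across all files is not stated in the source.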
