Where To Get The Genetic Map For The Haplotype Reference Consortium to phase the data externally using SHAPEIT software with family information

The Sanger Imputation Service does not offer a pipeline that uses family information during phasing. Our pipeline has therefore been to phase the data externally with SHAPEIT, using family information and the genetic maps from 1000 Genomes Phase 3. We then upload the VCFs to the Sanger Imputation Service, choosing “1000 Genomes Phase 3” as the reference panel and “impute with PBWT, no pre-phasing” as the pipeline:

# Download the genetic map for 1000 Genomes
wget https://mathgen.stats.ox.ac.uk/impute/1000GP_Phase3.tgz
tar -xf 1000GP_Phase3.tgz --wildcards "1000GP_Phase3/genetic_map_*"

wget https://mathgen.stats.ox.ac.uk/impute/1000GP_Phase3_chrX.tgz
tar -xf 1000GP_Phase3_chrX.tgz --wildcards "genetic_map_*"

# Phase the autosomes with duoHMM
for i in {1..22}; do
  shapeit -T 8 -B 15Nov2018/CHR"$i" -M 1000GP_Phase3/genetic_map_chr"$i"_combined_b37.txt \
    -W 5 -O FGDS_chr"$i" --states 200 --duohmm
done
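Since the phasing loop reads one map file per chromosome, it can help to confirm that all 22 autosomal maps were actually extracted before starting a long run. The following is only a minimal sketch: the function name `check_genetic_maps` is hypothetical, and it assumes the files follow the `genetic_map_chr<N>_combined_b37.txt` naming used in the loop above.

```shell
# Hypothetical helper: report which per-chromosome genetic map files are
# missing (or empty) in a directory. Assumes the naming scheme
# genetic_map_chr<N>_combined_b37.txt used in the phasing loop.
# Returns 0 if all 22 autosomal maps are present, non-zero otherwise.
check_genetic_maps() {
    map_dir="$1"
    missing=0
    chr=1
    while [ "$chr" -le 22 ]; do
        map="${map_dir}/genetic_map_chr${chr}_combined_b37.txt"
        # -s is true only if the file exists and is not empty
        if [ ! -s "$map" ]; then
            echo "missing or empty: $map" >&2
            missing=$((missing + 1))
        fi
        chr=$((chr + 1))
    done
    [ "$missing" -eq 0 ]
}

# Example usage (not run here):
#   check_genetic_maps 1000GP_Phase3 && echo "all autosomal maps present"
```

The same check would apply unchanged to any other set of maps, as long as they are renamed to (or the pattern is adjusted for) whatever naming they use.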

However, I would now like to find genetic maps for the Haplotype Reference Consortium (HRC) panel, which includes more than 32,000 samples from 20 different cohorts (including UK10K and the 1000 Genomes Project Phase 3). 1000 Genomes Phase 3 has only 2,504 samples, so the larger HRC panel should give more accurate imputation of low-frequency variants, especially in European cohorts (though perhaps not for Asian or African ancestry).

Does anyone know where I can find this information for HRC?


