Different relatedness estimates by PLINK and VCFTOOLS despite same method
Search for specific SNPs in VCF files of patients.
Multiallelic variants when merging VCFs with GLnexus
Genomic hypomethylation in cell-free DNA predicts responses to checkpoint blockade in lung and breast cancer
Lung cancer ICB cohort Advanced non-small cell lung carcinoma patients who were treated with anti-PD-1/PD-L1 monotherapy at Samsung Medical Center, Seoul, Republic of Korea were enrolled for this study. The present study has been reviewed and approved by the Institutional Review Board (IRB) of the Samsung Medical Center (IRB no….
Diversity and dissemination of viruses in pathogenic protozoa
Variant calling using HaplotypeCaller does not show #FILTER information
haplotypecaller – NVIDIA Docs
Run a GPU-accelerated haplotypecaller. This tool applies an accelerated GATK CollectMultipleMetrics for assessing the metrics of a BAM file, including alignment success, quality score distributions, GC bias, and sequencing artifacts. This functions as a ‘meta-metrics’ tool, and can run any combination of the available metrics tools in GATK…
Indigenous Australian genomes show deep structure and rich novel variation
Inclusion and ethics The DNA samples analysed in this project form part of a collection of biospecimens, including historically collected samples, maintained under Indigenous governance by the NCIG11 at the John Curtin School of Medical Research at the Australian National University (ANU). NCIG, a statutory body within ANU, was founded…
Individual vs. joint call VCFs
GATK GenomicsDBImport too slow
Variant missing in WGS sample
Help with gatk BaseRecalibrator
How to input list into GenomicsDBImport with snakemake?
How to create interval list from reference fasta or dict file?
GetPileupSummaries intervals-list with Targeted Sequencing?
Apply BQSR for Targeted Sequencing
How to subtract variants from one VCF file to another?
gatk SelectVariants gives a duplicate allele error while extracting SNPs from a VCF file
ASEReadCounter outputs wrong coverage counts
Building reference dbSNP file using WGS samples
Help with gatk CreateSequenceDictionary
The role of APOBEC3B in lung tumor evolution and targeted cancer therapy resistance
Cell line and growth assays Cell lines were grown in Roswell Park Memorial Institute-1640 medium (RPMI-1640) with 1% penicillin–streptomycin (10,000 U ml−1) and 10% FBS or in Iscove’s modified Dulbecco’s medium (IMDM) with 1% penicillin–streptomycin (10,000 U ml−1), l-glutamine (200 mM) and 10% FBS in a humidified incubator with 5% CO2 maintained at 37 °C. Drugs…
GATK Mutect2 mouse dbSNP vcf files recommendations for mouse whole exome data
Longitudinal detection of circulating tumor DNA
Analysis of Roche KAPA Target Enrichment kit experimental data obtained on an Illumina sequencing system is most frequently performed using a variety of publicly available, open-source analysis tools. The typical variant calling analysis workflow consists of sequencing read quality assessment, read filtering, mapping against the reference genome, duplicate removal, coverage…
H101 for cervical cancer | DDDT
Introduction Patients with persistent, recurrent, or metastatic (P/R/M) cervical carcinoma respond poorly to treatment despite the best available therapeutic regimens, with a 5-year survival of 17%.1 Most of them are heavily pretreated with chemotherapy and/or radiotherapy, and many patients experience complications related to treatment or advanced disease, which exclude them…
[maftools] Too many multi_hit and missense mutations
Analyzing somatic mutations by single-cell whole-genome sequencing
Merging several vcf files for GWAS?
Phenotypic drug-susceptibility profiles and genetic analysis based on whole-genome sequencing of Mycobacterium avium complex isolates in Thailand
Abstract Mycobacterium avium complex (MAC) infections are a significant clinical challenge. Determining drug-susceptibility profiles and the genetic basis of drug resistance is crucial for guiding effective treatment strategies. This study aimed to determine the drug-susceptibility profiles of MAC clinical isolates and to investigate the genetic basis conferring drug resistance using…
Creating a Variant containing FASTA for proteomics search from VCF and genomic FASTA
BaseRecalibrator takes forever to run. Any suggestions?
Primate-specific ZNF808 is essential for pancreatic development in humans
Subjects The study was conducted in accordance with the Declaration of Helsinki and all subjects or their parents/guardian gave informed written consent for genetic testing. DNA testing and storage in the Beta Cell Research Bank was approved by the Wales Research Ethics Committee 5 Bangor (REC 17/WA/0327, IRAS project ID…
SNP calling with many samples using bcftools
Query regarding callsets used as known sites in Variant Calling
MemVerge and Sentieon Announce WaveRider for Sentieon to Accelerate Next-Generation Sequencing in the Cloud
Early Customers Realize 10x Increase in Performance and Cloud Cost Savings; Sentieon Software Offered Free in Memory Machine Cloud Subscription MILPITAS, Calif., Nov. 14, 2023 /PRNewswire/ — MemVerge®, pioneers of Big Memory software, and Sentieon®, the market leader in genomics software, today announced a collaboration to accelerate next-generation sequencing (NGS)…
BWA mem -M option for gatk mutect
GATK SelectVariants --remove-unused-alternates dropping real INDELs?
Samtools index not working in Snakemake
I am setting up a Snakemake pipeline for sequencing read alignment and variant calling. But the samtools index rule is not activated, and the subsequent HaplotypeCaller rule fails. I think it is because the samtools index rule is not perceived as necessary to produce the output of rule all…
BAM file for phasing
ILIAD: a suite of automated Snakemake workflows for processing genomic data for downstream applications | BMC Bioinformatics
Pipeline architecture and configuration file Genomic data processing poses a challenge for genetic research studies because it involves multiple program dependency installations, vast numbers of samples with raw data from various next-generation sequencing (NGS) platforms, and inconsistent genetic variant ID and/or positions among datasets. The Iliad suite of genomic data…
Need Help Understanding Variant Calling Issues in De Novo Yeast Assembly
variant calling – How to run a GATK Docker Image with local files?
I’m trying to use the HaplotypeCaller from the GATK toolkit but I keep getting an error. I pulled GATK through Docker and am using this command:

docker run -v /Users/rimo/ -it broadinstitute/gatk:latest gatk HaplotypeCaller -R /Users/rimo/reference.fasta -I /Users/rimo/sample1.bam -O /Users/rimo/sample1.g.vcf.gz -ERC GVCF

/Users/rimo is my home directory; it’s where the…
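One thing worth checking in a command like this, although the truncated post does not confirm it as the cause, is the `-v` argument. Docker's `-v` flag expects `host_path:container_path` for a bind mount; a single path creates an anonymous volume at that location inside the container, so files on the host would not be visible to GATK. The sketch below builds a command line that mounts the host directory at the same path in the container. The helper name `gatk_docker_command` is hypothetical, and `--rm` is used here in place of the original `-it`.

```python
import shlex


def gatk_docker_command(host_dir, reference, bam, output,
                        image="broadinstitute/gatk:latest"):
    """Build a `docker run` argv that bind-mounts host_dir into the container.

    The directory is mounted at the same absolute path inside the container,
    so the host paths passed to -R, -I and -O stay valid (an assumption made
    for simplicity, not something the original post specifies).
    """
    host_dir = host_dir.rstrip("/")
    mount = f"{host_dir}:{host_dir}"  # host_path:container_path
    return [
        "docker", "run", "--rm",
        "-v", mount,
        image,
        "gatk", "HaplotypeCaller",
        "-R", reference,
        "-I", bam,
        "-O", output,
        "-ERC", "GVCF",
    ]


cmd = gatk_docker_command(
    "/Users/rimo/",
    "/Users/rimo/reference.fasta",
    "/Users/rimo/sample1.bam",
    "/Users/rimo/sample1.g.vcf.gz",
)
print(shlex.join(cmd))
```

The printed string can be pasted into a shell, or the list can be passed directly to `subprocess.run(cmd, check=True)`.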
How to slice a CRAM file into the 50kb regions padded with 1kb?
Structural Variants in gnomAD v4
Today, we are thrilled to announce the release of genome-wide structural variants (SVs) for 63,046 unrelated samples with genome sequencing (GS) data. All site-level information for 1,199,117 high-quality SVs discovered in these samples is browsable in the gnomAD browser (gnomAD SV v4) and downloadable from the gnomAD downloads page. For…
Single-nucleus DNA sequencing reveals hidden somatic loss-of-heterozygosity in Cerebral Cavernous Malformations
Ethical statement Our research complies with all relevant ethical regulations, including the Declaration of Helsinki and has been approved by the Institutional Review Boards of University of Chicago, Duke University and the Alliance to Cure Cavernous Malformations. Cerebral cavernous malformation lesions All human CCM tissue specimens have been previously reported18,19…
CombineGVCFs skips a chromosome
Inferring bacterial transmission dynamics using deep sequencing genomic surveillance data
Study design Experiments were performed in accordance with the New Zealand Animal Welfare Act (1999) and institutional guidelines provided by the University of Auckland Animal Ethics Committee, which reviewed and approved these experiments under application R1003. We did not use any specific randomisation process to allocate animals to a particular…
Hey guys, I’m having a problem when using GATK4 BQSR. This dbSNP VCF file has chromosomes notated as 1, 2, … but my reference contigs are chr1, chr2, …, so the contigs are incompatible.
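A naming mismatch like this is often handled by rewriting the contig names in the known-sites VCF so that they match the reference. The following is a minimal sketch, assuming both files are on the same genome assembly and differ only in the `chr` prefix; the function name `add_chr_prefix` and the `MT` to `chrM` mapping are assumptions, and unplaced or alternate contigs generally need an explicit mapping table rather than a prefix.

```python
import re


def add_chr_prefix(vcf_lines, mito_name="chrM"):
    """Yield VCF lines with 'chr'-style contig names.

    Rewrites the ##contig header lines and the CHROM column of records.
    Assumes plain-text, tab-separated VCF lines without trailing newlines.
    """
    def rename(contig):
        if contig.startswith("chr"):
            return contig          # already in the target style
        if contig == "MT":
            return mito_name       # assumption: MT corresponds to chrM
        return "chr" + contig

    contig_header = re.compile(r"^(##contig=<ID=)([^,>]+)")
    for line in vcf_lines:
        if line.startswith("##contig="):
            yield contig_header.sub(
                lambda m: m.group(1) + rename(m.group(2)), line)
        elif line.startswith("#"):
            yield line             # other header lines pass through unchanged
        else:
            chrom, sep, rest = line.partition("\t")
            yield rename(chrom) + sep + rest


lines = [
    "##fileformat=VCFv4.2",
    "##contig=<ID=1,length=248956422>",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "1\t10177\trs367896724\tA\tAC\t.\t.\t.",
    "MT\t73\t.\tA\tG\t.\t.\t.",
]
converted = list(add_chr_prefix(lines))
```

For large compressed files, `bcftools annotate --rename-chrs` with a two-column mapping file does the same job; either way the renamed VCF needs to be re-indexed before GATK will accept it.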
Mycobacterium tuberculosis Sub Lineage 4.2.2/SIT149 as DR
Introduction Antimicrobial resistance is a hidden global pandemic that shattered over 4.9 million people in 2019 alone, and the burden is highest, mainly in low-resource settings.1 Drug-resistant tuberculosis (DR-TB) caused by Mycobacterium tuberculosis (Mtb) complex (MTBC), which is resistant to one or more anti-TB drugs, is a leading global public…
Identification of CCZ1 as an essential lysosomal trafficking regulator in Marburg and Ebola virus infections
Cells and viruses Haploid mSCs AN3–12 are a feeder independent clonal derivative of HMSc2 isolated from mice oocyte and maintained at IMBA52. The AN3–12 library and knocked out cells used for the haploid screening was obtained from IMBA (Austria)14,52. AN3–12 cells were validated by STR analysis. Haploid mES cells were…
Invasive Californian death caps develop mushrooms unisexually and bisexually
Mushroom collecting Sporocarps were collected from various herbaria and during three expeditions to Point Reyes National Seashore (PRNS), California in 2004, 2014 and 2015, and in 2015 from three sites in Portugal. A total of 86 sporocarps were collected: 67 Californian sporocarps (one early herbarium sample dates to 1993), 11…
Confirming called variants
Pre-imputation checks using 1000G data (hg19) for a hg38 VCF
Does GATK SetNmMdAndUqTags reduce the size of a CRAM?
Challenges in Variant Calling and Genotyping with Short-Read Data Mapped to a Pangenome Graph: Seeking Guidance
Building reference dbSNP file for Citrus sinensis using 80 WGS samples
Bootstrapping for BQSR of 80 WGS samples
How to choose the best tool for variant trio analysis
Genotyping, sequencing and analysis of 140,000 adults from Mexico City
Recruitment of study participants The MCPS was established in the late 1990s following discussions between Mexican scientists at the National Autonomous University of Mexico (UNAM) and British scientists at the University of Oxford about how best to measure the changing health effects of tobacco in Mexico. These discussions evolved into…
The mutational signature of hypertrophic cardiomyopathy
Introduction Hypertrophic cardiomyopathy (HCM), characterized by asymmetric hypertrophy of the ventricular wall, is a condition where the heart becomes thickened without a distinct inducement.1,2 Epidemiological investigation shows that the estimated prevalence rate of HCM in the general population is 1:500.3,4 The clinical manifestations vary greatly, with no symptoms and mild…
Determine INDELs number (both classes separately) from reference and graph-based VCF files
How to choose LiftOver chain file
RNAseq based variant dataset in a black poplar association panel | BMC Research Notes
GenotypeGVCFs too many genotypes from pooled samples
Problem with RNAseq MarkDuplicates (Picard)
PTEN-induced kinase 1 gene single-nucleotide variants as biomarkers in adjuvant chemotherapy for colorectal cancer: a retrospective study | BMC Gastroenterology
Tissue samples A total of 84 analytic samples from surgical or biopsy specimens were collected from 84 patients who underwent radical surgery for CRC at Saitama Medical University International Medical Center between January and December 2016. One case was excluded because the specimen was too small; therefore, we used a…
Purpose, Types, Applications, Bioinformatics and more
Next-Generation Sequencing (NGS), also known as high-throughput sequencing, is a revolutionary technology used for determining the sequence of DNA or RNA molecules. It has significantly advanced the field of genomics and has numerous applications in various biological and medical fields. Key Points of Next-Generation Sequencing (NGS): Revolutionary Technology: NGS represents…
Effect of recombination on genetic diversity of Caenorhabditis elegans
Strong correlation exists between recombination rate and abundance and proportion of indels Whole-genome sequence data of many C. elegans wild isolates now exist. These include Illumina paired-end data of over 600 wild isolates by CeNDR, which also obtained first-generation PacBio long-read data of 14 wild isolates. Second-generation PacBio HiFi data20…
How to extract unique SNPs in a VCF file by comparing with multiple VCF files
Filtering variants in a Strelka2 VCF file based on AD and AF
Allele specific binding of histone modifications and a transcription factor does not predict allele specific expression in correlated ChIP-seq peak-exon pairs
ChIP-seq and RNA-seq Tissue sampling and RNA-sequencing for three Holstein dairy cows and two of their foetuses (one male and one female with a shared sire) are described in17 and18. ChIP-sequencing for all tissues was as described in16, with the inclusion of more tissues. Whole genome sequence for each animal…
After running gatk VariantAnnotator -V *_com_norm.vcf -A AlleleFraction -O *_norm_AB.vcf, there is “nan,nan” or “nan” in my VCF file
hg38 1kg/GATK is not available in the Lift Genome Annotation tool
Is a PON necessary for tumor-normal matched Mutect2?
Downstream analysis on multi-sample or single-sample VCF files?
Liftover GRCh37 to hg38 1kg/GATK.
Mismatch repair deficiency is not sufficient to elicit tumor immunogenicity
Mice All animal use was approved by the Department of Comparative Medicine at the Massachusetts Institute of Technology (MIT) and the Institutional Animal Care and Use Committee under protocol no. 0714-076-17. Mice were housed with a 12-h light/12-h dark cycle with temperatures in the range 20–22 °C and 30–70% humidity. KrasLSL-G12D…
sarek: Introduction
Introduction nf-core/sarek is a workflow designed to detect variants on whole genome or targeted sequencing data. Initially designed for Human, and Mouse, it can work on any species with a reference genome. Sarek can also handle tumour / normal pairs and could include additional relapses. The pipeline is built using…
The genomic footprint of whaling and isolation in fin whale populations
Samples and sequencing Tissue samples from 50 fin whales (Balaenoptera physalus) were collected using a standard protocol to obtain skin biopsies from free-ranging cetacean species, which use a small stainless-steel biopsy dart deployed from a crossbow or rifle73,74. These samples were collected throughout the Eastern North Pacific (ENP; N = 30, represented…
bcftools merge is resulting in a lot of missing data, how do I fix this?
Issues while running BaseRecalibrator
READ GROUP in GATK
My fastq files for a sample, with their header lines, looked like this:
HHNG7DSX5_19417170_S118_L003_R1_001.fastq.gz @A00428:335:HHNG7DSX5:3:1101:5466:1000 1:N:0:NGATGTTT+NTCAATTG
HHNG7DSX5_19417170_S118_L003_R2_001.fastq.gz @A00428:335:HHNG7DSX5:3:1101:5466:1000 2:N:0:NGATGTTT+NTCAATTG
HHNG7DSX5_19417170_S118_L004_R1_001.fastq.gz @A00428:335:HHNG7DSX5:4:1101:2302:1000 1:N:0:NGATGTTT+NTCAATTG
HHNG7DSX5_19417170_S118_L004_R2_001.fastq.gz @A00428:335:HHNG7DSX5:4:1101:2302:1000 2:N:0:NGATGTTT+NTCAATTG
I merged L003_R1, L004_R1 and L003_R2, L004_R2. My first question is: should I merge R1 and R2 lanes? I want to…
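A common convention, which the post itself does not state, is to assign one read group per flowcell lane, taking the flowcell and lane from the Illumina header fields (`@instrument:run:flowcell:lane:tile:x:y`). The sketch below derives an `@RG` string from one header line under that assumption. The function name `read_group_from_header` is hypothetical, and using `19417170` as the sample name is a guess based on the file names.

```python
def read_group_from_header(header, sample, library=None,
                           platform="ILLUMINA", barcode=None):
    """Build an @RG string from an Illumina FASTQ header line.

    ID is flowcell.lane; PU is flowcell.lane, with the sample barcode
    appended when one is supplied. Tabs are emitted as a literal
    backslash-t, the form `bwa mem -R` accepts on the command line.
    """
    name = header.lstrip("@").split(" ", 1)[0]
    fields = name.split(":")
    if len(fields) < 7:
        raise ValueError(f"unexpected FASTQ header layout: {header!r}")
    flowcell, lane = fields[2], fields[3]

    unit = f"{flowcell}.{lane}"
    tags = {
        "ID": unit,
        "PU": f"{unit}.{barcode}" if barcode else unit,
        "SM": sample,
        "LB": library or sample,  # assumption: one library per sample
        "PL": platform,
    }
    return "@RG\\t" + "\\t".join(f"{k}:{v}" for k, v in tags.items())


rg_lane3 = read_group_from_header(
    "@A00428:335:HHNG7DSX5:3:1101:5466:1000 1:N:0:NGATGTTT+NTCAATTG",
    sample="19417170",
)
rg_lane4 = read_group_from_header(
    "@A00428:335:HHNG7DSX5:4:1101:2302:1000 1:N:0:NGATGTTT+NTCAATTG",
    sample="19417170",
)
print(rg_lane3)
print(rg_lane4)
```

Under this convention the two lanes get different `ID` and `PU` values but share `SM`, so downstream tools treat them as one sample sequenced in two units. R1 and R2 are mates of the same fragments and are passed to the aligner as a pair rather than concatenated.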
Problem while working with sequenza – Chromosomes out of order
Hi, I’m trying to work with sequenza in order to calculate HRD score of a sample using WES data. When I run sequenza, I get a message saying that “chromosomes are out of order”, and I don’t know how…
WES CNV analysis
Hi, I am new to CNV analysis and a beginner in R. I am trying to call germline CNVs from exome data using ExomeDepth. I only have the raw data with the hg38 reference. If you have ExomeDepth scripts that run on the hg38 reference, kindly share…