Tag: BCFtools

Where to find vcf of dbsnp build 144 ?

Where to find vcf of dbsnp build 144 ? 0 Hi everyone, I have zipped vcf files that I would like to annotate using hg19 bsnp144. I have bed files for each chromosome but, based on other biostar answers (How to add rsIDs to VCF?), it seems it is easier…

Continue Reading Where to find vcf of dbsnp build 144 ?

How to install latest version of bcftools on ubuntu?

How to install latest version of bcftools on ubuntu? 1 Hello everybody, I am trying to install latest version of bcftools in my ubuntu system. But when I run this command sudo apt-get install bcftools It installs bcftoolsv1.10. I want bcftoolsv1.16 or greater. Can anybody please help me out in…

Continue Reading How to install latest version of bcftools on ubuntu?

Are package downgrades a necessary evil in Conda?

Are package downgrades a necessary evil in Conda? 1 I am just trying to get my head around using conda environments. I created a conda environment for a project containing plink2, plink, R and bcftools. When I installed plink, using mamba install -n autozygosity -c conda-forge plink, I got the…

Continue Reading Are package downgrades a necessary evil in Conda?

bcftools view remove (.) id

Hello I have a txt file that consists from CHROM,ID,POS, REF and ALT ( 48 variants ) I want to subset this txt with original VCF to make a new VCF I try to use bcftools using this query bcftools view -T variants.txt mydata.vcf > variant1.vcf but the problem ,…

Continue Reading bcftools view remove (.) id

Inferring and perturbing cell fate regulomes in human brain organoids

Experimental methods Stem cell and organoid culture We used six human iPS cell lines (Hoik1, Wibj2, Kucg2 from the HipSci resource47; 409B2 from the RIKEN BRC cell bank; 01F49i-N-B7 (B7) from Institute of Molecular and Clinical Ophthalmology Basel; and WTC from the Allen Institute) and three human ES cell lines (H1-PAX6YFP…

Continue Reading Inferring and perturbing cell fate regulomes in human brain organoids

Bioinformatics Workflow Developer job with Barrington James

Global biotechnology company focusing on molecular biology and applications, informatics, engineering, electronics, manufacturing and commercialisation. Their sequencing platform allows for rapid insights, that analyse DNA and RNA data. The Customer Analysis Workflows are responsible for communicating bioinformatic analyses to end users through various media from software libraries, notebook tutorials,…

Continue Reading Bioinformatics Workflow Developer job with Barrington James

Tool that can merge 2 VCF files while taking “representational ambiguity” of (multi-allelic) variants into account

Tool that can merge 2 VCF files while taking “representational ambiguity” of (multi-allelic) variants into account 0 Is there a tool that can merge 2 VCF files while taking “representational ambiguity” of multi-allelic variants into account? By: replaying all variant alleles from the 2 VCF files into the reference genome…

Continue Reading Tool that can merge 2 VCF files while taking “representational ambiguity” of (multi-allelic) variants into account

Genomic architecture of adaptive radiation and hybridization in Alpine whitefish

Sampling the radiation To understand the phylogenetic relationships between Alpine whitefish, we carried out whole-genome resequencing on 96 previously collected whitefish (with associated phenotypic measurements including standard length and gill-raker counts; collected in accordance with permits issued by the cantons of Zurich (ZH128/15), Bern (BE68/15), and Lucerne (LU04/14); these fish…

Continue Reading Genomic architecture of adaptive radiation and hybridization in Alpine whitefish

How To Install libhts-dev on Kali Linux

In this tutorial we learn how to install libhts-dev on Kali Linux. libhts-dev is development files for the HTSlib Introduction In this tutorial we learn how to install libhts-dev on Kali Linux. What is libhts-dev HTSlib is an implementation of a unified C library for accessing common file formats, such…

Continue Reading How To Install libhts-dev on Kali Linux

Bcftools equivalent of vcftools conversion to ped & map

Bcftools equivalent of vcftools conversion to ped & map 1 I am converting a VCF to ped & map thus in vcftools vcftools –gzvcf ZZZZZTYT.vcf.gz –plink –out ZZZZZTYT which works fine. However, I have been searching and searching, can bcftools do the same with a bcf? bcftools • 103 views…

Continue Reading Bcftools equivalent of vcftools conversion to ped & map

difficulty filtering vcf file with vcftools

difficulty filtering vcf file with vcftools 1 I had a large VCF file named “common_known_variants.vcf ” which contains all known human variants downloaded from ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606_b151_GRCh38p7/VCF/00-common_all.vcf.gz -O common_known_variants.vcf.gz I’m trying to extract the known variants from only chromosomes 1,2,3,9,22, and X and write them in a new vcf file with the…

Continue Reading difficulty filtering vcf file with vcftools

wrong number of fields ?

Error occurence after merging files with bcftools: wrong number of fields ? 1 I have multiple vcf of CASES and CONTROLS variations annotated by VEP, SNPEff, SnpSift. first pair vcf -> only variations| CASES and CONTROLS second pair vcf -> variations + SnpEff | CASES and CONTROLS third pair vcf->…

Continue Reading wrong number of fields ?

using ANNOVAR annotation clinvar database out wrong position

using ANNOVAR annotation clinvar database out wrong position 0 Hello Biostars, I was trying to annotate the VCF using ANNOVAR,but I get a wrong out ,it seems my clinvar database is not sutibale bcftools_callCommand=call -m -v -o /project/plantform/20220316PCR/03.amplify/L2107973CFD7G5kxT1/L2107973CFD7G5kxT1.variation.vcf /project/plantform/20220316PCR/03.amplify/L2107973CFD7G5kxT1/L2107973CFD7G5kxT1.mpileup.vcf clinvar ANNOVAR • 34 views Read more here: Source link

Continue Reading using ANNOVAR annotation clinvar database out wrong position

Genetic associations at regulatory phenotypes improve fine-mapping of causal variants for 12 immune-mediated diseases

Cooper, G. S., Bynum, M. L. K. & Somers, E. C. Recent insights in the epidemiology of autoimmune diseases: improved prevalence estimates and understanding of clustering of diseases. J. Autoimmun. 33, 197–207 (2009). PubMed  PubMed Central  Google Scholar  El-Gabalawy, H., Guenther, L. C. & Bernstein, C. N. Epidemiology of immune-mediated…

Continue Reading Genetic associations at regulatory phenotypes improve fine-mapping of causal variants for 12 immune-mediated diseases

Genomic analysis on Galaxy using Azure CycleCloud

Cloud computing and digital transformation have been powerful enablers for genomics. Genomics is expected to be an exabase-scale big data domain by 2025, posing data acquisition and storage challenges on par with other major generators of big data. Embracing digital transformation offers a practically limitless ability to meet the genomic…

Continue Reading Genomic analysis on Galaxy using Azure CycleCloud

Errors when compiling older version **samtools**

Errors when compiling older version **samtools** 0 I have downloaded bcf file from this website ricevarmap. In order to “view” this old bcf format and convert it to a newer one, it’s said that I have to install samtools-0.1.17, which has a older version bcftools in it. When I make…

Continue Reading Errors when compiling older version **samtools**

Petabase-scale sequence alignment catalyses viral discovery

Serratus alignment architecture Serratus (v0.3.0) (github.com/ababaian/serratus) is an open-source cloud-infrastructure designed for ultra-high-throughput sequence alignment against a query sequence or pangenome (Extended Data Fig. 1). Serratus compute costs are dependent on search parameters (expanded discussion available: github.com/ababaian/serratus/wiki/pangenome_design). The nucleotide vertebrate viral pangenome search (bowtie2, database size: 79.8 MB) reached processing rates…

Continue Reading Petabase-scale sequence alignment catalyses viral discovery

bcftools merged vcf file assigns all variants to one sample

bcftools merged vcf file assigns all variants to one sample 0 I’ve made one vcf file for each of three samples. I then combined them using bcftools, like so: # Make a list of vcf files to merge cat “${OUT}/results/variants/vcf_list” /mnt/gpfs/live/rd01__/ritd-ag-project-rd018o-mdflo13/data/test/manual/results/variants/3a7a-10.vcf.gz /mnt/gpfs/live/rd01__/ritd-ag-project-rd018o-mdflo13/data/test/manual/results/variants/MF3.vcf.gz /mnt/gpfs/live/rd01__/ritd-ag-project-rd018o-mdflo13/data/test/manual/results/variants/R507H-FB_S355_L001.vcf.gz Then merge the list: bcftools merge -l…

Continue Reading bcftools merged vcf file assigns all variants to one sample

Genome-wide identification of enhancers and transcription factors regulating the myogenic differentiation of bovine satellite cells | BMC Genomics

1. Yin H, Price F, Rudnicki MA. Satellite cells and the muscle stem cell niche. Physiol Rev. 2013;93(1):23–67. CAS  PubMed  PubMed Central  Google Scholar  2. Hoppeler H, Fluck M. Plasticity of skeletal muscle mitochondria: structure and function. Med Sci Sport Exer. 2003;35(1):95–104. CAS  Google Scholar  3. Astruc T: Carcass Composition,…

Continue Reading Genome-wide identification of enhancers and transcription factors regulating the myogenic differentiation of bovine satellite cells | BMC Genomics

bcftools merge of over 9000+ vcf files

Hi all, I have around 9000+ vcf files that I’m trying to merge using bcftools merge. They are all located in their own folder so essentially I have a folder containing 9000+ separate folders, each containing one vcf.gz file. I have tried out the following code via this tutorial bcftools…

Continue Reading bcftools merge of over 9000+ vcf files

Senior Bioinformatics Scientist (Statistical Geneticist) – Research – Cambridge, UK in San Diego, California

Senior Bioinformatics Scientist – Cambridge, UK Candidates wishing to work remotely from the Netherlands, France, or Belgium may also be considered. Overview Since 2001, the cost of DNA sequencing has dropped more than 100,000-fold, from $100,000,000 USD per human genome to less than $600 USD today. This is resulting in…

Continue Reading Senior Bioinformatics Scientist (Statistical Geneticist) – Research – Cambridge, UK in San Diego, California

Padding out a GVCF file with 1000G exomes to get gatk VariantRecalibrator working with a small sample

I’ve got sequencing data for a small 500 bp amplicon from a few samples. GATK best principles suggest running VariantRecalibrator on the GVCF files I generate. I’m trying to get this working, but I get an error about “Found annotations with zero variances”. Reading the gatk manual and other posts…

Continue Reading Padding out a GVCF file with 1000G exomes to get gatk VariantRecalibrator working with a small sample

How to call LOH with FreeC

How to call LOH with FreeC 0 Good morning, I am try to infer loss of heterozygosity (LOH) from WGS data using Freec. For this purpose, I am using these parameters in the “[BAF]” section of the configuration file: [BAF] makePileup = My_somaticVCF.vcf.gz fastaFile = hg19.fa SNPfile = hg19_snp142.SingleDiNucl.1based.txt.gz When…

Continue Reading How to call LOH with FreeC

How to merge vcf files

How to merge vcf files 3 Hi, I have 90 VCF files which I am looking to merge into one VCF file. I am trying to use VCFtools to merge these files. For that I am following the below process but while using vcf-merge command is not able to merge…

Continue Reading How to merge vcf files

GitHub – AI-sandbox/gnomix

This repository includes a python implemenation of Gnomix, a fast and accurate local ancestry method. Gnomix can be used in two ways: training a model from scratch using reference training data or loading a pre-trained Gnomix model (see Pre-Trained Models below) In both cases the models are used to infer…

Continue Reading GitHub – AI-sandbox/gnomix

Filter criteria for variants based on GBS data

Filter criteria for variants based on GBS data 0 Are there recommended filter criteria for variants based on GBS data? I currently use this filter formula that is used in bcbio for WGS based variants soft-filtering bcftools –soft-filter GATKCutoffSNP -e TYPE=”snp” && (MQRankSum < -12.5 || ReadPosRankSum < -8.0 ||…

Continue Reading Filter criteria for variants based on GBS data

plotting roh from bcftools

plotting roh from bcftools 0 Heys, I am following this small tutorial on how to calculate ROHs from a vcf file using bcftools (samtools.github.io/bcftools/howtos/roh-calling.html) and I am getting this txt file: # This file was produced by: bcftools roh(1.10.2+htslib-1.10.2-3) # The command line was: bcftools roh -G30 –AF-dflt 0.4 my_file.vcf…

Continue Reading plotting roh from bcftools

Interpreting output of BCFtools RoH

Interpreting output of BCFtools RoH 0 Hello! I am using BCFtools RoH for the first time, and I am having some trouble understanding its output file. The input is a gvcf file with genotype calls for one sample only, and I want to infer where there might be autozygous tracts….

Continue Reading Interpreting output of BCFtools RoH

How to merge many huge gVCFs with high speed.

How to merge many huge gVCFs with high speed. 3 Hello, In order to perform population gnomic analysis, I am trying to merge many and huge variants data (gVCF), such as several dozens Gb, over 20 files. Bcftools merge and vcf-merge were used so far but very slow to merge…

Continue Reading How to merge many huge gVCFs with high speed.

Phylogeographic reconstruction of the marbled crayfish origin

Procambarus fallax collections and PCR genotyping Animals were collected from various wild populations (Table S1) in compliance with state and local regulations (Georgia department of natural resources scientific collection permit 115621108, state of Florida collection permits S-19-10 and S-20-04). DNA was isolated from abdominal muscle tissue using SDS-based extraction and precipitation…

Continue Reading Phylogeographic reconstruction of the marbled crayfish origin

User friendly (visual&interactive) VCF/BCF mining tools (2021)

What is currently the best user friendly (visual and interactive) VCF/BCF mining tool in 2021? For VCF/BCF similar to size or even larger than the 1000 human genomes VCF? I guess most organization do not have a visual and interactive mining VCF mining tool but use either: A website front-end…

Continue Reading User friendly (visual&interactive) VCF/BCF mining tools (2021)

compare two vcf files

compare two vcf files 1 Hi. I have a problem I want to compare the rs numbers in two vcf files. so I want to check which of the Rs numbers are in the top 10 percent. I don’t know what to do. Can you help me if I have…

Continue Reading compare two vcf files

The sardine run in southeastern Africa is a mass migration into an ecological trap

INTRODUCTION Large-scale annual migrations occur in an extraordinary range of animals, from insects to the great whales. While the driving mechanisms of these migrations are varied and sometimes poorly understood, they often represent a way of optimizing conditions for breeding and adult fitness when these are in conflict. Often, populations…

Continue Reading The sardine run in southeastern Africa is a mass migration into an ecological trap

High frequency of an otherwise rare phenotype in a small and isolated tiger population

Significance Small and isolated populations have low genetic variation due to founding bottlenecks and genetic drift. Few empirical studies demonstrate visible phenotypic change associated with drift using genetic data in endangered species. We used genomic analyses of a captive tiger pedigree to identify the genetic basis for a rare trait,…

Continue Reading High frequency of an otherwise rare phenotype in a small and isolated tiger population

Produce PCA bi-plot for 1000 Genomes Phase III

Note1 – Previous version: Produce PCA bi-plot for 1000 Genomes Phase III in VCF format (old) Note2 – this data is for hg19 / GRCh37 Note3 – GRCh38 data is available HERE The tutorial has been updated based on the 1000 Genomes Phase III imputed genotypes. The original tutorial was…

Continue Reading Produce PCA bi-plot for 1000 Genomes Phase III

Filtering long indels from VCF

Filtering long indels from VCF 1 Hi, to create a multi-sample VCF in a large cohort of WES samples of very different quality I have to select only high-quality variants genotyped in as many samples as possible. I figured out that long indels have low quality only substitutions do not…

Continue Reading Filtering long indels from VCF

BCFtools Allelic Depth format nowhere explained?

BCFtools Allelic Depth format nowhere explained? 1 Hi, I had a question about Allelic depth (AD) of BCFtools. It has this format e.g. AD=262,18,0 – What number shows the depth of the REF and what is the ALT? and what is the ‘0’? I found 1 form where someone said…

Continue Reading BCFtools Allelic Depth format nowhere explained?

bcftools merge

Check out the vcf_merge command I wrote: $ fuc vcf_merge -h usage: fuc vcf_merge [-h] [–how TEXT] [–format TEXT] [–sort] [–collapse] vcf_files [vcf_files …] This command will merge multiple VCF files (both zipped and unzipped). It essentially wraps the ‘pyvcf.merge’ method from the fuc API. By default, only the GT…

Continue Reading bcftools merge

Edit vcf file 0|0 to 0

Edit vcf file 0|0 to 0 1 I have a vcf file with GT format as 0|0 0|1 1|1 etc. I would like to convert those to a single number to create a dosage file. Ex: Editing the vcf so that 0|0 become 0, 0|1 becomes 1 1|1 becomes 2…

Continue Reading Edit vcf file 0|0 to 0

Pacific Biosciences hiring Bioinformatics Software Engineer in United States

PacBio’s Application Software Group focuses on building solid, strategic value around our core data type – highly accurate, long read sequencing – by producing innovative software that unlocks genomics in ways never seen before. We’re growing an interdisciplinary team of bioinformatic experts to tackle some of the most interesting problems…

Continue Reading Pacific Biosciences hiring Bioinformatics Software Engineer in United States

How to filter GATK vcf file using other programs

How to filter GATK vcf file using other programs 0 hi everyone I called variants for a WGS project using GATK (HaplotypeCaller). Now, when I want to filter that VCF file by VariantFiltration command in GATK, so the following error message appears. java.lang.NumberFormatException: For input string: “10.90” I asked my…

Continue Reading How to filter GATK vcf file using other programs

comparing variants between two VCF files

comparing variants between two VCF files 1 I have two VCF files (e.g. SV1.vcf.gz, SV2.vcf.gz) and a bed file (reg.bed). I would like to compare the variants among them in the BED regions. The comparison includes the common variants and unique variants present in SV1 and SV2. I am currently…

Continue Reading comparing variants between two VCF files

bcftools isec -n operators

bcftools isec -n operators 0 I am still very confused by the use of the bcftools isec -n flag. According to the manual: samtools.github.io/bcftools/bcftools.html#isec): -n, –nfiles [+-=]INT|~BITMAP output positions present in this many (=), this many or more (+), this many or fewer (-), or the exact same (~) files…

Continue Reading bcftools isec -n operators

bcftools multiallelic split not working

I am attempting to split multiallelic sites using bcftools norm with the following command: zcat ${inputVcf} | sed ‘s/AD,Number=./AD,Number=R/g’ | sed ‘s/ADR,Number=./ADR,Number=R/g’ | sed ‘s/ADF,Number=./ADF,Number=R/g’ | bcftools norm –fasta-ref ${genomeFa} –check-ref s –multiallelics -any –output ${outputVcf} The sed commands were based on the recommendation from here. However I’m still getting…

Continue Reading bcftools multiallelic split not working

phase_trio.sh | searchcode

phase_trio.sh | searchcode PageRenderTime 24ms CodeModel.GetById 16ms app.highlight 5ms RepoModel.GetById 1ms app.codeStats 0ms /Phase/phase_trio.sh github.com/BioinformaticsArchive/fCNV Shell |…

Continue Reading phase_trio.sh | searchcode

bcftools merge; retaining sample names

bcftools merge; retaining sample names 2 When I do bcftools merge, the headers do not retain the filenames.  How can I specify filenames? This is my command  bcftools merge vcf/unfiltered/*.vcf.gz -O z > msa/pooled.vcf.gz However this is the relevant part of my header, despite the filenames I gave it.  Is…

Continue Reading bcftools merge; retaining sample names

Bcftools how to add DP to FORMAT field (get per sample read depth for REF vs ALT alleles )

Bcftools how to add DP to FORMAT field (get per sample read depth for REF vs ALT alleles ) 1 I’m trying to achieve what this post was looking for Add Dp Tag To Genotype Field Of Vcf File Currently this is my command: bcftools mpileup -Ou –max-depth 8000 –min-MQ…

Continue Reading Bcftools how to add DP to FORMAT field (get per sample read depth for REF vs ALT alleles )

Vcfutils error code

Vcfutils error code 20-08-2021 code at line (I think) just to get it to write a proper fq. Second issue is this error: substr outside of string at /usr/local/bin/object91.ru line We can do this in a single…

Continue Reading Vcfutils error code

Pybedtools error sans

Pybedtools error sans 20-08-2021 pysam – Error when I install samtools for python on windows – i trying install pysam, pybedtools modules on python got error: ($i=1; $i[email protected] temp]$ conda install pysam bedtools hisat2 [ snip. However,…

Continue Reading Pybedtools error sans

how to install conda bcftools +fill-tags plugin ?

how to install conda bcftools +fill-tags plugin ? 0 Hi, I am very new to bioinformatics but I wanted to know is there a way to install bcftools +fill-tags plugin in conda env. Plz consider that when I check my env there is not a bcftools folder and subsequently the…

Continue Reading how to install conda bcftools +fill-tags plugin ?

Calling variants on reads with MAPQ=0 on HaplotypeCaller or bcftools mpileup

Calling variants on reads with MAPQ=0 on HaplotypeCaller or bcftools mpileup 2 I am working with about 500 samples of human exome data. used hg19 to align my reads and ran a standard best-practices GATK workflow. Later only to realise that a small 1Mb loci has not mapped properly due…

Continue Reading Calling variants on reads with MAPQ=0 on HaplotypeCaller or bcftools mpileup

Extracting variations in the gene regions and from 100 bp of gene boundary from multiple VCF files

Extracting variations in the gene regions and from 100 bp of gene boundary from multiple VCF files 0 Hi, I sincerely hope that I am not repeating an already answered question. I couldn’t find the answer to my exact problem. I have three VCF files derived using bcftools (isec). Those…

Continue Reading Extracting variations in the gene regions and from 100 bp of gene boundary from multiple VCF files

How to include/keep only the samples in a list in VCF.gz file?

How to include/keep only the samples in a list in VCF.gz file? 3 Dear Friends, I have a list of 8000 samples in a file “samples.txt”: samples.txt: TCGA..barcode.. TCGA..barcode.. . . I am using bcftools to only keep these samples in the vcf.gz file. The vcf.gz file has 10000 samples….

Continue Reading How to include/keep only the samples in a list in VCF.gz file?

Filter on Allele Balance using BCFTools

Filter on Allele Balance using BCFTools 0 Hi All, I need to filter my variants based on the following criteria. 1) Include SNP sites with at least one heterozygous with allele balance(AB) > 0.15 or at least one homozygous variant 2) Include INDEL sites with at least one heterozygous with…

Continue Reading Filter on Allele Balance using BCFTools

Inquiry related to vcf file and formatting

Hello everyone, I am trying to run predixcan software. But its showing error as segmentation fault implying that there is something wrong with my vcf files. I am sharing the header of vcf file. ##fileformat=VCFv4.1 ##INFO=<ID=LDAF,Number=1,Type=Float,Description=”MLE Allele Frequency Accounting for LD”> ##INFO=<ID=AVGPOST,Number=1,Type=Float,Description=”Average posterior probability from MaCH/Thunder”> ##INFO=<ID=RSQ,Number=1,Type=Float,Description=”Genotype imputation quality from…

Continue Reading Inquiry related to vcf file and formatting

How to set variant FILTER in a VCF file based on overlap with regions in a BED file

I figured out how to do the annotation using BCFTools. 2 steps are needed. Input BED file requires 1 for each region where the annotation should be set Chr_01 1000 2000 1 Chr_05 5000 6000 1 Input header file: ##INFO=<ID=BAD_REGION,Number=0,Type=Flag,Description=”My bad region for some reason”> bgzip and tabix the bed…

Continue Reading How to set variant FILTER in a VCF file based on overlap with regions in a BED file

Understanding bcftools command

Understanding bcftools command 1 I need to perform the following action to combine multiple vcf files into one BCF=/path_to_bcftools export BCFTOOLS_PLUGINS=$BCF/plugins DIR=/path_to_normal_vcf_file $BCF/bcftools merge -m all -f PASS,. –force-samples $DIR/*.vcf.gz | $BCF/bcftools plugin fill-AN-AC | $BCF/bcftools filter -i ‘SUM(AC)>1′ > panel_of_normal.vcf I don’t have access to command-line bcftools, and since…

Continue Reading Understanding bcftools command

EOF marker absent in VCF

EOF marker absent in VCF – can this be safely ignored? 0 Hi, I generated a VCF file using a bcftools mpileup | bcftools call pipeline. I have done this before, and the file produced then looks fine. However, the log for this one had [W::bgzf_read_block] EOF marker is absent….

Continue Reading EOF marker absent in VCF

Error while subsetting VCF – error doesn’t check out with (z)grep

Error while subsetting VCF – error doesn’t check out with (z)grep 0 I’m using bcftools view -s to subset a VCF.gz file. I ran into an error: [E::vcf_parse_format] Number of columns at chr9:44897051 does not match the number of samples (90 vs 99) To look at this site, I ran…

Continue Reading Error while subsetting VCF – error doesn’t check out with (z)grep

bcftools consensus still returns “Could not parse the header” error

bcftools consensus still returns “Could not parse the header” error 0 I attempted to create a consensus fasta file using bcftools, i.e. bgzip -c All_SRR_SNP_Clean.vcf > All_SRR_SNP_Clean.vcf.gz tabix All_SRR_SNP_Clean.vcf.gz cat $ref| bcftools consensus $vcf_dir/All_SRR_SNP_Clean.vcf.gz > consensus.fasta where $ref is the path to a Drosophila reference genome fa and the vcf…

Continue Reading bcftools consensus still returns “Could not parse the header” error

Extremely low number of variants in VCF file after filtering MIN(FORMAT/DP)>10

Extremely low number of variants in VCF file after filtering MIN(FORMAT/DP)>10 0 I’m doing microbiome analysis where I’m looking for SNPs in a large number of microbe species’ genomes. I ran my bcftools pipeline on around 15 bacterial and viral species from which the end result produced a number of…

Continue Reading Extremely low number of variants in VCF file after filtering MIN(FORMAT/DP)>10