Tag: EOF

Possible bugs in Rsubread/stand-alone featureCounts options fracOverlap and largestOverlap with fractional counts

Hi, running Rsubread 2.8.2/2.12.0 or featureCounts 2.0.3/2.0.1, I stumbled over two issues when allowing ambiguous read assignment (-O/allowMultiOverlap): 1) regarding assignment via minimum fractional overlap (--fracOverlap) using the featureCounts stand-alone binary; 2) when combined with --largestOverlap/largestOverlap and --fraction/fraction using the Rsubread featureCounts function or the stand-alone binary. To 1): Assume a read…


Issues importing STARsolo’s output into Seurat

Hello everyone, I have issues importing the filtered matrix files of STARsolo output to use with Seurat. I have tried multiple ways, like: Drosophila.data <- ReadMtx(mtx = "~/genome/matrix/matrix.mtx", cells = "~/genome/matrix/barcodes.tsv", features = "~/genome/matrix/features.tsv") and Drosophila.data <- ReadSTARsolo(data.dir = "~/genome/matrix/") But both give me the following error: Error: Matrix has 13968 rows but found 12507 features….
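A quick way to see which file disagrees is to compare the row count declared in the Matrix Market header with the number of lines in features.tsv. The sketch below is only an illustration in Python, not part of Seurat or STARsolo; the helper names mtx_dimensions and count_lines are hypothetical, and it assumes uncompressed files like the ones in the question.

```python
def mtx_dimensions(mtx_path):
    """Return (rows, cols, entries) from the size line of a Matrix Market file."""
    with open(mtx_path) as handle:
        for line in handle:
            # Banner and comment lines start with '%'; the first other
            # non-empty line is the size line "rows cols entries".
            if line.startswith("%") or not line.strip():
                continue
            rows, cols, entries = (int(x) for x in line.split()[:3])
            return rows, cols, entries
    raise ValueError("no size line found in " + mtx_path)


def count_lines(path):
    """Count non-empty lines, e.g. in features.tsv or barcodes.tsv."""
    with open(path) as handle:
        return sum(1 for line in handle if line.strip())
```

If mtx_dimensions("matrix.mtx")[0] differs from count_lines("features.tsv"), the matrix and the feature list were probably not produced by the same run or were edited separately; the sketch only reports the mismatch and does not explain its cause.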


Freebayes-parallel with large bam file – individual threads running for >6 days

Context: I’m trying to call variants on a sequencing project using pooled genotyping-by-sequencing. Pools consist of 94 samples each, alongside a number of individuals. Sequence data was demultiplexed and then aligned to a reference genome using hisat2, and the resultant bams were merged with samtools merge. The problem bam is…


Samtools Htslib Issues

Issue Title | State | Comments | Created Date | Updated Date
How to get a specific chromosome | open | 1 | 2022-07-14 | 2022-07-18
tabix returns row from VCF file multiple times | open | 4 | 2022-07-11 | 2022-07-18
Modified base parsing failure failure | closed | 0 | 2022-07-01 | 2022-07-18
extract genotype information | open | 1 | 2022-06-24 | 2022-07-18
sam_hdr_remove_lines is inefficient if…


[W::bgzf_read_block] EOF marker is absent reformat.sh

BBMap/BBTools reformat.sh: real error or spurious message? When subsampling paired-end .fastq.gz files using reformat.sh from BBMap/BBTools, I get this error message: [W::bgzf_read_block] EOF marker is absent reformat.sh I’ve checked the input files with gunzip -t, no error. The input files are a…


[W::bgzf_read_block] EOF marker is absent in BBMAP

Hello, I’m asking about an issue encountered in bbmap. I was using bbmap to remove host contaminants from my microbiome data. The commands are as simple as below (the ref folder was already generated in the previous step): bbmap.sh -Xmx42g in=R1.fastq.gz in2=R2.fastq.gz outu=cleaned.interleaved.fastq.gz threads=12 overwrite=t unpigz=t…


Problem with using flagstat after bowtie2 alignment

I’m running bowtie2 to align multiple samples to one reference genome, and then running samtools flagstat to output the results. All but two samples have aligned and I’ve managed to run flagstat on them. For those two samples, when I run flagstat, I first get: [W::bam_hdr_read] EOF marker is absent….


Alternative to samtools quickcheck with bash scripting

Is there an alternative to samtools quickcheck? I need to check some BAM files in order to verify that they’re not truncated. However, I can’t use samtools, since on the Linux machine where I have the data I can’t install…
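One samtools-free way to look for truncation is to test whether the file ends with the 28-byte empty BGZF block that the SAM/BAM specification defines as the EOF marker. The following is a minimal Python sketch of that single test, assuming Python is available on the machine; the function name has_bgzf_eof is hypothetical, and this does not reproduce everything samtools quickcheck does (it does not inspect the header, for example).

```python
import os

# The 28-byte empty BGZF block used as the end-of-file marker,
# as given in the SAM/BAM format specification.
BGZF_EOF = bytes.fromhex(
    "1f8b0804" "00000000" "00ff0600" "42430200"
    "1b000300" "00000000" "00000000"
)


def has_bgzf_eof(path):
    """Return True if the file ends with the BGZF EOF marker block."""
    if os.path.getsize(path) < len(BGZF_EOF):
        return False
    with open(path, "rb") as handle:
        # Jump to the last 28 bytes and compare them with the marker.
        handle.seek(-len(BGZF_EOF), os.SEEK_END)
        return handle.read(len(BGZF_EOF)) == BGZF_EOF
```

A file that fails this test is missing the marker and is likely truncated or was written by a tool that does not emit it. A file that passes can still be damaged elsewhere, so this is a first screen rather than a full integrity check.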


bamdst gives error “EOF marker is absent. The input is probably truncated.”

I created a set of bam files from Poolseq data using bwa -aln, and all of the output files gave the following error when I ran bamdst to get summary statistics on read depth: “EOF marker is…


samtools mpileup fail to create bcf

I have indexed my reference.fasta using bowtie2: bowtie2-build reference.fasta reference.fasta created the bam file from the sam file using samtools, sorted and indexed the bam file: samtools view -S -b Sample1_mapped.sam > Sample1_mapped.bam samtools sort Sample1_mapped.bam -o Sample1_sorted > Sample1_sorted.bam samtools index Sample1_sorted.bam…


split pdb into submodels

There is a program called pdbsplitchains in this repository: github.com/ACRMGroup/bioptools pdbset from the CCP4 package can do all kinds of manipulations on PDB files – including splitting by chain – but it requires a bit of scripting. For example, these several lines would extract chain…


EOF marker absent in VCF

EOF marker absent in VCF – can this be safely ignored? Hi, I generated a VCF file using a bcftools mpileup | bcftools call pipeline. I have done this before, and the file produced then looks fine. However, the log for this one had [W::bgzf_read_block] EOF marker is absent….


Error while subsetting VCF – error doesn’t check out with (z)grep

I’m using bcftools view -s to subset a VCF.gz file. I ran into an error: [E::vcf_parse_format] Number of columns at chr9:44897051 does not match the number of samples (90 vs 99) To look at this site, I ran…
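When grep output is hard to read, a column count per record can be compared against the #CHROM header line directly. The sketch below is an illustrative Python helper, not a bcftools feature; the name find_column_mismatches is hypothetical, and it assumes tab-separated VCF text lines with the nine fixed columns (CHROM through FORMAT) before the samples.

```python
def find_column_mismatches(lines):
    """Return (chrom, pos, found, expected) for records whose number of
    sample columns differs from the #CHROM header line."""
    expected = None
    mismatches = []
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and meta-information lines
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            # Columns after the nine fixed ones are sample names.
            expected = len(fields) - 9
            continue
        if expected is None:
            raise ValueError("data record found before the #CHROM header line")
        found = len(fields) - 9
        if found != expected:
            mismatches.append((fields[0], int(fields[1]), found, expected))
    return mismatches
```

For a bgzip-compressed file, the lines can be supplied with gzip.open(path, "rt") from the Python standard library, which reads multi-member gzip streams. A truncated final record is one possible reason for a short line, though the sketch itself only locates mismatches.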
