Tag: minimap2
Lh3 Minimap2 Issues
Issue Title State Comments Created Date Updated Date Mapping reads against multi references. Any proposition? open 0 2022-06-28 2022-06-30 Inversion between tandem repeats yields misalignment closed 1 2022-06-21 2022-06-30 use minimap2 to extract mitochondrial reads from genome assembly open 0 2022-06-20 2022-06-30 Asking for #301 to be reopened closed 0…
A closed Candidatus Odinarchaeum chromosome exposes Asgard archaeal viruses
Zaremba-Niedzwiedzka, K. et al. Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature 541, 353–358 (2017). CAS PubMed Article Google Scholar Williams, T. A., Cox, C. J., Foster, P. G., Szöllősi, G. J. & Embley, T. M. Phylogenomics provides robust support for a two-domains tree of life. Nat. Ecol….
In vivo hypermutation and continuous evolution
Arnold, F. H. Design by directed evolution. Acc. Chem. Res. 31, 125–131 (1998). Google Scholar Packer, M. S. & Liu, D. R. Methods for the directed evolution of proteins. Nat. Rev. Genet. 16, 379–394 (2015). Google Scholar Drake, J. W., Charlesworth, B., Charlesworth, D. & Crow, J. F. Rates of…
Accelerating minimap2 for long-read sequencing applications on modern CPUs ePrints@IISc
Kalikar, S and Jain, C and Vasimuddin, M and Misra, S (2022) Accelerating minimap2 for long-read sequencing applications on modern CPUs. In: Nature Computational Science, 2 (2). pp. 78-83. Full text not available from this repository. Abstract Long-read sequencing is now routinely used at scale for genomics and transcriptomics applications….
Postdoctoral Research Fellow in Bioinformatics/Computational Biology
Details Posted: 27-Apr-22 Location: Boston, Massachusetts Salary: Open Categories: Staff/Administrative Internal Number: 2022-27118 Located in Boston and the surrounding communities, Dana-Farber Cancer Institute brings together world renowned clinicians, innovative researchers and dedicated professionals, allies in the common mission of conquering cancer, HIV/AIDS and related diseases. Combining extremely talented people with…
Minimap2 detects too many 16S sequences in metagenome
16S is a very conserved sequence, which is why it’s used for targeted phylogenetic analysis; it makes it easy to amplify and analyse. Unfortunately that conservation is an issue with minimap2, which is built around the idea that matching scattered subsequences within a sequence is good enough for identifying matches…
different result using minimap2 and pbmm2
Hi all! I am analysing CSS Pacbio data and each sample came from different run, in particular I have three files for each sample. I tested both pbmm2 and minimap2 to align my long reads, after getting the consensus sequences. This is the command I used to run mnimap2: minimap2…
Pangenome-based genome inference allows efficient and accurate genotyping across a wide spectrum of variant classes
Sequencing data We used publicly available sequencing data from the GIAB consortium45, 1000 Genomes Project high-coverage data46 and Human Genome Structural Variation Consortium (HGSVC)4. All datasets include only samples consented for public dissemination of the full genomes. Statistics and reproducibility For generating the assemblies, we used all 14 samples for…
Mapping back 3 sets of reads/sample with minimap2
I used FaQC to qc my raw fastqs before assembling. That program (and perhaps others) outputs properly paired Forward and Reverse fastqs, as well as an unpaired fastq file for each sample. I used the all 3 for each single sample assembly. Since minimap2 only allows for 2 query files,…
Why did I achieve shorter than initial reads subset after aligned reads extraction.
Why did I achieve shorter than initial reads subset after aligned reads extraction. 1 Hello dear colleages! I have recently faced some problem. I have worked with long WGS reads. Firstly I have filtered the longest subset of reads, and aligned them to the custom sequence with several structural variants…
Debian — File list of package minimap2/bookworm/i386
/usr/bin/minimap2 /usr/share/doc-base/minimap2.minimap2 /usr/share/doc/minimap2/changelog.Debian.gz /usr/share/doc/minimap2/copyright /usr/share/doc/minimap2/minimap2.pdf /usr/share/doc/minimap2/run-unit-test /usr/share/doc/minimap2/test/MT-human.fa.gz /usr/share/doc/minimap2/test/MT-orang.fa.gz /usr/share/doc/minimap2/test/q-inv.fa.gz /usr/share/doc/minimap2/test/q2.fa /usr/share/doc/minimap2/test/t-inv.fa.gz /usr/share/doc/minimap2/test/t2.fa /usr/share/doc/minimap2/test_script /usr/share/man/man1/minimap2.1.gz Read more here: Source link
Minimap2 options for Nanopore cDNA direct seq
Minimap2 options for Nanopore cDNA direct seq 0 Hello, I’m working with ONT RNA seq data and I used the cDNA direct seq to do the seq. I want to look for long deletions in mRNAs that are not spliced, for this, I want to use the splice option of…
Accelerating minimap2 for long-read sequencing applications on modern CPUs
Chaisson, M. J. et al. Multi-platform discovery of haplotype-resolved structural variation in human genomes. Nat. Commun. 10, 1–16 (2019). Article Google Scholar Conesa, A. et al. A survey of best practices for RNA-seq data analysis. Genome Biol. 17, 1–19 (2016). Article Google Scholar Beyter, D. et al. Long-read sequencing of…
bwa-mem2/mm2-fast: Accelerated version of minimap2; up to 1.8x faster
GitHub – bwa-mem2/mm2-fast: Accelerated version of minimap2; up to 1.8x faster This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository. You can’t perform that action at this time. You signed in with another tab or window. Reload to…
Accepted minimap2 2.24+dfsg-1 (source) into unstable
—–BEGIN PGP SIGNED MESSAGE—– Hash: SHA512 Format: 1.8 Date: Thu, 17 Feb 2022 15:33:57 +0100 Source: minimap2 Architecture: source Version: 2.24+dfsg-1 Distribution: unstable Urgency: medium Maintainer: Debian Med Packaging Team <debian-med-packag…@lists.alioth.debian.org> Changed-By: Andreas Tille <ti…@debian.org> Changes: minimap2 (2.24+dfsg-1) unstable; urgency=medium . * New upstream version Checksums-Sha1: c420faa756a58bcb60927999f3afc21abf91128e 2281 minimap2_2.24+dfsg-1.dsc 8b870031e6b221240cc43b03e117e50a312881ec…
Find Transposon Element insertions using long reads (nanopore), by alignment directly. (minimap2)
find_te_ins is designed to find Transposon Element (TE) insertions using long reads (nanopore), by alignment directly. (minimap2) Install $ git clone github.com/bakerwm/find_te_ins.git $ cd find_te_ins Change the following variables upon your condition: genome_fa and te_fa in line-10 and line-11; $ bash run_pipe.sh run_pipe.sh Prerequisite minimap2 – 2.17-r974-dirty, align long…
[MonashBioinformaticsPlatform/RSeQC] junction_saturation not suit for bam/sam file generated by minimap or pbmm2
because the CIGAR in bam/sam file generated by minimap2 contain “=” , represent right match with reference, and “X”, represent wrong match with reference. while the bam_cigar.py in ./lib/qcmodule/bam_cigar.py only suit for bam/sam generated such as BWA/bowtie, which CIGAR contain only “M” ,represent mis/match. So i modified the bam_cigar.py 77…
Using Minimap2 with FMLRC2
Using Minimap2 with FMLRC2 1 Hello all, I am using FMLRC2 (github.com/HudsonAlpha/rust-fmlrc) to correct PacBio reads with Illumina reads for hybrid genome assembly. Since FMLRC2 only corrects reads (does not do any assembly) another program is needed. In the paper published on FMLRC minimap (now succeeded by minimap2, github.com/lh3/minimap2) was…
[lh3/minimap2] Memory leak when using Python and threads
The program align.py uses mappy to align reads in Python using multiple worker threads. After loading the index the memory usage jumps up quickly to >20Gb and then continues to climb steadily through 40Gb an beyond. This issue was first discovered in bonito and isolated to mappy. The data flow…
Variant calls of published already assembled genomes
Variant calls of published already assembled genomes 0 I have a set of short read sequencing for the 172 KB Epstein-barr virus genome. We successfully called our variants using GATK to a reference genome. A publication linked below from a different population compared variants (also from short read sequencing) to…
Overestimation of number of reads from nanopore data (flagstat)
Same issue as mentioned on the minimap2 tool: github.com/lh3/minimap2/issues/236#issue-361097444 For example nanopore reads aligned to the host transcriptome the flagstat output is: 5953480 + 0 in total (QC-passed reads + QC-failed reads) 2961480 + 0 secondary 22696 + 0 supplementary 0 + 0 duplicates 4195469 + 0 mapped (70.47% :…
Samtools flagstat
Samtools flagstat 1 I aligned my ONT sequencing run with minimap2, subsequently I filtered the file using samtools view -b -F 256 aln_transcriptome_sorted_6.bam -o filtered_aln_transcriptome_6.bam to end up with primary alignments only. When I run samtools flagstat on the filtered file I get the following output: 3502608 + 0 in…
minimap2 – Github Help
38 9 7 minimap2,SneakySnake:snake: is the first and the only pre-alignment filtering algorithm that works efficiently and fast on modern CPU, FPGA, and GPU architectures. It greatly (by more than two orders of magnitude) expedites sequence alignment calculation for both short and long reads. Described in the Bioinformatics (2020) by…
About QUAST
About Short intro Further reading How to cite Brief project history Authors Funding License and Terms of use Short intro QUAST evaluates genome assemblies by computing various metrics, including N50, length for which the collection of all contigs of that length or longer covers at least 50% of assembly length…
A Fast, Memory-Efficient, and Accurate Mechanism to Find Fuzzy Seed Matches
BLEND is a mechanism that can efficiently find fuzzy seed matches between sequences to significantly improve the performance and accuracy while reducing the memory space usage of two important applications: 1) finding overlapping reads and 2) read mapping. Finding fuzzy seed matches enable BLEND to find both 1) exact-matching seeds…
sniffles failed detect SV on minimap2 aligments
When I use ngmlr the sniffles worked. The coverage it more than 90% The code I sent on the github is exactly what it generated, I don’t think there any error Xu Zhang PhD Postdoctoral Associate, Department of Microbiology and Immunology Weill Cornell Medicine 1300 York Avenue, Box 62 New…
#1001716 – src:minimap2: fails to migrate to testing for too long: autopkgtest regression
#1001716 – src:minimap2: fails to migrate to testing for too long: autopkgtest regression – Debian Bug report logs Reported by: Paul Gevers <elbrus@debian.org> Date: Tue, 14 Dec 2021 20:03:02 UTC Severity: serious Tags: bookworm, sid Found in version minimap2/2.17+dfsg-12 Fixed in version minimap2/2.22+dfsg-3 Done: Paul Gevers <elbrus@debian.org> Reply or subscribe…
Towards the biogeography of prokaryotic genes
1. Sunagawa, S. et al. Structure and function of the global ocean microbiome. Science 348, 1261359 (2015). PubMed Google Scholar 2. Zou, Y. et al. 1,520 reference genomes from cultivated human gut bacteria enable functional microbiome analyses. Nat. Biotechnol. 37, 179–185 (2019). CAS PubMed PubMed Central Google Scholar 3. Mohammad,…
Bug#1001716: src:minimap2: fails to migrate to testing for too long: autopkgtest regression
Source: minimap2 Version: 2.17+dfsg-12 Severity: serious Control: close -1 2.22+dfsg-3 Tags: sid bookworm User: release.debian….@packages.debian.org Usertags: out-of-sync Dear maintainer(s), The Release Team considers packages that are out-of-sync between testing and unstable for more than 60 days as having a Release Critical bug in testing [1]. Your package src:minimap2 has been…
Mapping multiples
Mapping multiples 1 Hi, I am coming to you for help. I am doing a mapping on short and long read files with BWA and MINIMAP2 My problem is that, I want to make an if loop that would allow me to choose either BWA if I work with short…
The Biostar Herald for Tuesday, September 14, 2021
The Biostar Herald publishes user submitted links of bioinformatics relevance. It aims to provide a summary of interesting and relevant information you may have missed. You too can submit links here. This edition of the Herald was brought to you by contribution from Istvan Albert, and was edited by Istvan…
How to plot the gap distribution of contigs wrt to reference genome
How to plot the gap distribution of contigs wrt to reference genome 0 I have a contigs file that I generated using Minia and then I have used Minimap2 to map these contigs again to the reference genome. Now I want to plot the gap distribution i.e. gap distance of…
Pacific Biosciences hiring Bioinformatics Software Engineer in United States
PacBio’s Application Software Group focuses on building solid, strategic value around our core data type – highly accurate, long read sequencing – by producing innovative software that unlocks genomics in ways never seen before. We’re growing an interdisciplinary team of bioinformatic experts to tackle some of the most interesting problems…
The Biostar Herald for Friday, September 03, 2021
The Biostar Herald publishes user submitted links of bioinformatics relevance. It aims to provide a summary of interesting and relevant information you may have missed. You too can submit links here. This edition of the Herald was brought to you by contribution from zx8754, Istvan Albert, and was edited by…
Global phylogenomic analyses of Mycobacterium abscessus provide context for non cystic fibrosis infections and the evolution of antibiotic resistance
1. Lee, M.-R. et al. Mycobacterium abscessus complex infections in humans. Emerg. Infect. Dis. 21, 1638–1646 (2015). CAS PubMed PubMed Central Google Scholar 2. Prince, D. S. et al. Infection with Mycobacterium avium complex in patients without predisposing conditions. N. Engl. J. Med. 321, 863–868 (1989). CAS PubMed Article Google…
unable to find chromosome in SAM header
featureCounts: unable to find chromosome in SAM header 0 I am using featureCounts to try and create a count table for some RNA-Seq data I collected using an Oxford Nanopore platform. I have .sam files aligned with minimap2, and am running the following command to try to get a count…
hisat2 compatibility for long read
hisat2 compatibility for long read 0 Hi, I am trying to align PacBio transcriptome reads against the genome to count the gene number. For pair end read i used the following workflow: # convert gff to gtf /home/software/cufflinks-2.2.1/gffread xxx.gff -T -o xxx.gtf # build index /home/software/hisat2-2.2.1/hisat2_extract_exons.py xxx.gtf > xxx.exon /home/software/hisat2-2.2.1/hisat2_extract_splice_sites.py…
De novo genome assembly
Forum:De novo genome assembly 0 Howdy, I have recently been tasked as the ‘bioinformatics guy” in my lab and am having trouble with a de novo genome assembly of Mother of a Thousand. I am working with Nanopore reads and have ran my reads through CANU. I have all of…