Tag: SNAKEMAKE

python – Missing input files after defining them in function

I am trying to do QC on RNAseq data that is tarballed. I am using Snakemake as a workflow manager and am aware that Snakemake does not like one-to-many rules. I defining a checkpoint would fix the problem but when I run the script I get this this error message…

Continue Reading python – Missing input files after defining them in function

Anyone know any clever snakemake/SLURM tricks to run a big analysis with limited storage?

Anyone know any clever snakemake/SLURM tricks to run a big analysis with limited storage? 1 I am using a SLURM HPC to run jobs and have ran into issues with storage. I have 3TB storage, and want to run over 1000 publicly available RNAseq data through my pipeline, which includes…

Continue Reading Anyone know any clever snakemake/SLURM tricks to run a big analysis with limited storage?

snakemake truncating shell codes

snakemake truncating shell codes 0 I’m trying to change the chromosome number notation from [0-9XY] to Chr[0-9XY] using the samtools reheader in the shell command of the snakemake. rule rename: input: os.path.join(config[“input”], “{sample}.bam”), output: os.path.join(config[“output”], “new_sample/{sample}_chr.bam”) log: os.path.join(config[“log”], “samtools/{sample}”) shell: “samtools view -H {input} | sed -e ‘s/SN:([0-9XY]*)/SN:chr1/’ -e ‘s/SN:MT/SN:chrM/’…

Continue Reading snakemake truncating shell codes

[moiexpositoalonsolab/grenepipe] freebayes causes early error about number of threads

Hi Lucas, got a weird one for you. If I change the caller from hapotypecaller to freebayes, I get the error below. It’s doubly strange because it seems to occur well before freebayes would be used in the pipeline. [Sat Dec 11 11:13:02 2021] rule samtools_stats: input: dedup/111D03-1.bam output: qc/samtools-stats/111D03-1.txt…

Continue Reading [moiexpositoalonsolab/grenepipe] freebayes causes early error about number of threads

Scripts for BGC analysis in large MAGs and results of their application to soil metagenomes within Chernevaya Taiga RSF-funded project

This repository include scripts for analysis of biosynthetic gene clusters (BGCs) in large metagenome assemblies. All scripts were created within the Chernevaya Taiga project funded by the Russian Science Foundation (grant 19-16-00049). The repository also contains results of the scripts application to four hybrid (illumina + ONT) assemblies of various…

Continue Reading Scripts for BGC analysis in large MAGs and results of their application to soil metagenomes within Chernevaya Taiga RSF-funded project

Aro Biotherapeutics hiring Investigator, Genetics & Bioinformatics in Philadelphia, Pennsylvania, United States

About Aro BioTx Join the team at Aro Biotherapeutics creating breakthrough biotherapeutics based on Centyrin oligonucleotide conjugates. Centyrins are small protein domains based on the fibronectin domains of human Tenascin C that combine the affinity and specificity properties of antibodies with the stability and tissue penetration properties of small molecules….

Continue Reading Aro Biotherapeutics hiring Investigator, Genetics & Bioinformatics in Philadelphia, Pennsylvania, United States

iCOMIC: a graphical interface-driven bioinformatics pipeline for analyzing cancer omics data

Abstract Despite the tremendous increase in omics data generated by modern sequencing technologies, their analysis can be tricky and often requires substantial expertise in bioinformatics. To address this concern, we have developed a user-friendly pipeline to analyze (cancer) genomic data that takes in raw sequencing data (FASTQ format) as input…

Continue Reading iCOMIC: a graphical interface-driven bioinformatics pipeline for analyzing cancer omics data

Germline variant calling pipeline using Snakemake

Tool:Germline variant calling pipeline using Snakemake 0 Hello everybody, as part of a project, I had to write an in-house pipeline to call germline mutations for ~100 patients. For that I used Snakemake and GATKs best practice guidelines. Steps that take a long time (HaplotypeCaller or BaseQualityScoreRecalibration) are automatically parallelized…

Continue Reading Germline variant calling pipeline using Snakemake

python – snakemake multiple parameters for multiple input and single output in snakemake. ConbineGVCFs gatk problem

I have written a rule for CombineGVCFs in gatk4. The rule is as follow all_gvcf = get_all_gvcf_list() rule cohort: input: all_gvcf_list = all_gvcf, ref=”/data/refgenome/hg38.fa”, interval_list = prefix+”/bedfiles/hg38.interval_list”, params: extra = “–variant”, output: prefix+”/vcf/cohort.g.vcf”, shell: “gatk CombineGVCFs -R {input.ref} {params.extra} {input.all_gvcf_list} -O {output} –tmp-dir=/data/tmp -L {input.interval_list}” all_gvcf is the dataset for…

Continue Reading python – snakemake multiple parameters for multiple input and single output in snakemake. ConbineGVCFs gatk problem

BIOINFORMATICIAN II

Apply at: uab.taleo.net/careersection/ext/jobdetail.ftl?job=T190159&tz=GMT-05%3A00&tzname=America%2FChicago A Bioinformatician II is needed to participate in research activities involved in the analysis of Next Generation Sequencing such as data derived from genomics and transcriptomics (bulk and single-cell sequencing) to the UAB Biological Data Science institutional core (www.uab.edu/cores/ircp/bds & Twitter page: twitter.com/UAB_BDS) . This position will…

Continue Reading BIOINFORMATICIAN II

how to generate a file when snakemake pipeline failed

how to generate a file when snakemake pipeline failed 0 Hi there, I am working on a pipeline to make WES analyses. I am preparing that pipeline via snakemake. I have a problem about the input fasq file. Sometimes, users enter fastq files that are 8-9 mb. In this case,…

Continue Reading how to generate a file when snakemake pipeline failed

Clinical Bioinformatics Scientist/Engineer Job in Massachusetts (MA), Career, Full Time Jobs in Novartis Pharmaceuticals

6500 – The number of associates in the Novartis Institutes for BioMedical Research (NIBR). This division is the innovation engine of Novartis, focusing on powerful new technologies that have the potential to help produce therapeutic breakthroughs for patients. We are seeking a bioinformatics scientist to coordinate the processing and…

Continue Reading Clinical Bioinformatics Scientist/Engineer Job in Massachusetts (MA), Career, Full Time Jobs in Novartis Pharmaceuticals

Pacific Biosciences hiring Bioinformatics Software Engineer in United States

PacBio’s Application Software Group focuses on building solid, strategic value around our core data type – highly accurate, long read sequencing – by producing innovative software that unlocks genomics in ways never seen before. We’re growing an interdisciplinary team of bioinformatic experts to tackle some of the most interesting problems…

Continue Reading Pacific Biosciences hiring Bioinformatics Software Engineer in United States

Research Scientist Bioinformatics at Exscientia

We are looking to hire an experienced bioinformatician who specializes in the analysis of human NGS data. The Research Scientist Bioinformatics will lead the development and expansion of our in-house NGS capabilities together with data managers and software developers, while also carrying out project-specific analyses. Exscientia GmbH is a company…

Continue Reading Research Scientist Bioinformatics at Exscientia

Solvuu hiring Bioinformatics Engineer in United States

Summary At Solvuu, we are building technology to revolutionize bioinformatics and data science. We are seeking an accomplished, self-motivated and ambitious bioinformatics engineer with a strong track record in developing, executing, and maintaining bioinformatics pipelines on AWS for biotech R&D. The successful candidate will have the opportunity to drive and…

Continue Reading Solvuu hiring Bioinformatics Engineer in United States

Bioinformatics Engineer, Molecular Diagnostics Labs, UCLA Health

Job:Bioinformatics Engineer, Molecular Diagnostics Labs, UCLA Health 0 You will be part of a team of molecular biologists, medical geneticists, and bioinformaticians working together to analyze new and archival clinical cases using high throughput sequencing. As one of a newer team of software engineers, you will have the unique opportunity…

Continue Reading Bioinformatics Engineer, Molecular Diagnostics Labs, UCLA Health

Snakemake-Aligment using BWA-MEM2

Hello I have started using snakemake 6.5.2 to align fastq files with reference file. I have pasted the error below in this question. How to allocate memory in the snakefile and read the header from samfile, ‘-‘. This is the snakefile (wrapper for running alignment): rule bwa_mem2_mem: input: reads=[“/scicore/home/cichon/GROUP/test_workflow/samples/{sample}.1.fq”, “/scicore/home/cichon/GROUP/test_workflow/samples/{sample}.2.fq”]…

Continue Reading Snakemake-Aligment using BWA-MEM2

Head of Bioinformatics & Data Mining at HUMMINGBIRD BIOSCIENCE PTE. LTD. Singapore

About Hummingbird Bioscience Hummingbird Bioscience is an innovative clinical-stage biotech company focused on developing precision therapies against hard-to-drug targets to improve treatment outcomes. We harness the latest advances in systems biology and data science to better understand and solve the underlying causes of disease and guide development of our therapeutics. Enabled…

Continue Reading Head of Bioinformatics & Data Mining at HUMMINGBIRD BIOSCIENCE PTE. LTD. Singapore

Bioinformatics Software Engineer, Cancer Genomics Research Laboratory (req2036) job with Frederick National Laboratory

PROGRAM DESCRIPTION We are seeking an enthusiastic, creative, and collaborative bioinformatics software engineer to support pipeline development and analysis for our broad portfolio of genomic studies. If you have experience designing and deploying robust, reproducible, production-quality pipelines, then come join our talented team of bioinformaticians dedicated to understanding the genetics…

Continue Reading Bioinformatics Software Engineer, Cancer Genomics Research Laboratory (req2036) job with Frederick National Laboratory