Tag: ENA

Talentify.io hiring 100% Remote- Bioinformatics Analyst ($50.00 – $55.00 / hour) in United States

Talentify helps candidates around the world to discover and stay focused on the jobs they want until they can complete a full application in the hiring company career page/ATS. Seeking a Bioinformatics Analyst (100% remote). Description: Support of computational research priorities associated with oncology discovery programs. This variably entails querying…

How to deal with SRA file without .SRA

Good afternoon, I have encountered a problem with an SRA file and I hope you can help me. I downloaded the SRA file from this website: www.ebi.ac.uk/ena/browser/view/PRJNA834732. The file came without the .SRA extension. I used fastq-dump to extract…

SPECTRAFORCE hiring Bioinformatics Analyst III in North Chicago, Illinois, United States

Title: Bioinformatics Analyst III. Duration: 12 months. Location/Site: 100% remote. Pay rate: $51/hr – $56/hr on W2. Job Description, Services Overview: Services include support of computational research priorities associated with oncology discovery programs. This variably entails querying public cancer genomics resources for mutation/expression distributions, assessing normal expression, interpreting germline associations…

Job Opening – Bioinformatics Analyst III – North Chicago, IL

job summary: As the largest staffing and recruitment agency in the world, we can commit to finding you the perfect role that gives you the opportunity to learn and grow in the life sciences arena. Utilizing a recruiter for your job search gives you access to a large network of…

Bioinformatics Analyst – Hiring Urgently at Synectics Inc in North Chicago, IL

We are looking to hire a capable Bioinformatics Analyst to join our passionate team at Synectics Inc in North Chicago, IL. Growing your career as a Full Time Bioinformatics Analyst is a terrific opportunity to develop relevant skills. If you are strong in planning, problem-solving and have the right experience for the…

Rangam hiring Bioinformatics Analyst (RNA-Seq/R Coding/Linux/RNA-Seq) (Phd) in United States

Remote role. Title: Bioinformatics Analyst. We have an exciting contract opportunity for a Bioinformatics Analyst with extensive experience in designing and implementing bioinformatics methods as well as analysis and interpretation of various types of omics data to solve biological and clinical problems. The candidate should be familiar with developing…

Bioinformatics Analyst (PhD) – Rangam

Remote role. Title: Bioinformatics Analyst. We have an exciting contract opportunity for a Bioinformatics Analyst with extensive experience in designing and implementing bioinformatics methods as well as analysis and interpretation of various types of omics data to solve biological and clinical problems. The candidate should be familiar with developing…

US Tech Solutions hiring Bioinformatics Analyst III in United States

Title: Bioinformatics Analyst. Typically between 9 AM and 5 PM Central Time. Remote. We have an exciting contract opportunity for a Bioinformatics Analyst with extensive experience in designing and implementing bioinformatics methods as well as analysis and interpretation of various types of omics data to solve biological and clinical problems. The candidate should…

Curated and harmonized gut microbiome 16S rRNA amplicon data from dietary fiber intervention studies in humans

Carlson, J. L., Erickson, J. M., Lloyd, B. B. & Slavin, J. L. Health effects and sources of prebiotic dietary fiber. Curr. Dev. Nutr. 2, nzy005 (2018). Deehan, E. C. et al. Precision microbiome modulation with discrete dietary fiber structures directs short-chain fatty acid…

Genome sequencing and multifaceted taxonomic analysis of novel strains of violacein-producing bacteria and non-violacein-producing close relatives

Abstract Violacein is a water-insoluble violet pigment produced by various Gram-negative bacteria. The compound and the bacteria that produce it have been gaining attention due to the antimicrobial and proposed antitumour properties of violacein and the possibility that strains producing it may have broad industrial uses. Bacteria that produce violacein…

The impact of rare protein coding genetic variation on adult cognitive function

The UKB is approved by the North West Multi-centre Research Ethics Committee (www.ukbiobank.ac.uk/learn-more-about-uk-biobank/about-us/ethics). The current study was conducted under UKB application no. 26041. The data in the UKB were collected after written informed consent was obtained from all participants. The Human Research Committee of the MGB approved the Biobank research…

MCQ on Nucleotide Databases – Biology MCQ

Nucleotide databases are repositories that store and provide access to nucleotide sequence data. These databases contain a vast collection of DNA and RNA sequences from various organisms, including viruses, bacteria, plants, animals, and humans. Nucleotide databases play a crucial role in genomics, molecular biology, and bioinformatics research by providing a…

GenBank Overview

Article URL: www.ncbi.nlm.nih.gov/genbank/ Comments URL: news.ycombinator.com/item?id=36038722 What is GenBank? GenBank® is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Nucleic Acids Research, 2013 Jan;41(D1):D36-42). GenBank is part of the International Nucleotide Sequence Database Collaboration, which…

Sample GenBank Record / Visual abstracts made easy with Mind the Graph

This page presents an annotated sample GenBank record (accession number U49845) in its GenBank Flat File format. You can check the corresponding live record for U49845, and see examples of other records that show a range of biological features. LOCUS SCU49845 5028 bp DNA PLN 21-JUN-1999 DEFINITION Saccharomyces cerevisiae TCP1-beta gene,…

In-depth Temporal Transcriptome Profiling of Monkeypox and Host Cells using Nanopore Sequencing

Figure 1 shows the detailed workflow of the study. Fig. 1 General overview of the study. Briefly, MPXV was isolated from a skin lesion and then was used to infect CV-1 cells. After the designated infection times, total RNA was isolated and sequenced using direct cDNA sequencing protocol on ONT’s MinION platform….

Issue about generating EMBL Flat file for ENA submission

Hello all! I am trying to generate an EMBL flat file to submit an annotated assembly to ENA. I am using EMBLmyGFF3 to generate the flat file from the whole genome FASTA file and the GFF3 file. I am getting…

Whole genome sequencing of Ethiopian Brucella abortus isolates expands the known diversity of an early branching sub-Saharan African lineage

1. Introduction Brucellosis is a zoonotic infection caused by bacteria of the genus Brucella, which affects domestic livestock and a wide range of wild mammals (Ducrotoy et al., 2017). The disease is amongst the most common zoonotic infections globally, with an estimated 500,000 human cases annually (Pappas et al., 2006),…

Can the controversial COVID genome database survive?

During the COVID-19 pandemic, one online platform emerged as the main repository for viral genome data. GISAID, an initiative launched in 2008 to improve the global sharing of influenza data, earned the trust of scientists by ensuring that they would be credited for the data they generated. It now hosts…

The therapeutic potential of neurofibromin signaling pathways and binding partners

Bergqvist, C. et al. Neurofibromatosis 1 French national guidelines based on an extensive literature review since 1966. Orphanet J. Rare Dis. 15, 1–23 (2020). Martin, G. A. et al. The GAP-related domain of the neurofibromatosis type 1 gene product interacts with ras p21. Cell 63, 843–849 (1990)….

Issue With CRAM -> BAM -> FASTQ Conversion

Please help! I am trying to obtain FASTQ files from the GDSC; all we have in the lab are CRAM files. Unfortunately, the reference genome does not seem to exist when pulled from an online source. I have attempted to use the…

EMBL-EBI's Protein Data Bank in Europe

curl -s "https://www.ebi.ac.uk/europepmc/webservices/rest/MED/33024307/datalinks?format=json" | jq '.'
{ "version": "6.8", "hitCount": 9, "request": { "id": "33024307", "source": "MED" }, "dataLinkList": { "Category": [ { "Name": "Nucleotide Sequences", "CategoryLinkCount": 5, "Section": [ { "ObtainedBy": "tm_accession", "Tags": [ "supporting_data" ], "SectionLinkCount": 5, "Linklist": { "Link": [ { "ObtainedBy": "tm_accession", "PublicationDate": "04-11-2022", "LinkProvider": { "Name":…
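
The response in this excerpt nests its links under dataLinkList, then Category. As a minimal sketch, assuming the JSON has already been fetched and parsed into a Python dict, and that each category carries the "Name" and "CategoryLinkCount" keys visible in the excerpt, the per-category counts could be pulled out as follows. The function name summarize_datalinks and the trimmed sample dict are hypothetical, introduced only for illustration.

```python
from typing import Any


def summarize_datalinks(response: dict[str, Any]) -> dict[str, int]:
    """Map each data-link category name to its link count.

    Assumes the layout visible in the excerpt: response["dataLinkList"]["Category"]
    is a list of objects with "Name" and "CategoryLinkCount" keys. A response
    without those keys yields an empty result instead of raising.
    """
    categories = response.get("dataLinkList", {}).get("Category", [])
    return {
        cat["Name"]: int(cat.get("CategoryLinkCount", 0))
        for cat in categories
        if "Name" in cat
    }


# Trimmed stand-in for the truncated response (hypothetical beyond the
# fields that are visible in the excerpt).
sample = {
    "version": "6.8",
    "hitCount": 9,
    "request": {"id": "33024307", "source": "MED"},
    "dataLinkList": {
        "Category": [{"Name": "Nucleotide Sequences", "CategoryLinkCount": 5}]
    },
}
summary = summarize_datalinks(sample)
print(summary)
```

Returning a plain dict keeps the sketch independent of any HTTP client; the parsed output of the curl call can be passed in directly.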

Creating a reference panel of equus caballus sequences

Hi all, I have a quick question about getting started on my master's project. I am currently trying to create a reference panel of WGS data for the species Equus caballus (horse) as part of my research…

First-of-its-kind stem cell study sheds light on Klinefelter syndrome

The impact of X overdosage on the global transcriptomes of Saudi and ENA KS-iPSCs. A) Venn diagram showing the DEGs shared in the contrast 47,XXY Vs. 46,XY in iPSCs generated from ENA and Saudi KS patients. B) Gene Ontology analysis on common DEGs using the GO enriched for Biological Processes…

Dealing with “Too many samples were discarded” – StrainPhlAn

I’m attempting to do a strainphlan analysis of Blautia wexlerae, but nothing I’m doing is working. I think the problem might be the reference genomes that I’m using, but I’m using the 2 primary genomes from NCBI. Is there another place to get correctly formatted references? Error Here’s the primary…

Metagenomic dataset from Swedish urban lakes

We release metagenomic data of seven urban, eutrophic Swedish lakes that have been extensively studied and characterized in terms of biogeochemistry. Here we provide the supplementary tables and the full set of metagenome-assembled genomes of 17 metagenomic samples. We have 10 metagenomic samples from Lake Mälaren, which is the…

Inactivation of interleukin-30 in colon cancer stem cells via CRISPR/Cas9 genome editing inhibits their oncogenicity and improves host survival

Introduction Colorectal cancer (CRC) is a leading cause of cancer-related death1 and its mortality rate is expected to rise worldwide, due to population growth and aging, thus entailing a global public health challenge. CRC mortality is mainly due to therapy resistance and metastasis, which are driven by a small population…

Obtain number of base pairs in a genome

Hi! This may be a stupid question, since I'm not connected to bioinformatics in any way. I'm interested in how I can obtain the number of base pairs in my genome sample. I'm trying to reproduce the experiment that was made…
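
One way to answer this kind of question, sketched here under the assumption that the genome is available as a FASTA file (the excerpt does not say which format the sample is in), is to sum the lengths of the sequence lines per record. The function name count_bases and the toy records are hypothetical.

```python
import io
from typing import TextIO


def count_bases(fasta: TextIO) -> dict[str, int]:
    """Return the sequence length (in bases) of every record in a FASTA stream."""
    lengths: dict[str, int] = {}
    name = None
    for line in fasta:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Header line: use the first whitespace-separated token as the name,
            # falling back to a positional name if the header is empty.
            parts = line[1:].split()
            name = parts[0] if parts else f"record_{len(lengths) + 1}"
            lengths[name] = 0
        elif name is not None:
            # Sequence line: every character counts, including N and other
            # ambiguity codes.
            lengths[name] += len(line)
    return lengths


# Toy input standing in for a real genome file.
demo = io.StringIO(">chr1 toy record\nACGTACGT\nACG\n>chr2\nNNNNAC\n")
per_record = count_bases(demo)
total = sum(per_record.values())
print(per_record, total)
```

For a file on disk, the same function can be called on an open text handle, for example `with open("genome.fa") as fh: count_bases(fh)`, where genome.fa is a placeholder name. Note that this counts every character on a sequence line, so unknown bases (N) are included in the total.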

10x 3′ library creates R1 and R2 fastq files with the same read length

Let me show you an example: trace.ncbi.nlm.nih.gov/Traces/index.html?view=run_browser&acc=SRR16093385&display=metadata This data contains two reads, R1 and R2. The read lengths of R1 and R2 are the same, 150 bp. However, this experiment was performed following the 10x 3′ library protocol. The methods section describes it as follows: The scRNA-seq libraries were generated using the…

Open data for biodiversity: an EMBL-EBI 2022 round-up

Ensuring open data from landmark biodiversity projects and smaller genome sequencing initiatives are readily-available to the global scientific community Genomic data for biodiversity research is produced and shared worldwide. Credit: Karen Arnott/EMBL-EBI As major biodiversity projects increasingly use genomic sequencing to catalogue and understand species, EMBL-EBI is ensuring that the…

Error while converting GFF file to GTF using AGAT

Hi, I am trying to convert a GFF file to a GTF file, which I want to use for STAR. I tried AGAT (latest version) to convert it, but it gives me a series of errors (mainly two types). I have attached the error…

Brain expression quantitative trait locus and network analyses reveal downstream effects and putative drivers for brain-related diseases

Harmonizing datasets for eQTL and co-regulation analysis We combined 14 eQTL datasets into the ‘MetaBrain’ resource to maximize statistical power to detect eQTLs and create a brain-specific gene co-regulation network (Fig. 2, Supplementary Figs. 1–7 and Supplementary Table 1). Previous to quality control (QC), MetaBrain includes 7,604 RNA-seq samples and…

10x BAM to count matrix in Rsubread and Rsamtools- cellCounts vs. featureCounts

Hi, I am new to single-cell analysis but have some experience with NGS data output and manipulation. I have a set of .bam files from the ENA (project ID PRJEB36998) that I assume were the product of the Cell Ranger pipeline. Unfortunately, I have no access to any other output…

A 10-year microbiological study of Pseudomonas aeruginosa strains revealed the circulation of populations resistant to both carbapenems and quaternary ammonium compounds

P. aeruginosa bacterial strains Reference strains Four well-described and genome-available reference strains were used in the present study, ATCC27853 and ATCC15442, obtained from the American Type Culture Collection (ATCC), and PAO1 and PA14, from the collection of Institut Pasteur (Paris, France). Strain ATCC15442 is recommended for disinfectant susceptibility testing44, strain…

Alignment in bioinformatics

My project concerns the gene expression profile of Mycobacterium tuberculosis in the lung. I chose my samples from ENA (www.ebi.ac.uk/ena/browser/view/PRJEB19976), downloaded the reference genome from NCBI (www.ncbi.nlm.nih.gov/genome/?term=h37rv), and created an index myself using HISAT2. After that I started the process of…

Institut Pasteur Project Aims to Index Global Sequencing Data

NEW YORK – The Institut Pasteur in Paris has won €2 million ($2.1 million) in EU funding to create a “search engine for DNA sequencing data,” indexing next-generation sequencing data available in the Sequence Read Archive in order to make it searchable and more accessible. The five-year IndexThePlanet project, led…

Bioinformatics Analyst – Tellus Solutions

Tellus Solutions is in partnership with a committed biopharmaceutical company in North Chicago focused on providing innovative therapies. Your technical expertise as a Bioinformatics analyst in the area of using R to do basic data analysis (processing, plotting), RNA-Seq alignment experience will contribute to our client’s innovative therapies which will…

genbank sequence format

This document is an overview of the Entrez databases, with general information on… If you are not sure that the “Save” option in your program will do this for you, use “Save As”; in Excel, select “Save As” from the File menu. …optimizations to reduce memory…

Cogent Biosciences to Showcase Precision Therapy Pipeline

WALTHAM, Mass. and BOULDER, Colo., Oct. 26, 2022 (GLOBE NEWSWIRE) — Cogent Biosciences, Inc. (Nasdaq: COGT), a biotechnology company focused on developing precision therapies for genetically defined diseases, today announced that it will be presenting two preclinical posters at the EORTC-NCI-AACR (“ENA”) annual meeting to be held October 26-28, 2022. Presentations…

Bioinformatic Analyst job at Tellus Solutions in Remote

Job description Tellus Solutions is in partnership with a committed biopharmaceutical company in North Chicago focused on providing innovative therapies. Your technical expertise as a Bioinformatics analyst in the area of using R to do basic data analysis (processing, plotting), RNA-Seq alignment experience will contribute to our client’s innovative therapies…

Genomic architecture of adaptive radiation and hybridization in Alpine whitefish

Sampling the radiation To understand the phylogenetic relationships between Alpine whitefish, we carried out whole-genome resequencing on 96 previously collected whitefish (with associated phenotypic measurements including standard length and gill-raker counts; collected in accordance with permits issued by the cantons of Zurich (ZH128/15), Bern (BE68/15), and Lucerne (LU04/14); these fish…

Ubuntu Manpage: bamfillquery – fill query sequences into BAM files

Provided by: biobambam2_2.0.179+ds-1_amd64
NAME: bamfillquery – fill query sequences into BAM files
SYNOPSIS: bamfillquery [options] <in.bam queries.fasta >out.bam
DESCRIPTION: bamfillquery reads a SAM/BAM/CRAM file and a FastA file, copies the sequences found in the FastA file into the query sequence field of the SAM/BAM/CRAM file and writes the resulting data…

Strange Per base sequence content of fastqc

Hi, all! I downloaded the fastq.gz files of GSE162708 from ENA, which has only 2 files for each sample (scRNA-seq usually has 3 files: I1, R1 and R2). Then I ran fastp as follows. I got the QC report, but I can’t understand why the Per base sequence content of…

Toll-like receptor 2/4 in Chinese patients with sepsis

Introduction Sepsis is a life-threatening organ dysfunction that results from an exaggerated host immune response to disseminated infection.1 Despite improvements in treatment strategies, sepsis remains a leading cause of death in critically ill patients worldwide.2 Low platelet number, known as thrombocytopenia, is common in infectious diseases (also sometimes referred to…

Scientists Develop $10 Per Genome Approach for Large-Scale Bacterial Sequencing

A worldwide consortium of scientists, led by the Earlham Institute and the University of Liverpool, has developed an efficient, inexpensive approach to large-scale bacterial genome sequencing that could equip researchers in low- and middle-income countries (LMICs) with cheap and accessible methods for sequencing large collections of bacterial pathogens—at a cost…

Large-scale genome-wide study reveals climate adaptive variability in a cosmopolitan pest

Genomic data The foundational resource for this study was a dataset of 40,107,925 nuclear SNPs sequenced from a worldwide sample of 532 DBM individuals collected in 114 different sites based on our previous project15. DNA was extracted from each of the 532 individuals using DNeasy Blood and Tissue Kit (Qiagen,…

miRNAs and mRNAs in intestinal ischemia-reperfusion injury

Introduction Intestinal ischemia-reperfusion (II/R) injury is a severe clinical complication common in the Intensive Care Unit (ICU). It is associated with high morbidity and mortality.1 Usually, this problem is followed by various causes, including sepsis, shock, trauma, and so on.2 Intestinal ischemia-reperfusion injury destroys intestinal tissue and impairs the function…

Improving processing and quality of DNA data for biodiversity research

These datasets reuse the globally comprehensive DNA sequence data that ENA and its partners, the National Center for Biotechnology Information (NCBI) and the DNA Data Bank of Japan, maintain in the International Nucleotide Sequence Database Collaboration (INSDC). EMBL-EBI maintains ENA, which supplied the first DNA-derived dataset shared through GBIF in…

Submit sequence data to NCBI

Data provision and standards. GEO sequence submission procedures are designed to encourage provision of MINSEQE elements: Thorough descriptions of the biological samples under investigation, and procedures to which they were subjected. Thorough descriptions of the protocols used to generate and process the data. Request updates to accessioned records per the…

The faces of three ancient Egyptians appeared thanks to the remains of DNA more than 2,000 years ago

The faces of three ancient Egyptians have been brought back to life thanks to DNA. A group of scientists recreated the faces of three ancient Egyptian men using DNA that is more than 2,000 years old. According to the magazine Newsweek, it is believed that this is the first time that modern techniques…

Faces of Three Ancient Egyptians Brought to Life Using 2,000-Year-Old DNA

The faces of three ancient Egyptian men have been brought to life by scientists, using DNA that is more than 2,000 years old. This is thought to be the first time modern techniques have been used on human DNA of this age, with the trio of samples estimated to be…

Parabon Recreates Egyptian Mummy Faces From Ancient DNA

New Snapshot methods for low-coverage sequencing bring hidden data to life. RESTON, Va., Sept. 15, 2021 (GLOBE NEWSWIRE) — At the 32nd International Symposium on Human Identification (ISHI), being held this week in Orlando, Florida, Parabon NanoLabs will unveil for the…

I downloaded fastq files from a repository and tried to run fastqc, how can the average sequence length be only 8 bp?

I downloaded sequencing files from 2 patients from here (www.ebi.ac.uk/ena/browser/view/PRJNA588461?show=reads). There is one fastq file each for the forward (1) and reverse (2) reads. I wanted to look…
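
A quick way to double-check a surprising average like this, independent of FastQC, is to tabulate read lengths directly from the file. The following is a minimal sketch, assuming strict 4-line FASTQ records (no wrapped sequence lines); the function names read_length_histogram and mean_length and the toy reads are hypothetical.

```python
import io
from collections import Counter
from typing import TextIO


def read_length_histogram(fastq: TextIO) -> Counter:
    """Count reads per sequence length, assuming strict 4-line FASTQ records."""
    histogram: Counter = Counter()
    for index, line in enumerate(fastq):
        if index % 4 == 1:  # the second line of each record holds the bases
            histogram[len(line.strip())] += 1
    return histogram


def mean_length(histogram: Counter) -> float:
    """Average read length over all reads in the histogram (0.0 if empty)."""
    reads = sum(histogram.values())
    if reads == 0:
        return 0.0
    return sum(length * count for length, count in histogram.items()) / reads


# Two toy reads standing in for a real file.
demo = io.StringIO("@r1\nACGTACGT\n+\nIIIIIIII\n@r2\nACGTACGTAC\n+\nIIIIIIIIII\n")
hist = read_length_histogram(demo)
average = mean_length(hist)
print(dict(hist), average)
```

For a compressed download, the handle can come from `gzip.open(path, "rt")` in the standard library. Looking at the full histogram rather than only the mean shows whether every read is short or whether a few short reads are pulling the average down.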

SRA/ENA library layout is inconsistent with the data source

Project number: PRJNA505380. Example run accession: SRR8244780. Issue: inconsistency between the library layout of the run and the data source. According to the library layout labeled in both ENA and SRA, the runs in BioProject PRJNA505380 should be paired-end read data, but some of them have only a single fastq and without…

Sequence (annotation) databases in 2021

Hi everyone, I know there are several threads on this topic already (or tangentially related to it). For example: But these threads are really old now. Things have probably changed quite significantly in the meantime. So I would like to start a…
