Tag: geneID

How to convert transcript-relative coordinates to genomic coordinates?

How to convert transcript-relative coordinates to genomic coordinates? 0 I have queried using Entrez Utilities (efetch: www.ncbi.nlm.nih.gov/books/NBK25499/) and obtained annotations for transcripts like the following: >Feature ref|NM_152486.3| 1 2557 gene gene SAMD11 gene_syn MRS gene_desc sterile alpha motif domain containing 11 db_xref GeneID:148398 db_xref HGNC:HGNC:28706 db_xref MIM:616765 How/what database should…

Continue Reading How to convert transcript-relative coordinates to genomic coordinates?

KEGG T00007: b1542

Entry b4323             CDS       T00007                                  Symbol uxuB Name (RefSeq) D-mannonate oxidoreductase   KO K00040   fructuronate reductase [EC:1.1.1.57] Organism eco  Escherichia coli K-12 MG1655 Pathway eco00040   Pentose and glucuronate interconversions eco01100   Metabolic pathways Module eco_M00061   D-Glucuronate degradation, D-glucuronate => pyruvate + D-glyceraldehyde 3P Brite KEGG Orthology (KO) [BR:eco00001] 09100 Metabolism  09101 Carbohydrate metabolism   00040 Pentose and glucuronate interconversions    b4323 (uxuB)Enzymes…

Continue Reading KEGG T00007: b1542

rna seq – How does DESeq2 “collapseReplicates” work on read counts data?

Comparing read counts from an RNA-seq experiment for a couple select genes before and after using DESeq2’s collapseReplicates function yields interesting results: Before: Geneid foo1.1 foo1.2 foo2.1 foo2.2 foo3.1 foo3.2 bar1.1 bar1.2 bar2.1 bar2.2 bar3.1 bar3.2 baz1.1 baz1.2 baz2.1 baz2.2 baz3.1 baz3.2 WASH7P 6 5 0 2 7 3 1…

Continue Reading rna seq – How does DESeq2 “collapseReplicates” work on read counts data?

KEGG T01005: 453503

Entry 453503            CDS       T01005                                  Symbol SNX1 Name (RefSeq) sorting nexin-1 isoform X3   KO K17917   sorting nexin-1/2 Organism ptr  Pan troglodytes (chimpanzee) Pathway ptr04144   Endocytosis Brite KEGG Orthology (KO) [BR:ptr00001] 09140 Cellular Processes  09141 Transport and catabolism   04144 Endocytosis    453503 (SNX1) 09180 Brite Hierarchies  09182 Protein families: genetic information processing   04131 Membrane trafficking [BR:ptr04131]    453503 (SNX1)  09183 Protein families: signaling and cellular processes   04990…

Continue Reading KEGG T01005: 453503

KINNEY_DNMT1_METHYLATION_TARGETS

Standard name KINNEY_DNMT1_METHYLATION_TARGETS Systematic name M2508 Brief description Hypomethylated genes in prostate tissue from mice carrying hypomorphic alleles of DNMT1 [GeneID=1786]. Full description or abstract Previous studies have shown that tumor progression in the transgenic adenocarcinoma of mouse prostate (TRAMP) model is characterized by global DNA hypomethylation initiated during early-stage…

Continue Reading KINNEY_DNMT1_METHYLATION_TARGETS

Combining RNA-Seq read counts from 2 lanes of the same sample (.txt file)

Hi, I have a question on combining read counts from 2 lanes of the same sample. I have a very large RNA-Seq dataset downloaded from the NCBI GEO. The data files are in the *.txt format and each sample with 2 lanes obtained from featureCounts. I would like to perform…

Continue Reading Combining RNA-Seq read counts from 2 lanes of the same sample (.txt file)

Running htseq-count to “grab” long non coding gene_id names

Running htseq-count to “grab” long non coding gene_id names 0 hi all, new to bioinformatics. so bare with me.. I am trying find long non coding RNA from RNA-seq data. As i checked the human gtf file there are 2 different types of long non coding RNA, “lnc_RNA” and “lncRNA”,…

Continue Reading Running htseq-count to “grab” long non coding gene_id names

number of GO terms in results

clusterProfiler: number of GO terms in results 0 I am working with a non-model organism. So I constructed TERM2GENE and TERM2NAME files and used enricher to run GO enrichment analysis. The code I used was below. Finally, I got 71 GO terms in the result. But actually, there are 99…

Continue Reading number of GO terms in results

geneiD-genetranscript annotations

Hello, Trying to generate a frame with 2 columns: transcript_id and gene_id, in LINUX (gtf from esembl) grep -P -o ‘ESNCAGd{11} Equus_caballus.EquCab3.0.104.gtf’ > ensecag.txt grep -P -o ‘ESNCATd{11} Equus_caballus.EquCab3.0.104.gtf’ > ensecat.txt wc -l enseca* # To see if both files have the same length They are not the same length:…

Continue Reading geneiD-genetranscript annotations

does not contain a ‘gene’ attribute

htseq-count returns : does not contain a ‘gene’ attribute 1 Dear BIOSTAR community, I’m trying to make count matrix with htseq-count, htseq-count -s yes -t gene -i gene 01.sorted.sam annotation_cattle.gff > 01.txt even with –idattr=gene , it returns error: Error processing GFF file (line 1864255 of file annotation_cattle.gff): Feature gene-D1Y31_gp1…

Continue Reading does not contain a ‘gene’ attribute

Circos plot with logfold change RNA seq data

I am new to circos plot analysis and have been trying to use the cyclize package. I want to display mRNA differential gene expression data based on data analyses of 8 libraries and links between their respective target genes. The dataset I am working with looks like this geneid baseMean…

Continue Reading Circos plot with logfold change RNA seq data

Combining 2 different depth RNA-seq data (DESeq2)

Hello, I have two RNA-seq data generated from Illumina Novaseq (same experimental design but different depth, 25M and 15M reads/sample for Run1 and Run2 respectively). The dateset look like this: Samples Condition Run Sample_1 A R1 Sample_2 B R1 Sample_3 A R1 Sample_4 B R1 Sample_5 A R1 Sample_6 B…

Continue Reading Combining 2 different depth RNA-seq data (DESeq2)

Combining RNA-seq data from 2 experiments (DESeq2)

Hello, I have two RNA-seq data generated from Illumina Novaseq (same experimental design but different depth, 25M and 15M reads/sample for Run1 and Run2 respectively). The dateset look like this: Samples Condition Run Sample_1 A R1 Sample_2 B R1 Sample_3 A R1 Sample_4 B R1 Sample_5 A R1 Sample_6 B…

Continue Reading Combining RNA-seq data from 2 experiments (DESeq2)

How to colour points in cnetplot of clustprofiler?

I have a cnetplot from running enrichment with kegg using clusterprofiler. I have scores input as the fold change but for each gene in the plot they are not varying in colour to show their difference in the fold change score. My dataset is genes of entrez IDs and then…

Continue Reading How to colour points in cnetplot of clustprofiler?

Answer: AnnotationHub::mapIds() cannot find existing ENSG (GEO supplemental data cross-r

Hi, a quick check on NCBI Gene reveals that the official symbol for this is *PRXL2C*, not *AAED1*. In this way, I would not have expected `org.Hs.eg.db` (using ‘recent’ annotation) to have it. However, I can see that `EnsDb.Hsapiens.v86` (older version) does [have it]. So, there must have been an…

Continue Reading Answer: AnnotationHub::mapIds() cannot find existing ENSG (GEO supplemental data cross-r