Tag: ensembl

xml – I am trying to importxml from ENSEMBL using xpath copied from Chrome, but it is not working

I want to use google sheet’s IMPORTXML to extract the gene name (SLC3A1) and ensembl ID (ENSG00000138079) from this URL: asia.ensembl.org/Multi/Search/Results?q=SLC3A1;site=ensembl I tried copying xpath from Chrome and also tried deriving it on my own step by step, but I am only getting a #NA. My xpath: /html/body/div[1]/div/div[2]/div[1]/div[1]/div[2]/div[2]/div[4]/div/div/div[2]/div/div/div[3]/div[2]/div[1]/div[2]/div/div/div/div[1]/div/div[2]/span From Chrome:…

Continue Reading xml – I am trying to importxml from ENSEMBL using xpath copied from Chrome, but it is not working

How to analyze the differential expression of a gene if it has several ensembl id

How to analyze the differential expression of a gene if it has several ensembl id 0 Hie, I have obtained the differential expressions of genes (from RNA-seq data) in an excel file. I can see one gene has several ensembl ids and I guess the gene is from the haplotype…

Continue Reading How to analyze the differential expression of a gene if it has several ensembl id

Transcription Start Site

Transcription Start Site 2 What are the best databases to check out the transcription start sites of specific genes in human genome? TSS • 130 views wget -q -O – “http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/wgEncodeGencodeBasicV19.txt.gz” | gunzip -c | awk ‘(int($7)< int($8)) {if($4==”+”) {printf(“%s\t%d\t%d\t%s\t%s\n”,$3,$7,int($7)+1,$2,$4);}else {printf(“%s\t%d\t%d\t%s\t%s\n”,$3,int($8)-3,$8,$2,$4);}}’ chr1 69090 69091 ENST00000335137.3 + chr1 139306 139309 ENST00000423372.3…

Continue Reading Transcription Start Site

Bioconductor – AHEnsDbs

DOI: 10.18129/B9.bioc.AHEnsDbs     This package is for version 3.10 of Bioconductor; for the stable, up-to-date release version, see AHEnsDbs. EnsDbs for AnnotationHub Bioconductor version: 3.10 Supplies AnnotationHub with EnsDb Ensembl-based annotation databases for all species. EnsDb SQLite databases are generated separately from Ensembl MySQL databases using functions from the…

Continue Reading Bioconductor – AHEnsDbs

snp – Reference variant detected as altered one in bam file

I received (from manufacturer) several .bam files and I used four callers (samtools, freebayes, haplotypecaller, deepvariant) to find some sequence variants. In obtained .vcf files, I took a closer look to some calls. I found interesting, homozygous one rs477033 (C/G Ref/Alt) with flag ‘COMMON=0’ and very low MAF. I also…

Continue Reading snp – Reference variant detected as altered one in bam file

Antiplatelet response to clopidogrel is associated with a haplotype in CYP2C19 gene in Pakistani patients

Amsterdam, E. A. et al. 2014 AHA/ACC guideline for the management of patients with non-ST-elevation acute coronary syndromes: Executive summary: A report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation 130, 2354–2394 (2014). Google Scholar  Kernan, W. N. et al. Guidelines for the prevention…

Continue Reading Antiplatelet response to clopidogrel is associated with a haplotype in CYP2C19 gene in Pakistani patients

How to trim a GFF3 file based on specific coordinates?

How to trim a GFF3 file based on specific coordinates? 0 Hi, I would like to create a GFF3 file containing information only for specific coordinates from the chromosome level GFF3 file. I know how to extract gene and CDS info separately but don’t know how to do trimming based…

Continue Reading How to trim a GFF3 file based on specific coordinates?

Inquiry related to vcf file and formatting

Hello everyone, I am trying to run predixcan software. But its showing error as segmentation fault implying that there is something wrong with my vcf files. I am sharing the header of vcf file. ##fileformat=VCFv4.1 ##INFO=<ID=LDAF,Number=1,Type=Float,Description=”MLE Allele Frequency Accounting for LD”> ##INFO=<ID=AVGPOST,Number=1,Type=Float,Description=”Average posterior probability from MaCH/Thunder”> ##INFO=<ID=RSQ,Number=1,Type=Float,Description=”Genotype imputation quality from…

Continue Reading Inquiry related to vcf file and formatting

Antibody Matching Transcripts

Antibody Matching Transcripts 1 Hello everyone, I am interested to find the matching isoforms of a gene that my HPA antibody is able to hit. When I check the Human Atlas Protein website it gives me a list of 4 transcripts my antibody can recognize. However, when I open ensembl…

Continue Reading Antibody Matching Transcripts

Converting mouse Gene IDs to Human while keeping genes that don’t convert

Hi there, I am using bioMart to convert some gene IDs from mouse to human for some data I generated through RNA-seq. I am currently mapping using the following function: convertMouseGeneList <- function(x){ require(“biomaRt”) human = useMart(“ensembl”, dataset = “hsapiens_gene_ensembl”) mouse = useMart(“ensembl”, dataset = “mmusculus_gene_ensembl”) genesV2 = getLDS(attributes =…

Continue Reading Converting mouse Gene IDs to Human while keeping genes that don’t convert

How to search dbSNP using a list of SNPs and retrieve Gene name (hgnc symbol if existing, otherwise just whatever is in there)

How to search dbSNP using a list of SNPs and retrieve Gene name (hgnc symbol if existing, otherwise just whatever is in there) 2 I have a list of 500.000 SNPs from which I want to obtain the gene name. I try to search with biomaRt library(data.table) library(biomaRt) rs <-…

Continue Reading How to search dbSNP using a list of SNPs and retrieve Gene name (hgnc symbol if existing, otherwise just whatever is in there)

microarray miRNA expression data analysis

I wrote a script on how to analyze the microarray-based miRNA expression data. Here is my code: # general config baseDir <- ‘.’ annotfile <- ‘mirbase_genelist.tsv’ setwd(baseDir) options(scipen = 99) require(limma) # read in the data targets <- read.csv(“/media/mdrcubuntu/46B85615B8560439/microarray_text_files/targets.txt”, sep=””) # retain information about background via gIsWellAboveBG project <- read.maimages(targets,source=”agilent.median”,green.only…

Continue Reading microarray miRNA expression data analysis

Answer: AnnotationHub::mapIds() cannot find existing ENSG (GEO supplemental data cross-r

Hi, a quick check on NCBI Gene reveals that the official symbol for this is *PRXL2C*, not *AAED1*. In this way, I would not have expected `org.Hs.eg.db` (using ‘recent’ annotation) to have it. However, I can see that `EnsDb.Hsapiens.v86` (older version) does [have it]. So, there must have been an…

Continue Reading Answer: AnnotationHub::mapIds() cannot find existing ENSG (GEO supplemental data cross-r

AnnotationHub::mapIds() cannot find existing ENSG (GEO supplemental data cross-referenced with ensembl.org)

Anyone know why I’m not getting ENSG ids for some of these symbols? The example below retrieves `NA` for multiple symbols, including AAED1 [whose ENSG is ENSG00000158122][1]. “` > library(AnnotationHub) > library(org.Hs.eg.db) > library(GEOquery) > temp download.file(getGEO(“GSM4430459″)@header$supplementary_file_1,temp) > genes unlink(temp) > ensids = mapIds(org.Hs.eg.db, keys=genes, column=”ENSEMBL”, keytype=”SYMBOL”, multiVals=”first”) > ensids[“AAED1”]…

Continue Reading AnnotationHub::mapIds() cannot find existing ENSG (GEO supplemental data cross-referenced with ensembl.org)

Why does Txdb transcript length not always match to transcript end-start position?

Why does Txdb transcript length not always match to transcript end-start position? 2 I have just found an example that biomart’s transcript_length is not identical with transcript_end – transcript_start. ensembl_gene_id mgi_symbol chromosome_name strand start_position end_position gene_biotype transcript_start transcript_end strand.1 transcript_length 128537 ENSMUSG00000037860 Aim2 1 1 173178445 173293606 protein_coding 173178445 173293606…

Continue Reading Why does Txdb transcript length not always match to transcript end-start position?

install ensembl-vep

install ensembl-vep 0 Hello, I want to install ensembl-vep in my Ubuntu 18.04.2. I have already installed LWP::Simple. What can I do in the next step? Thanks in advance for great help! Best, Yue Inspiron-3670:~$ perl -MCPAN -e’install “LWP::Simple”‘ Reading ‘/home/jing/.cpan/Metadata’ Database was generated on Sat, 07 Aug 2021 06:55:53…

Continue Reading install ensembl-vep

Where To Find Annotation File For Agilent Microarray?

An easier way that has [probably] only come about since this question was posted is via biomaRt in R. You can build annotation tables for Agilent 4×44 arrays for mouse and human as follows: require(biomaRt) Homo sapiens # agilent_wholegenome_4x44k_v1 mart <- useMart(‘ENSEMBL_MART_ENSEMBL’) mart <- useDataset(‘hsapiens_gene_ensembl’, mart) annotLookup <- getBM( mart…

Continue Reading Where To Find Annotation File For Agilent Microarray?