Getting exon sequences for all human genes
Hi everyone,
I would be very grateful if you could help me.
I want to download the sequences of all the exons for each human gene. I went to ensembl biomart and tried to do it for BRCA2 first (this was a random choice). First I selected the following attributes: Unspliced(gene), Exon start, Exon end, strand, Gene start. I thought that this information will be enough to ‘cut out’ all the exons. The first thing I noticed is that the number of exons is almost twice larger then the number I see on the Wiki.
Then I tried to download directly Exon sequences, but the number of sequences was again larger the the number of exons should be and moreover several exon ids correspond to the same sequence.
I also tried to download coding sequence and cDNA, but the length of both sequences is not consistent with ‘official’ BRCA2 length!
These all drives me crazy and I have absolutely no idea on what to do. All I want is to get for each gene the sequences of its exons. Help me please!
Thanks!
• 2.2k views
Read more here: Source link