I have two datasets. One is normalised_counts.txt which is the results of deseq2 analysis and the other is a predixcan file which have counts for each gene for all the chromosomes. I have to do Pearson correlation to figure out how many genes in our sample is present in the normalised_counts file.
But the problem is that the normalised_Count file is created from rna-seq of bam files for only british ancestry whereas our predixcan file is run on vcf file for all the ancestry.
So my doubt is if I do the correlation analysis won’t it affect my results since in one file i have only one ancestry and in another I have all.
• 22 views
Read more here: Source link