Combining different data types into a single matrix for WGCNA using DESeq2 normalization
I have two different RNA-seq datasets for the sample set of samples, generated using mRNA-seq and smallRNA-seq. The goal here is to identify a set of coding-genes and smallRNAs (no known function/targets) that act together. One of the approaches I’m considering is applying WGCNA on each dataset and use correlation to associate gene modules found in each data type.
Another way I’m considering is applying WGCNA on a concatenated data matrix, where DESeq2 normalization is applied to each data type (mRNA and smallRNA) independently and then combined to create a single matrix, which can then be used as an input for WGCNA. What I wasn’t sure was if combining counts that came from different sequencing methods and were normalized independently is reasonable. I’m having a hard time coming up with a logical explanation for why it’s incorrect to do this (or why it’s okay to do this) so if anyone can help out, would very much appreciate it!
p.s. Also, if you have any suggestions for what I’m trying to achieve here, it would be great if you could share your thoughts!
• 25 views
Read more here: Source link