DESeq2 rlog-transformed RNA-seq data for machine learning input
I am interested in using rlog-transformed RNA-seq data for machine learning applications. One concern that I have is that rlog does not account for gene length.
Would it be unadvisable to scale rlog-transformed data by log-gene length?
A couple notes on my approach:
1) I am planning on using a frozen rlog transformation for my validation/test sets
2) Because of my deep learning architecture, I would prefer to account for gene length in preprocessing rather than downstream
Thank you for your time!
• 26 views
Read more here: Source link