I am new to bioinformatics and currently try to understand why I should normalize scRNA-seq count data, e.g. by scaling each observation/cell to have the same number of total counts, before working with the data.
This paper says that: “Thus, when gene expression is compared between cells based on count data, any difference may have arisen solely due to sampling effects. Normalization addresses this issue by e.g. scaling count data to obtain correct relative gene expression abundances between cells.”
Can somebody explain this a bit clearer, maybe a ELI5? (sorry, I have a Computer Science background)
Many thanks in advance!
Read more here: Source link