Introduction to R for Microbiome Data

  • Abubucker, Sahar, Nicola Segata, Johannes Goll, Alyxandria M. Schubert, Jacques Izard, Brandi L. Cantarel, Beltran Rodriguez-Mueller, Jeremy Zucker, Mathangi Thiagarajan, Bernard Henrissat, Owen White, Scott T. Kelley, Barbara Methé, Patrick D. Schloss, Dirk Gevers, Makedonka Mitreva, and Curtis Huttenhower. 2012. Metabolic reconstruction for metagenomic data and its application to the human microbiome. PLoS Computational Biology 8 (6): e1002358. doi.org/10.1371/journal.pcbi.1002358.


  • Andersen, Kasper S., Rasmus H. Kirkegaard, Søren M. Karst, and Mads Albertsen. 2018. ampvis2: An R package to analyse and visualise 16S rRNA amplicon data. BioRxiv 299537.



  • Anderson, Edgar. 1935. The irises of the Gaspe Peninsula. Bulletin of the American Iris Society 59: 2–5.



  • Caporaso, J. Gregory, Christian L. Lauber, William A. Walters, Donna Berg-Lyons, Catherine A. Lozupone, Peter J. Turnbaugh, Noah Fierer, and Rob Knight. 2011. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proceedings of the National Academy of Sciences 108 (Supplement 1): 4516–4522.


  • DiGiulio, D.B., B.J. Callahan, P.J. McMurdie, E.K. Costello, D.J. Lyell, A. Robaczewska, C.L. Sun, D.S. Goltsman, R.J. Wong, G. Shaw, D.K. Stevenson, S.P. Holmes, and D.A. Relman. 2015. Temporal and spatial variation of the human microbiota during pregnancy. Proceedings of the National Academy of Sciences of the United States of America 112 (35): 11060–11065. doi.org/10.1073/pnas.1502875112.


  • Fisher, Ronald A. 1936. The use of multiple measurements in taxonomic problems. Annals of Eugenics 7 (2): 179–188.


  • Ordination methods, diversity analysis and other functions for community and vegetation ecologists. CRAN.R-project.org/package=vegan

  • Huber, Wolfgang, Vincent J. Carey, Robert Gentleman, Simon Anders, Marc Carlson, Benilton S. Carvalho, Hector Corrada Bravo, Sean Davis, Laurent Gatto, Thomas Girke, Raphael Gottardo, Florian Hahne, Kasper D. Hansen, Rafael A. Irizarry, Michael Lawrence, Michael I. Love, James MacDonald, Valerie Obenchain, Andrzej K. Oleś, Hervé Pagès, Alejandro Reyes, Paul Shannon, Gordon K. Smyth, Dan Tenenbaum, Levi Waldron, and Martin Morgan. 2015. Orchestrating high-throughput genomic analysis with Bioconductor. Nature Methods 12 (2): 115–121. doi.org/10.1038/nmeth.3252. pubmed.ncbi.nlm.nih.gov/25633503; www.ncbi.nlm.nih.gov/pmc/articles/PMC4509590/.


  • Oksanen, Jari, F. Guillaume Blanchet, Michael Friendly, Roeland Kindt, Pierre Legendre, Dan McGlinn, Peter R. Minchin, R.B. O’Hara, Gavin L. Simpson, Peter Solymos, and M. Henry H. Stevens. 2018. Vegan: Community ecology package. R Package Version 2 (6).



  • Oksanen, Jari, F. Guillaume Blanchet, Michael Friendly, Roeland Kindt, Pierre Legendre, Dan McGlinn, Peter R. Minchin, R.B. O’Hara, Gavin L. Simpson, Peter Solymos, M. Henry H. Stevens, Eduard Szoecs, and Helene Wagner. 2019. Vegan: Community ecology package. R Package Version 2.



  • Kassambara, Alboukadel. 2020 June 27. ggpubr: ‘ggplot2’ based publication ready plots. cran.r-project.org/web/packages/ggpubr/index.html

  • Lahti, Leo, Sudarshan Shetty, et al. 2017. Tools for microbiome analysis in R. Version 1.9.95.



  • Louca, Stilianos, and Michael Doebeli. 2017. Efficient comparative phylogenetics on large trees. Bioinformatics 34 (6): 1053–1055. doi.org/10.1093/bioinformatics/btx701.


  • Matsen, Frederick A., Noah G. Hoffman, Aaron Gallagher, and Alexandros Stamatakis. 2012. A format for phylogenetic placements. PLoS One 7 (2): e31009. doi.org/10.1371/journal.pone.0031009.


  • McDonald, Daniel, Jose C. Clemente, Justin Kuczynski, Jai Ram Rideout, Jesse Stombaugh, Doug Wendel, Andreas Wilke, Susan Huse, John Hufnagle, Folker Meyer, Rob Knight, and J. Gregory Caporaso. 2012. The biological observation matrix (BIOM) format or: How I learned to stop worrying and love the ome-ome. GigaScience 1 (1). doi.org/10.1186/2047-217x-1-7.

  • McMurdie, Paul J., and Susan Holmes. 2013. phyloseq: An R package for reproducible interactive analysis and graphics of microbiome census data. PLoS One 8 (4): e61217. doi.org/10.1371/journal.pone.0061217.

  • ———. 2022. “Handling and analysis of high-throughput microbiome census data.” Accessed March 4, 2022. www.bioconductor.org/packages/release/bioc/html/phyloseq.html

  • McMurdie, Paul J., and Joseph N. Paulson. 2021. Biomformat: An interface package for the BIOM file format. R/Bioconductor Package Version 1.23.0. Last Modified October 26, 2021. Accessed 5 Mar 2022.



  • O’Keefe, Stephen J.D., Jia V. Li, Leo Lahti, Ou Junhai, Franck Carbonero, Khaled Mohammed, Joram M. Posma, James Kinross, Elaine Wahl, Elizabeth Ruder, Kishore Vipperla, Vasudevan Naidoo, Lungile Mtshali, Sebastian Tims, Philippe G.B. Puylaert, James DeLany, Alyssa Krasinskas, Ann C. Benefiel, Hatem O. Kaseb, Keith Newton, Jeremy K. Nicholson, Willem M. de Vos, H. Rex Gaskins, and Erwin G. Zoetendal. 2015. Fat, fibre and cancer risk in African Americans and rural Africans. Nature Communications 6 (1): 6342. doi.org/10.1038/ncomms7342.


  • O’Keefe, Stephen J.D., et al. 2016. Data from: Fat, fibre and cancer risk in African Americans and rural Africans. Dryad, Dataset.



  • Paradis, Emmanuel, and Klaus Schliep. 2018. Ape 5.0: An environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35 (3): 526–528. doi.org/10.1093/bioinformatics/bty633.


  • Pasolli, Edoardo, Lucas Schiffer, Paolo Manghi, Audrey Renson, Valerie Obenchain, Duy Tin Truong, Francesco Beghini, Faizan Malik, Marcel Ramos, Jennifer B. Dowd, Curtis Huttenhower, Martin Morgan, Nicola Segata, and Levi Waldron. 2017. Accessible, curated metagenomic data through ExperimentHub. Nature Methods 14 (11): 1023–1024. doi.org/10.1038/nmeth.4468.


  • Revell, Liam J. 2012. Phytools: An R package for phylogenetic comparative biology (and other things). Methods in Ecology and Evolution 3 (2): 217–223.


  • Revell, Liam J. 2022. Phytools: phylogenetic tools for comparative biology (and other things). cran.r-project.org/web/packages/phytools/index.html.

  • Schiffer, Lucas, and Levi Waldron. 2021. curatedMetagenomicData. Last Modified 22 December 2021. Accessed 6 Mar 2022. bioconductor.org/packages/release/data/experiment/vignettes/curatedMetagenomicData/inst/doc/curatedMetagenomicData.html

  • Truong, Duy Tin, Eric A. Franzosa, Timothy L. Tickle, Matthias Scholz, George Weingart, Edoardo Pasolli, Adrian Tett, Curtis Huttenhower, and Nicola Segata. 2015. MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nature Methods 12 (10): 902–903. doi.org/10.1038/nmeth.3589.


  • Wickham, Hadley. 2016. ggplot2: Elegant graphics for data analysis. New York: Springer-Verlag.


  • ———. 2020. tidyr: Tidy messy data. CRAN.R-project.org/package=tidyr.

  • Wickham, Hadley, Jim Hester, and Romain Francois. 2018. readr: Read rectangular text data. R Package Version 1.3.1. CRAN.R-project.org/package=readr.



  • Wickham, Hadley, Mara Averick, Jennifer Bryan, Winston Chang, Lucy McGowan, Romain François, Garrett Grolemund, Alex Hayes, Lionel Henry, Jim Hester, Max Kuhn, Thomas Pedersen, Evan Miller, Stephan Bache, Kirill Müller, Jeroen Ooms, David Robinson, Dana Seidel, Vitalie Spinu, and Hiroaki Yutani. 2019. Welcome to the tidyverse. Journal of Open Source Software 4: 1686. doi.org/10.21105/joss.01686.


  • Wickham, Hadley, Romain François, Lionel Henry, and Kirill Müller. 2020. dplyr: A grammar of data manipulation. A fast, consistent tool for working with data frame like objects, both in memory and out of memory. R Package Version 1.0.7.



  • Xia, Yinglin, and Jun Sun. 2022. An integrated analysis of microbiomes and metabolomics. American Chemical Society.



  • Xia, Yinglin, Jun Sun, and Ding-Geng Chen. 2018. Introduction to R, RStudio and ggplot2. In Statistical analysis of microbiome data with R, 77–127. Springer.


  • Zhang, Xinyan, Yu-Fang Pei, Lei Zhang, Boyi Guo, Amanda H. Pendegraft, Wenzhuo Zhuang, and Nengjun Yi. 2018. Negative binomial mixed models for analyzing longitudinal microbiome data. Frontiers in Microbiology 9: 1683. doi.org/10.3389/fmicb.2018.01683. pubmed.ncbi.nlm.nih.gov/30093893; www.ncbi.nlm.nih.gov/pmc/articles/PMC6070621/.

