zbMATH — the first resource for mathematics

Computational genomics with R. With the assistance of Verdan Franke, Bora Uyar and Jonathan Ronen. (English) Zbl 07311495
Chapman & Hall/CRC Computational Biology Series. Boca Raton, FL: CRC Press (ISBN 978-1-4987-8185-5/hbk; 978-0-429-08431-7/ebook). xxii, 440 p. (2021).
Preliminary review / Publisher’s description: Computational Genomics with R provides a starting point for beginners in genomic data analysis and also guides more advanced practitioners to sophisticated data analysis techniques in genomics. The book covers topics from R programming, to machine learning and statistics, to the latest genomic data analysis techniques. The text provides accessible information and explanations, always with the genomics context in the background. This also contains practical and well-documented examples in R so readers can analyze their data by simply reusing the code presented. As the field of computational genomics is interdisciplinary, it requires different starting points for people with different backgrounds. For example, a biologist might skip sections on basic genome biology and start with R programming, whereas a computer scientist might want to start with genome biology.
After reading:
You will have the basics of R and be able to dive right into specialized uses of R for computational genomics such as using Bioconductor packages.
You will be familiar with statistics, supervised and unsupervised learning techniques that are important in data modeling, and exploratory analysis of high-dimensional data.
You will understand genomic intervals and operations on them that are used for tasks such as aligned read counting and genomic feature annotation.
You will know the basics of processing and quality checking high-throughput sequencing data.
You will be able to do sequence analysis, such as calculating GC content for parts of a genome or finding transcription factor binding sites.
You will know about visualization techniques used in genomics, such as heatmaps, meta-gene plots, and genomic track visualization.
You will be familiar with analysis of different high-throughput sequencing data sets, such as RNA-seq, ChIP-seq, and BS-seq.
You will know basic techniques for integrating and interpreting multi-omics datasets.
92-02 Research exposition (monographs, survey articles) pertaining to biology
92D10 Genetics and epigenetics
62P10 Applications of statistics to biology and medical sciences; meta analysis
Full Text: DOI