Study materials on R and data science are plentiful online. Here we highlight the essentials that will help you start on the work you need to do and figure out the details along the way.

Books

Cheatsheets

SQL

Parallel processing in R

Working with medium data (still fits in RAM, but slow with dplyr)

Working with big data (located on a cluster)

Plotting libraries

  • ggplot2 - see R for Data Science
  • plotly - makes ggplot2 plots interactive
  • ggvis - interactive plots (behind the scenes, interactive plots are built with shiny, and you can currently have only one running at a time in a given R session)
  • dygraphs - R interface to the dygraphs JavaScript charting library, with rich facilities for charting time-series data
  • ggpubr - ggplot2-based wrappers for publication-ready plots
  • Visualize Large Data
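
As a quick illustration of how two of the libraries above fit together, here is a minimal sketch that builds a static ggplot2 chart and passes it to plotly's `ggplotly()` to get an interactive version. It assumes the ggplot2 and plotly packages are installed; the built-in `mtcars` dataset and the chosen columns are only placeholders for your own data.

```r
# Assumes: install.packages(c("ggplot2", "plotly")) has been run beforehand.
library(ggplot2)
library(plotly)

# Static scatter plot of car weight against fuel economy,
# coloured by number of cylinders (mtcars ships with base R).
p <- ggplot(mtcars, aes(x = wt, y = mpg, colour = factor(cyl))) +
  geom_point() +
  labs(x = "Weight (1000 lbs)", y = "Miles per gallon", colour = "Cylinders")

# Convert the ggplot object into an interactive plotly widget
# (hover tooltips, zoom, pan).
p_interactive <- ggplotly(p)

# Printing the object renders it in the RStudio viewer or a browser.
p_interactive
```

Printing `p` instead of `p_interactive` gives the ordinary static plot, so the same ggplot2 code can serve both a report and an interactive page.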