R is a programming language we can use for data analysis and make statistical inferences for data. For a Big Data analyst, it is important to learn R programming. R programming can make for exploratory data analysis easy because of its packages and libraries. R help in making beautiful visualizations using libraries like ggplot2. R can be used for data wrangling which actually means cleaning the raw data and making the data useful for modeling.
R programming can be integrated with Big data and some of these software examples:
- RHIPE(R and Hadoop Integrated Programming Environment),
- ORCH(Oracle R Connector for Hadoop)
You can watch this video to know how programming languages can be integrated with Big data: