PivotalR is a package that enables users of R, the most popular open source statistical programming language and environment to interact with (Greenplum) Database as well as Apache HAWQ (incubating) and the open-source database PostgreSQL for Big Data analytics. It does so by providing an interface to the operations on tables/views in the database. These operations are almost the same as those of data.frame. Minimal amount of data is transfered between R and the database system. Thus the users of R do not need to learn SQL when they operate on the objects in the database. PivotalR also lets the user to run the functions of the open-source big-data machine learning package Apache MADlib (incubating) directly from R.
An Introduction to PivotalR
vignette("pivotalr") # execute in R console to view the PDF file
Or try out the latest development version from github by running the following code (Need R >= 3.0.2):
## install.packages("devtools") # 'devtools' package is only available for R >= 3.0.2 devtools::install_github("PivotalR", "pivotalsoftware")
Or download the source tarball directly from here, and then install the tarball
where "pivotalsoftware-PivotalR-xxxx.tar.gz" is the name of the package that you have downloaded.
install.packages("pivotalsoftware-PivotalR-xxxx.tar.gz", repos = NULL, type = "source")