Install R on Ubuntu 14.04

warning

This post is more than 5 years old. While math doesn't age, code and operating systems do. Please use the code/ideas with caution and expect some issues due to the age of the content. I am keeping these posts up for archival purposes because I still find them useful for reference, even when they are out of date!

In this post I will go over installing R on Ubuntu 14.04 following the installation instructions at cran . I will also cover installing the packages required to do the Topics in machine learning assignment for Coursera's Introduction to Data Science .

As it turns out, not much has changed from the post I wrote: install R on Ubuntu 12.04 , back in 2012. First we add the cran repository we want to use for getting our R updates. I use the stats repository at UCLA with the following command:

$ sudo add-apt-repository "deb http://cran.stat.ucla.edu/bin/linux/ubuntu trusty/"

The use of trusty in the address above indicates that we want to install for Ubuntu 14.04. The archives are signed by a key that we should add for secure apt using:

$ gpg --keyserver keyserver.ubuntu.com --recv-key E084DAB9
$ gpg -a --export E084DAB9 | sudo apt-key add -

If you have problems with these commands consult the installation instructions at cran for alternatives to the above.

Note:

2015, Dec: If you installed R using this method before Oct 18th 2015, the key added above expired and you are probably getting errors about an Expired Key when updates are done on your machine-- see the end of the post for a fix to this issue. If you're installing for the first time, the instructions above will install the current (not-expired) key. So, ignore this note.

Now that we have the repository setup and the key added we do typical Ubuntu install commands to install R:

$ sudo apt-get update
$ sudo apt-get install r-base r-base-dev

r-base-dev is not required but I suggest installing because it might be needed to install other R packages later. This will download and install ~40MB worth of base packages and install everything.

Once finished with the install, you should be able startup R:

$ R

R version 3.1.1 (2014-07-10) -- "Sock it to Me"
Copyright (C) 2014 The R Foundation for Statistical Computing
Platform: x86_64-pc-linux-gnu (64-bit)

R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.

Natural language support but running in an English locale

R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.

Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.

> quit()
Save workspace image? [y/n/c]: n

Installing R packages

The above installation provides the following packages:

r-cran-boot
r-cran-class
r-cran-cluster
r-cran-codetools
r-cran-foreign
r-cran-kernsmooth
r-cran-lattice
r-cran-mass
r-cran-matrix
r-cran-mgcv
r-cran-nlme
r-cran-nnet
r-cran-rpart
r-cran-spatial
r-cran-survival
r-cran-rodbc

These will be automatically update, like normal Ubuntu updates, whenever new versions are available at cran.

If we want other (less standard) packages we have to install and update them by ourselves. The basic process is as follows:

  • Start R using sudo
    $ sudo R
  • To install a package called packagename the command would be:
    > install.packages('packagename', dep = TRUE)
    Note: the quotes around packagename are required. Updating packages is accomplished using:
    $ sudo R
    and
    > update.packages()

Example

Finally, as an example of the above, I'll install the packages required for the Topics in machine learning assignment in Coursera's Introduction to Data Science . The needed packages are: caret , rpart , tree , randomForest , e1071 , and ggplot2 .

First, make sure that you have gcc and gfortran available (many will already have these installed) using:

$ sudo apt-get install gcc gfortran

Next, following the approach outlined above, we start R with sudo

$ sudo R

and then start installing (some of these download and compile a fair amount of code and data -- be patient):

> install.packages('caret', dep = TRUE)
> install.packages('rpart', dep = TRUE)
> install.packages('tree', dep = TRUE)
> install.packages('randomForest', dep = TRUE)
> install.packages('e1071', dep = TRUE)
> install.packages('ggplot2', dep = TRUE)

That's it, we are ready to go!

Note: 2015, Dec 1 -- fixing the expired key

So, you're updating away and you get (something like) the following error:

W: GPG error: http://cran.stat.ucla.edu trusty/ Release: The following signatures were invalid: KEYEXPIRED 1445181253 KEYEXPIRED 1445181253 KEYEXPIRED 1445181253

This means you probably followed the R install instructions before 2015-10-18 and the key used to authenticate the updates has expired. To verify this, you should get similar results:

$ apt-key list | grep expired
pub 2048R/E084DAB9 2010-10-19 [expired: 2015-10-18]

If we check-- see Michael Rutter's key . -- we see that the key has been updated. So, following this info on updating keys we can fix our issue using (be sure to use the key E084DAB9, as shown below, to fix our problem with R updates):

$ sudo apt-key adv --recv-keys --keyserver keys.gnupg.net E084DAB9
Finally, to do the R updates (as well as others that might be queued) immediately we can do:
$ sudo apt-get update
$ sudo apt-get upgrade
and (hopefully) we should be up-to-date and error-free!