--- title: "c - Using rix to build project specific environments" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{c-using-rix-to-build-project-specific-environments} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>" ) ``` ```{r, include=FALSE} library(rix) ``` ## Project-specific Nix environments Now that you have the required software installed, it’s to time learn more about declaring and using reproducible environments. The ideal workflow when using `{rix}` is to create a new, separate environment at the start of a project. Let's say that you wish to analyse some data set, and need `{dplyr}` and `{ggplot2}`. Let's also suppose that you use VS Code as your IDE (there will be more discussion on editors in the vignette `vignette("e-interactive-use")` but for now, let’s assume that you use VS Code). With the `rix::rix()` function, you can easily generate the right `default.nix` file. You need to provide the following inputs to `rix()`: - `r_ver`: the version of R required. Use `"latest-upstream"` for the latest version, and use `available_r()` to check which versions are available; - `date`: instead of `r_ver`, use `date` to specify a date. This date corresponds to a CRAN snapshot of that day. Current R versions and latest release of Bioconductor at that date are also provided. Check available dates using `available_dates()`; - `r_pkgs`: the required R packages. For example "dplyr" (more on this in the vignette `vignette("d1-installing-r-packages-in-a-nix-environment")`); - `system_pkgs`: the required system packages, if needed. For example "quarto", or a Python interpreter (more on this in the vignette `vignette("d2-installing-system-tools-and-texlive-packages-in-a-nix-environment")`); - `git_pkgs`: list of git packages to add (more on this in the vignette `vignette("d1-installing-r-packages-in-a-nix-environment")`); - `ide`: the integrated development editor to use (more on this in the vignette `vignette("e-interactive-use")`) - `path`: the path where to save the `default.nix` file. - `overwrite`: whether to overwrite the `default.nix` file or not. - `print`: whether to print the `default.nix` file to the console or not. Run the following command to generate a `default.nix` file: ```{r, eval = FALSE} path_default_nix <- tempdir() rix( r_ver = "latest-upstream", r_pkgs = c("dplyr", "ggplot2"), system_pkgs = NULL, git_pkgs = NULL, ide = "code", project_path = path_default_nix, overwrite = TRUE, print = TRUE ) #> let #> pkgs = import (fetchTarball "https://github.com/NixOS/nixpkgs/archive/12a9c0004bc987afb1ff511ebb97b67497a68e22.tar.gz") {}; #> #> rpkgs = builtins.attrValues { #> inherit (pkgs.rPackages) #> dplyr #> ggplot2 #> languageserver; #> }; #> #> system_packages = builtins.attrValues { #> inherit (pkgs) #> R #> glibcLocales #> nix; #> }; #> #> in #> #> pkgs.mkShell { #> LOCALE_ARCHIVE = if pkgs.system == "x86_64-linux" then "${pkgs.glibcLocales}/lib/locale/locale-archive" else ""; #> LANG = "en_US.UTF-8"; #> LC_ALL = "en_US.UTF-8"; #> LC_TIME = "en_US.UTF-8"; #> LC_MONETARY = "en_US.UTF-8"; #> LC_PAPER = "en_US.UTF-8"; #> LC_MEASUREMENT = "en_US.UTF-8"; #> #> buildInputs = [ rpkgs system_packages ]; #> #> } ``` To start using this environment, open a terminal in the folder containing `default.nix` and use the following Nix command: ``` nix-build ``` `nix-build` is a Nix command that builds an environment according to the specifications found in a `default.nix` file. Once the environment is done building, you should find a new file called `result` next to the `default.nix` file. This file is a symlink to the software installed by Nix. `{rix}` also provides a `nix_build()` function to build Nix environments from within an interactive R session, but it is not always guaranteed to succeed, due to differences in platforms. This is explained in more detail in the following vignette `vignette("z-advanced-topic-running-r-or-shell-code-in-nix-from-r")`. In case of doubt, run `nix-build` from your usual terminal application. To now use the environment, type in the same terminal as before: ``` nix-shell ``` This will activate the environment. If you have VS Code installed you can start it from this environment and VS Code will use this specific R version library of packages. We will explore this in greater detail in the vignette `vignette("e-interactive-use")`. ## Which `r_ver` should you choose? There are several options for `r_ver` and each have a specific purpose. The table below explains this: ```{=html}
r_ver or date Intended use State of R version State of CRAN packages State of Bioconductor packages State of other packages in Nixpkgs
r_ver = "latest-upstream" Start of new project where versions don’t matter Current or previous Outdated (up to 6 months) Outdated (up to 6 months) Current at time of generation
r_ver = "4.4.2" Reproducing old project or starting a new project where versions don’t matter Same as in `r_ver`, check `available_r()` Outdated (up to 2 months if using latest release) Outdated (up to 2 months if using latest release) Potentially outdated (up to 12 months)
r_ver = "2024-12-14" Reproducing old project or starting a new project using the most recent date Current at that date, check `available_dates()` Current at that date, check `available_dates()` Current at that date, check `available_dates()` Potentially outdated (up to 12 months)
r_ver = "bleeding-edge" To develop against the latest release of CRAN Always current Always current Always current Always current
r_ver = "frozen-edge" To develop against the latest release of CRAN, but manually manage updates Current at time of generation Current at time of generation Current at time of generation Current at time of generation
r_ver = "r-devel" To develop/test against the development version of R Development version Always current Always current Always current
r_ver = "r-devel-bioc-devel" To develop/test against the development version of R and Bioconductor Development version Always current Development version Always current
r_ver = "bioc-devel" To develop/test against the development version of Bioconductor Always current Always current Development version Always current
``` If you want to benefit from relatively fresh packages and have a stable environment for production purposes, using `"latest-upstream"` or the most recent available date for `r_ver` is probably your best option. ## Running old projects with {rix} The example below shows how to create a `default.nix` with instructions to build an environment with R version 4.2.1, the `{dplyr}` and `{janitor}` packages and no specific IDE: ```{r parsermd-chunk-2, eval = F} path_default_nix <- tempdir() rix( r_ver = "4.2.1", r_pkgs = c("dplyr", "janitor"), system_pkgs = c("quarto"), git_pkgs = NULL, ide = "other", project_path = path_default_nix, overwrite = TRUE ) ``` The file looks like this: ``` #> let #> pkgs = import (fetchTarball "https://github.com/rstats-on-nix/nixpkgs/archive/2022-10-20.tar.gz") {}; #> #> rpkgs = builtins.attrValues { #> inherit (pkgs.rPackages) #> dplyr #> janitor; #> }; #> #> system_packages = builtins.attrValues { #> inherit (pkgs) #> glibcLocales #> nix #> quarto #> R; #> }; #> #> in #> #> pkgs.mkShell { #> LOCALE_ARCHIVE = if pkgs.system == "x86_64-linux" then "${pkgs.glibcLocales}/lib/locale/locale-archive" else ""; #> LANG = "en_US.UTF-8"; #> LC_ALL = "en_US.UTF-8"; #> LC_TIME = "en_US.UTF-8"; #> LC_MONETARY = "en_US.UTF-8"; #> LC_PAPER = "en_US.UTF-8"; #> LC_MEASUREMENT = "en_US.UTF-8"; #> #> buildInputs = [ rpkgs system_packages ]; #> #> } ``` Or use a date for a dated environment (if you use `r_ver = 4.4.2` instead of the date in the call below, this will generate the same expression, as this is the version of R that is included at that date): ``` rix( date = "2024-12-14", r_pkgs = c("dplyr", "ggplot2"), system_pkgs = NULL, git_pkgs = NULL, ide = "code", project_path = path_default_nix, overwrite = TRUE, print = TRUE ) #> let #> pkgs = import (fetchTarball "https://github.com/rstats-on-nix/nixpkgs/archive/2024-12-14.tar.gz") {}; #> #> rpkgs = builtins.attrValues { #> inherit (pkgs.rPackages) #> dplyr #> ggplot2 #> languageserver; #> }; #> #> system_packages = builtins.attrValues { #> inherit (pkgs) #> glibcLocales #> nix #> R; #> }; #> #> in #> #> pkgs.mkShell { #> LOCALE_ARCHIVE = if pkgs.system == "x86_64-linux" then "${pkgs.glibcLocales}/lib/locale/locale-archive" else ""; #> LANG = "en_US.UTF-8"; #> LC_ALL = "en_US.UTF-8"; #> LC_TIME = "en_US.UTF-8"; #> LC_MONETARY = "en_US.UTF-8"; #> LC_PAPER = "en_US.UTF-8"; #> LC_MEASUREMENT = "en_US.UTF-8"; #> #> buildInputs = [ rpkgs system_packages ]; #> #> } ``` The first line is quite important, as it shows which *revision* of `nixpkgs` is being used for this environment, as well as the fork. In the two last examples, the expression will use our `rstats-on-nix` fork which includes many fixes for older packages, especially for Apple Silicon devices. The first expression in this vignette, the one generated with `r_ver = "latest-upstream"` uses the upstream `nixpkgs` from the NixOS project. The *revision* is the commit hash of that particular release of `nixpkgs`, here it's a date, because our fork of `nixpkgs` uses dates as branches names, and we are actually pointing to the head of those branches. When generating a `default.nix` using `r_ver = "latest-upstream"`, a commit is shown instead. This dated branch of our `rstats-on-nix` `nixpkgs` fork is the one that shipped version 4.2.1 of R, so the `{dplyr}` and `{janitor}` packages that will get installed will be the versions available in that date as well. This means that R versions and package versions are always coupled when using Nix. However, if you need a specific version of R, but also a specific version of a package that is not available in that particular Nix revision, one solution is to install that package from GitHub or from the CRAN archives. Read the vignette `vignette("d1-installing-r-packages-in-a-nix-environment")` to know more about this. `available_df()` provides an overview of available R and Bioconductor releases, and their compatibility with different operating systems; in short, on Linux (including WSL on Windows), every available R version or date should work, while for macOS, every available date will likely work on Intel Macs, while only R version 4.0.5 and up will work on Apple Silicon. ## Running programs from an environment You could create a bash script that you put in the path to make the process of launching your editor from that environment more streamlined. For example, if your project is called `housing`, you could create this script and execute it to start your project: #!/bin/bash nix-shell /absolute/path/to/housing/default.nix --run code This will execute VS Code in the environment for the `housing` project. If you use `{targets}` you could execute the pipeline in the environment by running: ``` cd /absolute/path/to/housing/ && nix-shell default.nix --run "Rscript -e 'targets::tar_make()'" ``` ## Running single functions in a subshell It is also possible to run single functions in an isolated environment from an active R session using `with_nix()` and get the output of that function loaded into the current session. Refer to this vignette `vignette("z-advanced-topic-running-r-or-shell-code-in-nix-from-r")` for more details on how to achieve this. Concretely this means that you could be running R version 4.3.2 (installed via Nix, or not), and execute a function on R version 4.0.0 for example in a subshell (or execute a function that requires an old version of a package in that subshell), and get the result of the computation back into the main R session. ## Nix environments are not completely isolated from your system It is important to know that an environment built by Nix is not totally isolated from the rest of the system. Suppose that you have the program `sl` installed on your system, and suppose you build a Nix environment that also comes with `sl`. If you activate that environment, the version of `sl` that will run when called is the one included in the Nix environment. If, however, you start `sl` in a Nix environment that does not come with it, then your system's `sl` will get used instead. In the context of the R programming language, this means that if you have a user or system library of packages (meaning, a library of packages generated by a regular installation of R), these packages may be loaded by an R version running from a Nix shell. To avoid issues with this, calling `rix()` automatically runs `rix_init()` as well, which generates a custom `.Rprofile` file in the project's directory. This way, your regular user library of packages will not interfere with R environments managed by Nix. Moreover, this custom `.Rprofile` also redefines `install.packages()` and makes it throw an error: indeed, if you wish to add packages to your an R environment managed by Nix, you should add these packages to the `default.nix` file instead, and rebuild the environment. If you want to even make other programs inaccessible to a running Nix environment, you can drop into it using `nix-shell --pure` instead of only `nix-shell`.