The guiding principle behind duawranglr is to make it easier for organizations to share data that contain protected elements and/or personally idenfiable information (PII) with researchers. There are two key problems this package attempts to solve:
The duawranglr package is designed with the idea that rather than setting a new DUA for each project in an ongoing collaboration between researchers and data partners, two things will happen instead:
An example crosswalk file (e.g. a CSV file or Excel spreadsheet) might look like this:
| level_i | level_ii | level_iii |
|---|---|---|
| sid | sid | sid |
| sname | sname | sname |
| dob | dob | |
| gender | ||
| raceeth | ||
| tid | ||
| tname | tname | tname |
| zip | zip |
Each column represents a restriction level—level_i, level_ii, or level_iii—along with the corresponding data element names that are restricted at that level. In this crosswalk, like variable names have been aligned so that they are easier to compare, but the elements can be included in whichever way makes most sense to the data administrator.
The restriction level names are arbitrary as far as the package goes, but in conjunction with a DUA, they have meaning:
The benefit of this level-plus-crosswalk system is two-fold:
Functions in the package do not
There are many packages, such as those in the tidyverse suite, that are already well suited to data wrangling tasks. There is no need to replicate those functions in this package.
It also should go without saying, but users can simply not use functions in this package when attempting to secure restricted data. What this package does is offer a framework and a set of useful functions that, when followed, help users secure data in a clear and replicable manner that allows data administrators to more easily participate in the process.
Need a high-speed mirror for your open-source project?
Contact our mirror admin team at info@clientvps.com.
This archive is provided as a free public service to the community.
Proudly supported by infrastructure from VPSPulse , RxServers , BuyNumber , UnitVPS , OffshoreName and secure payment technology by ArionPay.