This is a 'sparklyr' extension integrating 'VariantSpark' and R. 'VariantSpark' is a framework based on 'scala' and 'spark' to analyze genome datasets, see <https://bioinformatics.csiro.au/>. It was tested on datasets with 3000 samples each one containing 80 million features in either unsupervised clustering approaches and supervised applications, like classification and regression. The genome datasets are usually writing in VCF, a specific text file format used in bioinformatics for storing gene sequence variations. So, 'VariantSpark' is a great tool for genome research, because it is able to read VCF files, run analyses and return the output in a 'spark' data frame.
| Version: | 0.1.1 |
| Imports: | sparklyr (≥ 1.0.1) |
| Suggests: | testthat |
| Published: | 2019-06-13 |
| DOI: | 10.32614/CRAN.package.variantspark |
| Author: | Samuel Macêdo [aut, cre], Javier Luraschi [aut] |
| Maintainer: | Samuel Macêdo <samuelmacedo at recife.ifpe.edu.br> |
| License: | Apache License 2.0 | file LICENSE |
| NeedsCompilation: | no |
| Materials: | README |
| CRAN checks: | variantspark results |
| Reference manual: | variantspark.html , variantspark.pdf |
| Package source: | variantspark_0.1.1.tar.gz |
| Windows binaries: | r-devel: variantspark_0.1.1.zip, r-release: variantspark_0.1.1.zip, r-oldrel: variantspark_0.1.1.zip |
| macOS binaries: | r-release (arm64): variantspark_0.1.1.tgz, r-oldrel (arm64): variantspark_0.1.1.tgz, r-release (x86_64): variantspark_0.1.1.tgz, r-oldrel (x86_64): variantspark_0.1.1.tgz |
Please use the canonical form https://CRAN.R-project.org/package=variantspark to link to this page.
Need a high-speed mirror for your open-source project?
Contact our mirror admin team at info@clientvps.com.
This archive is provided as a free public service to the community.
Proudly supported by infrastructure from VPSPulse , RxServers , BuyNumber , UnitVPS , OffshoreName and secure payment technology by ArionPay.