Computing Infrastructure

Workflow Tools for Your Code and Data

Showing 10 of 12

targets

CRAN Peer-reviewed

Dynamic Function-Oriented Make-Like Declarative Pipelines

Maintainer

William Michael Landau

Description

Pipeline tools coordinate the pieces of computationally demanding analysis projects. The targets package is a Make-like pipeline tool for statistics and data science in R. The package skips costly runtime for tasks that are already up to date, orchestrates the necessary computation with implicit parallel computing, and abstracts files as R objects. If all the current output matches the current upstream code and data, then the whole pipeline is up to date, and the results are more trustworthy than otherwise. The methodology in this package borrows from GNU Make (2015, ISBN:978-9881443519) and drake (2018, doi:10.21105/joss.00550).

View Documentation

gitcellar

Staff maintained

Helps Download Archives of GitHub Repositories

Maintainer

Maëlle Salmon

Description

Provide functionality to download archives (backups) for all repositories in a GitHub organization (useful for backups!).

View Documentation

rix

CRAN Peer-reviewed

Reproducible Data Science Environments with Nix

Maintainer

Bruno Rodrigues

Description

Simplifies the creation of reproducible data science environments using the Nix package manager, as described in Dolstra (2006) <ISBN 90-393-4130-3>. The included rix() function generates a complete description of the environment as a default.nix file, which can then be built using Nix. This results in project specific software environments with pinned versions of R, packages, linked system dependencies, and other tools. Additional helpers make it easy to run R code in Nix software environments for testing and production.

View Documentation

git2r

Provides Access to Git Repositories

Maintainer

Stefan Widgren

Description

Interface to the libgit2 library, which is a pure C implementation of the Git core methods. Provides access to Git repositories to extract data and running some basic Git commands.

View Documentation
Scientific use cases

Blischak, J. D., Carbonetto, P., & Stephens, M. (2019). Creating and sharing reproducible research code the workflowr way. F1000Research, 8, 1749. https://doi.org/10.12688/f1000research.20843.1

tarchetypes

CRAN Peer-reviewed

Archetypes for Targets

Maintainer

William Michael Landau

Description

Function-oriented Make-like declarative pipelines for Statistics and data science are supported in the targets R package. As an extension to targets, the tarchetypes package provides convenient user-side functions to make targets easier to use. By establishing reusable archetypes for common kinds of targets and pipelines, these functions help express complicated reproducible pipelines concisely and compactly. The methods in this package were influenced by the targets R package. by Will Landau (2018) doi:10.21105/joss.00550.

View Documentation

babette

CRAN Peer-reviewed

Control BEAST2

Maintainer

Richèl J.C. Bilderbeek

Description

BEAST2 (https://www.beast2.org) is a widely used Bayesian phylogenetic tool, that uses DNA/RNA/protein data and many model priors to create a posterior of jointly estimated phylogenies and parameters. BEAST2 is commonly accompanied by BEAUti 2, Tracer and DensiTree. babette provides for an alternative workflow of using all these tools separately. This allows doing complex Bayesian phylogenetics easily and reproducibly from R.

View Documentation

pkgmatch

Staff maintained

Find R Packages Matching Either Descriptions or Other R Packages

Maintainer

Mark Padgham

Description

Find R packages matching either descriptions or other R packages.

View Documentation

gert

CRAN Staff maintained

Simple Git Client for R

Maintainer

Jeroen Ooms

Description

Simple git client for R based on libgit2 https://libgit2.org with support for SSH and HTTPS remotes. All functions in gert use basic R data types (such as vectors and data-frames) for their arguments and return values. User credentials are shared with command line git through the git-credential store and ssh keys stored on disk or ssh-agent.

View Documentation

fireexposuR

Compute and Visualize Wildfire Exposure

Maintainer

Air Forbes

Description

This package computes and visualizes wildfire exposure using the methods documented in a series of scientific publications.

View Documentation

saperlipopette

Create Example Git Messes

Maintainer

Maëlle Salmon

Description

Holds functions creating Git messes, that users would then solve, to follow https://ohshitgit.com/.

View Documentation

repometrics

Staff maintained

Metrics for Your Code Repository

Maintainer

Mark Padgham

Description

Metrics for your code repository. Call one function to generate an interactive dashboard displaying the state of your code.

View Documentation

bowerbird

Keep a Collection of Sparkly Data Resources

Maintainer

Ben Raymond

Description

Tools to get and maintain a data repository from third-party data providers.

View Documentation

pangoling

Access to Large Language Model Predictions

Maintainer

Bruno Nicenboim

Description

Provides access to word predictability estimates using large language models (LLMs) based on transformer architectures via integration with the Hugging Face ecosystem. The package interfaces with pre-trained neural networks and supports both causal/auto-regressive LLMs (e.g., GPT-2; Radford et al., 2019) and masked/bidirectional LLMs (e.g., BERT; Devlin et al., 2019, doi:10.48550/arXiv.1810.04805) to compute the probability of words, phrases, or tokens given their linguistic context. By enabling a straightforward estimation of word predictability, the package facilitates research in psycholinguistics, computational linguistics, and natural language processing (NLP).

View Documentation

pkgcheck

Staff maintained

rOpenSci Package Checks

Maintainer

Mark Padgham

Description

Check whether a package is ready for submission to rOpenSci’s peer review system.

View Documentation

tinkr

Cast (R)Markdown Files to XML and Back Again

Maintainer

Zhian N. Kamvar

Description

Parsing (R)Markdown files with numerous regular expressions can be fraught with peril, but it does not have to be this way. Converting (R)Markdown files to XML using the commonmark package allows in-memory editing via of markdown elements via XPath through the extensible R6 class called yarn. These modified XML representations can be written to (R)Markdown documents via an xslt stylesheet which implements an extended version of GitHub-flavoured markdown so that you can tinker to your hearts content.

View Documentation

allcontributors

CRAN Staff maintained

Acknowledge all Contributors to a Project

Maintainer

Mark Padgham

Description

Acknowledge all contributors to a project via a single function call. The function appends to a README or other specified file(s) a table with names of all individuals who contributed via code or repository issues. The package also includes several additional functions to extract and quantify contributions to any repository.

View Documentation

pkgstats

Staff maintained

Metrics of R Packages

Maintainer

Mark Padgham

Description

Static code analyses for R packages using the external code-tagging libraries ctags and gtags. Static analyses enable packages to be analysed very quickly, generally a couple of seconds at most. The package also provides access to a database generating by applying the main function to the full CRAN archive, enabling the statistical properties of any package to be compared with all other CRAN packages.

View Documentation

beautier

CRAN Peer-reviewed

BEAUti from R

Maintainer

Richèl J.C. Bilderbeek

Description

BEAST2 (https://www.beast2.org) is a widely used Bayesian phylogenetic tool, that uses DNA/RNA/protein data and many model priors to create a posterior of jointly estimated phylogenies and parameters. BEAUti 2 (which is part of BEAST2) is a GUI tool that allows users to specify the many possible setups and generates the XML file BEAST2 needs to run. This package provides a way to create BEAST2 input files without active user input, but using R function calls instead.

View Documentation

beastier

CRAN Peer-reviewed

Call BEAST2

Maintainer

Richèl J.C. Bilderbeek

Description

BEAST2 (https://www.beast2.org) is a widely used Bayesian phylogenetic tool, that uses DNA/RNA/protein data and many model priors to create a posterior of jointly estimated phylogenies and parameters. BEAST2 is a command-line tool. This package provides a way to call BEAST2 from an R function call.

View Documentation

babeldown

Staff maintained

Helpers for Automatic Translation of Markdown-based Content

Maintainer

Maëlle Salmon

Description

Provide workflows and guidance for automatic translation of Markdown-based R content using DeepL API.

View Documentation

gigs

Assess Fetal, Newborn, and Child Growth with International Standards

Maintainer

Simon R Parker

Description

Convert between anthropometric measures and z-scores/centiles in multiple growth standards, and classify fetal, newborn, and child growth accordingly. With a simple interface to growth standards from the World Health Organisation and International Fetal and Newborn Growth Consortium for the 21st Century, gigs makes growth assessment easy and reproducible for clinicians, researchers and policy-makers.

View Documentation

commonmetar

Wraps Commonmeta For rOpenSci Blog's Needs

Maintainer

Maëlle Salmon

Description

Uses the commonmeta Go library to generate random DOI strings.

View Documentation

BaseSet

CRAN Peer-reviewed

Working with Sets the Tidy Way

Maintainer

Lluís Revilla Sancho

Description

Implements a class and methods to work with sets, doing intersection, union, complementary sets, power sets, cartesian product and other set operations in a “tidy” way. These set operations are available for both classical sets and fuzzy sets. Import sets from several formats or from other several data structures.

View Documentation

pkgreviewr

rOpenSci package review project template

Maintainer

Maëlle Salmon

Description

Creates files and collects materials necessary to complete an rOpenSci package review. Review files are prepopulated with review package specific metadata. Review package source code is also cloned for local testing and inspection.

View Documentation

roreviewapi

Staff maintained

Plumber API to report package structure and function

Maintainer

Mark Padgham

Description

Plumber API to report package structure and function.

View Documentation

tic

CI-Agnostic Workflow Definitions

Maintainer

Eli Miller

Description

Provides a way to describe common build and deployment workflows for R-based projects: packages, websites (e.g. blogdown, pkgdown), or data processing (e.g. research compendia). The recipe is described independent of the continuous integration tool used for processing the workflow (e.g. GitHub Actions or Circle CI). This package has been peer-reviewed by rOpenSci (v0.3.0.9004).

View Documentation

stantargets

Targets for Stan Workflows

Maintainer

William Michael Landau

Description

Bayesian data analysis usually incurs long runtimes and cumbersome custom code. A pipeline toolkit tailored to Bayesian statisticians, the stantargets R package leverages targets and cmdstanr to ease these burdens. stantargets makes it super easy to set up scalable Stan pipelines that automatically parallelize the computation and skip expensive steps when the results are already up to date. Minimal custom code is required, and there is no need to manually configure branching, so usage is much easier than targets alone. stantargets can access all of cmdstanrs major algorithms (MCMC, variational Bayes, and optimization) and it supports both single-fit workflows and multi-rep simulation studies. For the statistical methodology, please refer to Stan’ documentation (Stan Development Team 2020) https://mc-stan.org/.

View Documentation

dendroNetwork

CRAN Peer-reviewed

Create Networks of Dendrochronological Series using Pairwise Similarity

Maintainer

Ronald Visser

Description

Creating dendrochronological networks based on the similarity between tree-ring series or chronologies. The package includes various functions to compare tree-ring curves building upon the dplR package. The networks can be used to visualise and understand the relations between tree-ring curves. These networks are also very useful to estimate the provenance of wood as described in Visser (2021) DOI:10.5334/jcaa.79 or wood-use within a structure/context/site as described in Visser and Vorst (2022) DOI:10.1163/27723194-bja10014.

View Documentation

jsonvalidate

Validate JSON Schema

Maintainer

Rich FitzJohn

Description

Uses the node library is-my-json-valid or ajv to validate JSON against a JSON schema. Drafts 04, 06 and 07 of JSON schema are supported.

View Documentation

babelquarto

Staff maintained

Renders a Multilingual Quarto Book

Maintainer

Maëlle Salmon

Description

Automate rendering and cross-linking of Quarto books following a prescribed structure.

View Documentation

universe

Staff maintained

//r-universe.dev>

Maintainer

Jeroen Ooms

Description

Utilities to interact with the R-universe platform. Includes functions to manage local package repositories, as well as API wrappers for retrieving data and metadata about packages in r-universe.

View Documentation

charlatan

CRAN Peer-reviewed

Make Fake Data

Maintainer

Roel M. Hogervorst

Description

Make fake data that looks realistic, supporting addresses, person names, dates, times, colors, coordinates, currencies, digital object identifiers (DOIs), jobs, phone numbers, DNA sequences, doubles and integers from distributions and within a range.

View Documentation

icepalace

Snapshot Current Versions of CRAN-like Repositories

Maintainer

Maëlle Salmon

Description

What the package does (one paragraph).

View Documentation

quartificate

Transform Google Docs into Quarto Books

Maintainer

Maëlle Salmon

Description

Automate the Transformation of a Google Document into a Quarto Book source.

View Documentation

rotemplate

Staff maintained

pkgdown template and utilities for rOpenSci docs

Maintainer

Maëlle Salmon

Description

This is a private template for use by rOpenSci packages. Please don’t use it for your own non-rOpenSci package.

View Documentation

roblog

Staff maintained

rOpenSci's blog guidance

Maintainer

Maëlle Salmon

Description

It provides templates for roweb2 blogging and help for a GitHub forking workflow.

View Documentation

aeolus

Unleash Useful Linebreaks in Markdown Documents

Maintainer

Maëlle Salmon

Description

Add linebreaks at the end of sentences and remove other linebreaks.

View Documentation

agroclimatico

Índices y Estadísticos Climáticos e Hidrológicos

Maintainer

Paola Corrales

Description

Conjunto de funciones para calcular índices y estadísticos climáticos hidrológicos a partir de datos tidy. Incluye una función para graficar resultados georeferenciados y e información cartográfica.

View Documentation

skimr

CRAN Peer-reviewed

Compact and Flexible Summaries of Data

Maintainer

Elin Waring

Description

A simple to use summary function that can be used with pipes and displays nicely in the console. The default summary statistics may be modified by the user as can the default formatting. Support for data frames and vectors is included, and users can implement their own skim methods for specific object types as described in a vignette. Default summaries include support for inline spark graphs. Instructions for managing these on specific operating systems are given in the “Using skimr” vignette and the README.

View Documentation
Scientific use cases

Sinval, J., Marques-Pinto, A., Queirós, C., & Marôco, J. (2018). Work Engagement among Rescue Workers: Psychometric Properties of the Portuguese UWES. Frontiers in Psychology, 8. https://doi.org/10.3389/fpsyg.2017.02229
Sinval, J., Pasian, S., Queirós, C., & Marôco, J. (2018). Brazil-Portugal Transcultural Adaptation of the UWES-9: Internal Consistency, Dimensionality, and Measurement Invariance. Frontiers in Psychology, 9. https://doi.org/10.3389/fpsyg.2018.00353
Almeida, L. S., Pérez Fuentes, M. del C., Casanova, J. R., Gázquez Linares, J. J., & Molero Jurado, M. del M. (2018). Alcohol Expectancy-Adolescent Questionnaire (AEQ-AB): Validation for portuguese college students. Health and Addictions/Salud y Drogas, 18(2), 155. https://doi.org/10.21134/haaj.v18i2.389
António, N., de Almeida, A., & Nunes, L. (2018). Hotel booking demand datasets. Data in Brief. https://doi.org/10.1016/j.dib.2018.11.126
Sinval, J., Casanova, J. R., Marôco, J., & Almeida, L. S. (2018). University student engagement inventory (USEI): Psychometric properties. Current Psychology. https://doi.org/10.1007/s12144-018-0082-6
Rodrigues, S., Sinval, J., Queirós, C., Marôco, J., & Kaiseler, M. (2019). Transitioning from recruit to officer: An investigation of how stress appraisal and coping influence work engagement. International Journal of Selection and Assessment. https://doi.org/10.1111/ijsa.12238
Sinval, J., Sirgy, M. J., Lee, D.-J., & Marôco, J. (2019). The Quality of Work Life Scale: Validity Evidence from Brazil and Portugal. Applied Research in Quality of Life. https://doi.org/10.1007/s11482-019-09730-3
Nalborczyk, L., Grandchamp, R., Koster, E. H. W., Perrone-Bertolotti, M., & Loevenbruck, H. (2019). Can we decode phonetic features in inner speech using surface electromyography? https://doi.org/10.31234/osf.io/8v5yd
Correia, C. N., McLoughlin, K. E., Nalpas, N. C., Magee, D. A., Browne, J. A., Rue-Albrecht, K., … MacHugh, D. E. (2018). RNA Sequencing (RNA-Seq) Reveals Extremely Low Levels of Reticulocyte-Derived Globin Gene Transcripts in Peripheral Blood From Horses (Equus caballus) and Cattle (Bos taurus). Frontiers in Genetics, 9. https://doi.org/10.3389/fgene.2018.00278
Long, J. D., & Turner, D. (2020). Applied R in the Classroom. Australian Economic Review, 53(1), 139–157. https://doi.org/10.1111/1467-8462.12362
Sinval, J., & Marôco, J. (2020). Short Index of Job Satisfaction: Validity evidence from Portugal and Brazil. PLOS ONE, 15(4), e0231474. https://doi.org/10.1371/journal.pone.0231474
Lam, K.-L., Cheng, W.-Y., Su, Y., Li, X., Wu, X., Wong, K.-H., … Cheung, P. C.-K. (2020). Use of random forest analysis to quantify the importance of the structural characteristics of beta-glucans for prebiotic development. Food Hydrocolloids, 108, 106001. https://doi.org/10.1016/j.foodhyd.2020.106001
McKnelly, K. J., Howitz, W. J., Lam, S., & Link, R. D. (2020). Extraction on Paper Activity: An Active Learning Technique to Facilitate Student Understanding of Liquid–Liquid Extraction. Journal of Chemical Education, 97(7), 1960–1965. https://doi.org/10.1021/acs.jchemed.9b00975
Behrendt, I., Fasshauer, M., & Eichner, G. (2020). Gluten intake and metabolic health: conflicting findings from the UK Biobank. European Journal of Nutrition. https://doi.org/10.1007/s00394-020-02351-9
Aragão e Pina, J., Passos, A. M., Maynard, M. T., & Sinval, J. (2021). Self-efficacy, mental models and team adaptation: A first approach on football and futsal refereeing. Psychology of Sport and Exercise, 52, 101787. https://doi.org/10.1016/j.psychsport.2020.101787
España, S., Ochoa de Olza, M., Sala, N., Piulats, J. M., Ferrandiz, U., Etxaniz, O., … Font, A. (2020). PSA Kinetics as Prognostic Markers of Overall Survival in Patients with Metastatic Castration-Resistant Prostate Cancer Treated with Abiraterone Acetate. Cancer Management and Research, Volume 12, 10251–10260. https://doi.org/10.2147/cmar.s270392
Wadley, A. L., Venter, W. D. F., Moorhouse, M., Akpomiemie, G., Serenata, C., Hill, A., … Kamerman, P. R. (2020). High individual pain variability in people living with HIV: A graphical analysis. European Journal of Pain, 25(1), 160–170. https://doi.org/10.1002/ejp.1658
Wadley AL, Venter WDF, Moorhouse M, Akpomiemie G, Serenata C, Hill A, Sokhela S, Mqamelo N, Kamerman PR. High individual pain variability in people living with HIV: A graphical analysis. Eur J Pain 2020. https://doi.org/10.1002/ejp.1658
Schrag, N. F. D., Apley, M. D., Godden, S. M., Lubbers, B. V., & Singer, R. S. (2020). Antimicrobial use quantification in adult dairy cows – Part 1 – Standardized regimens as a method for describing antimicrobial use. Zoonoses and Public Health, 67(S1), 51–68. https://doi.org/10.1111/zph.12766
Nopp-Mayr, U., Reimoser, S., Reimoser, F., Sachser, F., Obermair, L., & Gratzer, G. (2020). Analyzing long-term impacts of ungulate herbivory on forest-recruitment dynamics at community and species level contrasting tree densities versus maximum heights. Scientific Reports, 10(1). https://doi.org/10.1038/s41598-020-76843-3
Behrendt, I., Fasshauer, M., & Eichner, G. (2020). Gluten Intake and All-Cause and Cause-Specific Mortality: Prospective Findings from the UK Biobank. The Journal of Nutrition, 151(3), 591–597. https://doi.org/10.1093/jn/nxaa387

hoardr

Manage Cached Files

Maintainer

Tamás Stirling

Description

Suite of tools for managing cached files, targeting use in other R packages. Uses rappdirs for cross-platform paths. Provides utilities to manage cache directories, including targeting files by path or by key; cached directories can be compressed and uncompressed easily to save disk space.

View Documentation

circle

CRAN Peer-reviewed

R Client Package for Circle CI

Maintainer

Patrick Schratz

Description

Tools for interacting with the Circle CI API (https://circleci.com/docs/api/v2/). Besides executing common tasks such as querying build logs and restarting builds, this package also helps setting up permissions to deploy from builds.

View Documentation

prismjs

CRAN Staff maintained

Server-Side Syntax Highlighting

Maintainer

Jeroen Ooms

Description

Prism https://prismjs.com/ is a lightweight, extensible syntax highlighter, built with modern web standards in mind. This package provides server-side rendering in R using V8 such that no JavaScript library is required in the resulting HTML documents. Over 400 languages are supported.

View Documentation

rzmq

CRAN Staff maintained

R Bindings for ZeroMQ

Maintainer

Jeroen Ooms

Description

Interface to the ZeroMQ lightweight messaging kernel (see https://zeromq.org/ for more information).

View Documentation

sodium

CRAN Staff maintained

A Modern and Easy-to-Use Crypto Library

Maintainer

Jeroen Ooms

Description

Bindings to libsodium https://doc.libsodium.org/: a modern, easy-to-use software library for encryption, decryption, signatures, password hashing and more. Sodium uses curve25519, a state-of-the-art Diffie-Hellman function by Daniel Bernstein, which has become very popular after it was discovered that the NSA had backdoored Dual EC DRBG.

View Documentation

drake

CRAN Peer-reviewed

A Pipeline Toolkit for Reproducible Computation at Scale

Maintainer

William Michael Landau

Description

A general-purpose computational engine for data analysis, drake rebuilds intermediate data objects when their dependencies change, and it skips work when the results are already up to date. Not every execution starts from scratch, there is native support for parallel and distributed computing, and completed projects have tangible evidence that they are reproducible. Extensive documentation, from beginner-friendly tutorials to practical examples and more, is available at the reference website https://docs.ropensci.org/drake/ and the online manual https://books.ropensci.org/drake/.

View Documentation

jagstargets

CRAN Peer-reviewed

Targets for JAGS Pipelines

Maintainer

William Michael Landau

Description

Bayesian data analysis usually incurs long runtimes and cumbersome custom code. A pipeline toolkit tailored to Bayesian statisticians, the jagstargets R package is leverages targets and R2jags to ease this burden. jagstargets makes it super easy to set up scalable JAGS pipelines that automatically parallelize the computation and skip expensive steps when the results are already up to date. Minimal custom code is required, and there is no need to manually configure branching, so usage is much easier than targets alone. For the underlying methodology, please refer to the documentation of targets doi:10.21105/joss.02959 and JAGS (Plummer 2003) https://www.r-project.org/conferences/DSC-2003/Proceedings/Plummer.pdf.

View Documentation

RefManageR

CRAN Peer-reviewed

Straightforward BibTeX and BibLaTeX Bibliography Management

Maintainer

Mathew W. McLean

Description

Provides tools for importing and working with bibliographic references. It greatly enhances the bibentry class by providing a class BibEntry which stores BibTeX and BibLaTeX references, supports UTF-8 encoding, and can be easily searched by any field, by date ranges, and by various formats for name lists (author by last names, translator by full names, etc.). Entries can be updated, combined, sorted, printed in a number of styles, and exported. BibTeX and BibLaTeX .bib files can be read into R and converted to BibEntry objects. Interfaces to NCBI Entrez, CrossRef, and Zotero are provided for importing references and references can be created from locally stored PDF files using Poppler. Includes functions for citing and generating a bibliography with hyperlinks for documents prepared with RMarkdown or RHTML.

View Documentation

goodpractice

CRAN Staff maintained

Advice on R Package Building

Maintainer

Mark Padgham

Description

Give advice about good practices when building R packages. Advice includes functions and syntax to avoid, package structure, code complexity, code formatting, etc.

View Documentation

ruODK

An R Client for the ODK Central API

Maintainer

Florian W. Mayer

Description

Access and tidy up data from the ODK Central API. ODK Central is a clearinghouse for digitally captured data using ODK https://docs.getodk.org/central-intro/. It manages user accounts and permissions, stores form definitions, and allows data collection clients like ODK Collect to connect to it for form download and submission upload. The ODK Central API is documented at https://docs.getodk.org/central-api/.

View Documentation

gitignore

CRAN Peer-reviewed

Create Useful .gitignore Files for your Project

Maintainer

Philippe Massicotte

Description

Simple interface to query gitignore.io to fetch gitignore templates that can be included in the .gitignore file. More than 450 templates are currently available.

View Documentation

srr

Staff maintained

rOpenSci Review Roclets

Maintainer

Mark Padgham

Description

Companion package to rOpenSci statistical software review project.

View Documentation

plater

CRAN Peer-reviewed

Read, Tidy, and Display Data from Microtiter Plates

Maintainer

Sean Hughes

Description

Tools for interacting with data from experiments done in microtiter plates. Easily read in plate-shaped data and convert it to tidy format, combine plate-shaped data with tidy data, and view tidy data in plate shape.

View Documentation

autotest

Staff maintained

Automatic Package Testing

Maintainer

Mark Padgham

Description

Automatic testing of R packages via a simple YAML schema.

View Documentation

credentials

CRAN Staff maintained

Tools for Managing SSH and Git Credentials

Maintainer

Jeroen Ooms

Description

Setup and retrieve HTTPS and SSH credentials for use with git and other services. For HTTPS remotes the package interfaces the git-credential utility which git uses to store HTTP usernames and passwords. For SSH remotes we provide convenient functions to find or generate appropriate SSH keys. The package both helps the user to setup a local git installation, and also provides a back-end for git/ssh client libraries to authenticate with existing user credentials.

View Documentation

postdoc

CRAN Staff maintained

Minimal and Uncluttered Package Documentation

Maintainer

Jeroen Ooms

Description

Generates simple and beautiful one-page HTML reference manuals with package documentation. Math rendering and syntax highlighting are done server-side in R such that no JavaScript libraries are needed in the browser, which makes the documentation portable and fast to load.

View Documentation

DataPackageR

CRAN Peer-reviewed

Construct Reproducible Analytic Data Sets as R Packages

Maintainer

Dave Slager

Description

A framework to help construct R data packages in a reproducible manner. Potentially time consuming processing of raw data sets into analysis ready data sets is done in a reproducible manner and decoupled from the usual R CMD build process so that data sets can be processed into R objects in the data package and the data package can then be shared, built, and installed by others without the need to repeat computationally costly data processing. The package maintains data provenance by turning the data processing scripts into package vignettes, as well as enforcing documentation and version checking of included data objects. Data packages can be version controlled on GitHub, and used to share data for manuscripts, collaboration and reproducible research.

View Documentation
Scientific use cases

Finak, G., Mayer, B., Fulp, W., Obrecht, P., Sato, A., Chung, E., … Gottardo, R. (2018). DataPackageR: Reproducible data preprocessing, standardization and sharing using R/Bioconductor for collaborative data analysis. Gates Open Research, 2, 31. https://doi.org/10.12688/gatesopenres.12832.2

jenkins

Staff maintained

Simple Jenkins Client for R

Maintainer

Jeroen Ooms

Description

Manage jobs and builds on your Jenkins CI server https://jenkins.io/. Create and edit projects, schedule builds, manage the queue, download build logs, and much more.

View Documentation

mauricer

CRAN Peer-reviewed

Work with BEAST2 Packages

Maintainer

Richèl J.C. Bilderbeek

Description

BEAST2 (https://www.beast2.org) is a widely used Bayesian phylogenetic tool, that uses DNA/RNA/protein data and many model priors to create a posterior of jointly estimated phylogenies and parameters. BEAST2 is commonly accompanied by BEAUti 2 (https://www.beast2.org), which, among others, allows one to install BEAST2 package. This package allows to work with BEAST2 packages from R.

View Documentation

nlrx

CRAN Peer-reviewed

Setup, Run and Analyze NetLogo Model Simulations from R via XML

Maintainer

Sebastian Hanss

Description

Setup, run and analyze NetLogo (https://ccl.northwestern.edu/netlogo/) model simulations in R. nlrx experiments use a similar structure as NetLogos Behavior Space experiments. However, nlrx offers more flexibility and additional tools for running and analyzing complex simulation designs and sensitivity analyses. The user defines all information that is needed in an intuitive framework, using class objects. Experiments are submitted from R to NetLogo via XML files that are dynamically written, based on specifications defined by the user. By nesting model calls in future environments, large simulation design with many runs can be executed in parallel. This also enables simulating NetLogo experiments on remote high performance computing machines. In order to use this package, Java and NetLogo (>= 5.3.1) need to be available on the executing system.

View Documentation
Scientific use cases

Kaaronen, R. O., & Strelkovskii, N. (2019). Cultural Evolution of Sustainable Behaviours: Pro-Environmental Tipping Points in an Agent-Based Model. https://doi.org/10.31234/osf.io/w6dpa
Wesener, F., Szymczak, A., Rillig, M. C., & Tietjen, B. (2020). Stress priming affects fungal competition – evidence from a combined experimental and modeling study. https://doi.org/10.1101/2020.03.04.976357
Adams, R. I., Bhangar, S., Dannemiller, K. C., Eisen, J. A., Fierer, N., Gilbert, J. A., … Bibby, K. (2016). Ten questions concerning the microbiomes of buildings. Building and Environment, 109, 224–234. https://doi.org/10.1016/j.buildenv.2016.09.001
D’Orazio, M., Bernardini, G., & Quagliarini, E. (2020). Sustainable and resilient strategies for touristic cities against COVID-19: an agent-based approach. arXiv preprint arXiv:2005.12547. https://arxiv.org/pdf/2005.12547.pdf
Kopp, T., & Salecker, J. (2020). How traders influence their neighbours: Modelling social evolutionary processes and peer effects in agricultural trade networks. Journal of Economic Dynamics and Control, 117, 103944. https://doi.org/10.1016/j.jedc.2020.103944
Azizi, A., Mubayi, A., & Mubayi, A. (2020). The Impact of Individual’s Ecological Factors on the Dynamics of Alcohol Drinking among Arizona State University Students: An Application of the Survey Data-driven Agent-based Model. arXiv preprint arXiv:2011.01876 https://arxiv.org/abs/2011.01876.
Widyastuti, K., Imron, M. A., Pradopo, S. T., Suryatmojo, H., Sopha, B. M., Spessa, A., & Berger, U. (2020). PeatFire: an agent-based model to simulate fire ignition and spreading in a tropical peatland ecosystem. International Journal of Wildland Fire. https://doi.org/10.1071/wf19213
Dahirel, M., Bertin, A., Haond, M., Blin, A., Lombaert, E., Calcagno, V., … Vercken, E. (2020). Shifts from pulled to pushed range expansions caused by reduction of landscape connectivity. https://doi.org/10.1101/2020.05.13.092775
Ghoreishi, M., Razavi, S., & Elshorbagy, A. (2021). Understanding human adaptation to drought: agent-based agricultural water demand modeling in the Bow River Basin, Canada. Hydrological Sciences Journal, 66(3), 389–407. doi:10.1080/02626667.2021.1873344

mcbette

CRAN Peer-reviewed

Model Comparison Using babette

Maintainer

Richèl J.C. Bilderbeek

Description

BEAST2 (https://www.beast2.org) is a widely used Bayesian phylogenetic tool, that uses DNA/RNA/protein data and many model priors to create a posterior of jointly estimated phylogenies and parameters. mcbette allows to do a Bayesian model comparison over some site and clock models, using babette (https://github.com/ropensci/babette/).

View Documentation

karel

CRAN Peer-reviewed

Learning programming with Karel the robot

Maintainer

Marcos Prunello

Description

This is the R implementation of Karel the robot, a programming language created by Dr. R. E. Pattis at Stanford University in 1981. Karel is an useful tool to teach introductory concepts about general programming, such as algorithmic decomposition, conditional statements, loops, etc., in an interactive and fun way, by writing programs to make Karel the robot achieve certain tasks in the world she lives in. Originally based on Pascal, Karel was implemented in many languages through these decades, including Java, C++, Ruby and Python. This is the first package implementing Karel in R.

View Documentation

gittargets

CRAN Peer-reviewed

Data Version Control for the Targets Package

Maintainer

William Michael Landau

Description

In computationally demanding data analysis pipelines, the targets R package (2021, doi:10.21105/joss.02959) maintains an up-to-date set of results while skipping tasks that do not need to rerun. This process increases speed and increases trust in the final end product. However, it also overwrites old output with new output, and past results disappear by default. To preserve historical output, the gittargets package captures version-controlled snapshots of the data store, and each snapshot links to the underlying commit of the source code. That way, when the user rolls back the code to a previous branch or commit, gittargets can recover the data contemporaneous with that commit so that all targets remain up to date.

View Documentation

osfr

CRAN Peer-reviewed

Interface to the Open Science Framework (OSF)

Maintainer

Aaron Wolen

Description

An interface for interacting with OSF (https://osf.io). osfr enables you to access open research materials and data, or create and manage your own private or public projects.

View Documentation
Scientific use cases

Corput, D. V. D. (2020). Locked in Syndrome Machine Learning Classification using Sentence Comprehension EEG Data. arXiv preprint arXiv:2006.12336 https://arxiv.org/pdf/2006.12336.pdf

r2readthedocs

Staff maintained

Convert R Package Documentation to a readthedocs Website

Maintainer

Mark Padgham

Description

Convert R package documentation to a readthedocs website.

View Documentation

fellingdater

Estimate, report and combine felling dates of historical tree-ring series

Maintainer

Kristof Haneca

Description

fellingdater is an R package that aims to facilitate the analysis and interpretation of tree-ring data from wooden cultural heritage objects and structures. The package standardizes the process of computing and combining felling date estimates, both for individual and groups of related tree-ring series.

View Documentation

assertr

CRAN Peer-reviewed

Assertive Programming for R Analysis Pipelines

Maintainer

Tony Fischetti

Description

Provides functionality to assert conditions that have to be met so that errors in data used in analysis pipelines can fail quickly. Similar to stopifnot() but more powerful, friendly, and easier for use in pipelines.

View Documentation
Scientific use cases

Petersen, A. H., & Ekstrøm, C. T. (2019). dataMaid: Your Assistant for Documenting Supervised Data Quality Screening in R. Journal of Statistical Software, 90(6). https://doi.org/10.18637/jss.v090.i06
van der Loo, M. P., & de Jonge, E. (2019). Data Validation Infrastructure for R. arXiv preprint arXiv:1912.09759. https://arxiv.org/pdf/1912.09759.pdf
Brick, C., McDowell, M., & Freeman, A. L. J. (2020). Risk communication in tables versus text: a registered report randomized trial on “fact boxes.” Royal Society Open Science, 7(3), 190876. https://doi.org/10.1098/rsos.190876
Goel, A., & Vitek, J. (2019). On the design, implementation, and use of laziness in R. Proceedings of the ACM on Programming Languages, 3(OOPSLA), 1–27. doi:10.1145/3360579

tokenizers

CRAN Peer-reviewed

Fast, Consistent Tokenization of Natural Language Text

Maintainer

Thomas Charlon

Description

Convert natural language text into tokens. Includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs, characters, shingled characters, lines, Penn Treebank, regular expressions, as well as functions for counting characters, words, and sentences, and a function for splitting longer texts into separate documents, each with the same number of words. The tokenizers have a consistent interface, and the package is built on the stringi and Rcpp packages for fast yet correct tokenization in UTF-8.

View Documentation
Scientific use cases

A. Mullen, L., Benoit, K., Keyes, O., Selivanov, D., & Arnold, J. (2018). Fast, Consistent Tokenization of Natural Language Text. Journal of Open Source Software, 3(23), 655. https://doi.org/10.21105/joss.00655
Pajo, J. (2018). Quantitative Falsification for Qualitative Findings. Social Science Computer Review, 089443931876795. https://doi.org/10.1177/0894439318767956
Casey, Jerome (2018). Text Analytics Techniques in the Digital World: a Sentiment Analysis Case Study of the Coverage of Climate Change on US News Networks. Irish Communication Review: Vol. 16: Iss. 1, Article 7. https://arrow.dit.ie/icr/vol16/iss1/7
Gye-Soo, K. 2018. Text Mining and Big Data Analysis in the Relational Database with R. International Journal of Trend in Research and Development. 4(5): 384-386. http://www.ijtrd.com/papers/IJTRD12170.pdf
Ficcadenti, V., Cerqueti, R., & Ausloos, M. (2019). A joint text mining-rank size investigation of the rhetoric structures of the US Presidents’ speeches. Expert Systems with Applications. https://doi.org/10.1016/j.eswa.2018.12.049
Calderone, A. (2019). A Computational Analysis of Natural Languages to Build a Sentence Structure Aware Artificial Neural Network. arXiv preprint arXiv:1906.05491 https://arxiv.org/pdf/1906.05491.pdf
Ulibarri, N., & Scott, T. A. (2019). Environmental hazards, rigid institutions, and transformative change: How drought affects the consideration of water and climate impacts in infrastructure management. Global Environmental Change, 59, 102005. https://doi.org/10.1016/j.gloenvcha.2019.102005
Claes, M., & Mäntylä, M. (2020). 20-MAD–20 Years of Issues and Commits of Mozilla and Apache Development. arXiv preprint arXiv:2003.14015. https://arxiv.org/pdf/2003.14015.pdf
Scott, T. A., Ulibarri, N., & Perez Figueroa, O. (2020). NEPA and National Trends in Federal Infrastructure Siting in the United States. Review of Policy Research. https://doi.org/10.1111/ropr.12399
Grassl, P., Schraffenberger, H., Zuiderveen Borgesius, F., & Buijzen, M. (2020, July 21). Dark and bright patterns in cookie consent requests. https://doi.org/10.31234/osf.io/gqs5h
López Galán, A., Chung, W.-S., & Marshall, N. J. (2020). Dynamic Courtship Signals and Mate Preferences in Sepia plangon. Frontiers in Physiology, 11. https://doi.org/10.3389/fphys.2020.00845
Brandão, L. A. C., Agrelli, A., Bernardo, L., Paparella, F., Moura, R., & Crovella, S. (2020). PlatCOVID: A Novel Web Tool to Analyze, Curate and Share COVID-19 Literature. doi:10.21203/rs.3.rs-42169/v1

baRcodeR

CRAN Peer-reviewed

Label Creation for Tracking and Collecting Data from Biological Samples

Maintainer

Robert Colautti

Description

Tools to generate unique identifier codes and printable barcoded labels for the management of biological samples. The creation of unique ID codes and printable PDF files can be initiated by standard commands, user prompts, or through a GUI addin for R Studio. Biologically informative codes can be included for hierarchically structured sampling designs.

View Documentation
Scientific use cases

Walker, V. K., Das, P., Li, P., Lougheed, S. C., Moniz, K., Schott, S., … Koch, I. (2020). Identification of Arctic Food Fish Species for Anthropogenic Contaminant Testing Using Geography and Genetics. Foods, 9(12), 1824. https://doi.org/10.3390/foods9121824

birdsize

Estimate Avian Body Size Distributions

Maintainer

Renata Diaz

Description

Generate estimated body size distributions for populations or communities of birds, given either species ID or species’ mean body size. Designed to work naturally with the North American Breeding Bird Survey, or with any dataset of bird species, abundance, and/or mean size data.

View Documentation

tracerer

CRAN Peer-reviewed

Tracer from R

Maintainer

Richèl J.C. Bilderbeek

Description

BEAST2 (https://www.beast2.org) is a widely used Bayesian phylogenetic tool, that uses DNA/RNA/protein data and many model priors to create a posterior of jointly estimated phylogenies and parameters. Tracer (https://github.com/beast-dev/tracer/) is a GUI tool to parse and analyze the files generated by BEAST2. This package provides a way to parse and analyze BEAST2 input files without active user input, but using R function calls instead.

View Documentation

ezknitr

Avoid the Typical Working Directory Pain When Using knitr

Maintainer

Dean Attali

Description

An extension of knitr that adds flexibility in several ways. One common source of frustration with knitr is that it assumes the directory where the source file lives should be the working directory, which is often not true. ezknitr addresses this problem by giving you complete control over where all the inputs and outputs are, and adds several other convenient features to make rendering markdown/HTML documents easier.

View Documentation

gistr

Work with GitHub Gists

Maintainer

Scott Chamberlain

Description

Work with GitHub gists from R (e.g., https://en.wikipedia.org/wiki/GitHub#Gist, https://docs.github.com/en/github/writing-on-github/creating-gists/). A gist is simply one or more files with code/text/images/etc. This package allows the user to create new gists, update gists with new files, rename files, delete files, get and delete gists, star and un-star gists, fork gists, open a gist in your default browser, get embed code for a gist, list gist commits, and get rate limit information when authenticated. Some requests require authentication and some do not. Gists website: https://gist.github.com/.

View Documentation

datapack

A Flexible Container to Transport and Manipulate Data and Associated Resources

Maintainer

Matthew B. Jones

Description

Provides a flexible container to transport and manipulate complex sets of data. These data may consist of multiple data files and associated meta data and ancillary files. Individual data objects have associated system level meta data, and data files are linked together using the OAI-ORE standard resource map which describes the relationships between the files. The OAI- ORE standard is described at https://www.openarchives.org/ore/. Data packages can be serialized and transported as structured files that have been created following the BagIt specification. The BagIt specification is described at https://tools.ietf.org/html/draft-kunze-bagit-08.

View Documentation

chlorpromazineR

CRAN Peer-reviewed

Convert Antipsychotic Doses to Chlorpromazine Equivalents

Maintainer

Eric Brown

Description

As different antipsychotic medications have different potencies, the doses of different medications cannot be directly compared. Various strategies are used to convert doses into a common reference so that comparison is meaningful. Chlorpromazine (CPZ) has historically been used as a reference medication into which other antipsychotic doses can be converted, as “chlorpromazine-equivalent doses”. Using conversion keys generated from widely-cited scientific papers, e.g. Gardner et. al 2010 doi:10.1176/appi.ajp.2009.09060802 and Leucht et al. 2016 doi:10.1093/schbul/sbv167, antipsychotic doses are converted to CPZ (or any specified antipsychotic) equivalents. The use of the package is described in the included vignette. Not for clinical use.

View Documentation
Scientific use cases

Kim, J., Plitman, E., Iwata, Y., Nakajima, S., Mar, W., Patel, R., … Graff-Guerrero, A. (2020). Neuroanatomical profiles of treatment-resistance in patients with schizophrenia spectrum disorders. Progress in Neuro-Psychopharmacology and Biological Psychiatry, 99, 109839. https://doi.org/10.1016/j.pnpbp.2019.109839

outcomerate

CRAN Peer-reviewed

AAPOR Survey Outcome Rates

Maintainer

Rafael Pilliard Hellwig

Description

Standardized survey outcome rate functions, including the response rate, contact rate, cooperation rate, and refusal rate. These outcome rates allow survey researchers to measure the quality of survey data using definitions published by the American Association of Public Opinion Research (AAPOR). For details on these standards, see AAPOR (2016) https://www.aapor.org/Standards-Ethics/Standard-Definitions-(1).aspx.

View Documentation

Prev

Page 1 of 0

Next