rOpenSci | Data Publication

Document and Release Your Data
frictionless
CRAN Peer-reviewed

Read and Write Frictionless Data Packages

Peter Desmet
Description

Read and write Frictionless Data Packages. A Data Package (https://specs.frictionlessdata.io/data-package/) is a simple container format and standard to describe and package a collection of (tabular) data. It is typically used to publish FAIR (https://www.go-fair.org/fair-principles/) and open datasets.
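
To make the "container format" idea concrete, here is a minimal sketch of what a Data Package descriptor (the `datapackage.json` file) can look like, built with Python's standard library. The package name, file path, and field names are hypothetical and chosen only for illustration; the frictionless R package itself is not used here.

```python
import json

# Hypothetical dataset: the name, path, and fields below are illustrative
# assumptions, not taken from any real package.
descriptor = {
    "name": "example-package",
    "resources": [
        {
            "name": "observations",
            "path": "observations.csv",
            "profile": "tabular-data-resource",
            # The table schema describes each column of the CSV file.
            "schema": {
                "fields": [
                    {"name": "id", "type": "integer"},
                    {"name": "species", "type": "string"},
                    {"name": "count", "type": "integer"},
                ]
            },
        }
    ],
}

# Serialize the descriptor as it would appear in datapackage.json.
datapackage_json = json.dumps(descriptor, indent=2)
print(datapackage_json)
```

The descriptor sits next to the data files it describes, so a reader of the package can discover the tables and their column types without opening the CSV files.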

Generate Citation File Format (cff) Metadata for R Packages

Diego Hernangómez
Description

The Citation File Format version 1.2.0 (doi:10.5281/zenodo.5171937) is a human- and machine-readable file format that provides citation metadata for software. This package provides core utilities to generate and validate this metadata.

Generate CodeMeta Metadata for R Packages

Carl Boettiger
Description

The CodeMeta Project defines a JSON-LD format for describing software metadata, as detailed at https://codemeta.github.io. This package provides utilities to generate, parse, and modify codemeta.json files automatically for R packages, as well as tools and examples for working with codemeta.json and JSON-LD more generally.
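
As a rough illustration of the JSON-LD format described above, the following sketch assembles a small codemeta.json document by hand in Python. The software name, version, and author are hypothetical placeholders, and the set of properties shown is only a small assumed subset; in practice the R package generates this file from package metadata.

```python
import json

# Hypothetical software metadata: every value below is a placeholder.
codemeta = {
    # JSON-LD context identifying the CodeMeta 2.0 vocabulary.
    "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
    "@type": "SoftwareSourceCode",
    "name": "examplepkg",
    "version": "0.1.0",
    "programmingLanguage": {"@type": "ComputerLanguage", "name": "R"},
    "author": [
        {"@type": "Person", "givenName": "Ada", "familyName": "Example"}
    ],
}

# Serialize the metadata as it would appear in codemeta.json.
codemeta_json = json.dumps(codemeta, indent=2)
print(codemeta_json)
```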

Managing Larger Data on a GitHub Repository

Carl Boettiger
Description

Helps store files as GitHub release assets, which is a convenient way for large/binary data files to piggyback onto public and private GitHub repositories. Includes functions for file downloads, uploads, and managing releases via the GitHub API.

Scientific use cases
  1. Boettiger, C. (2018). Managing Larger Data on a GitHub Repository. Journal of Open Source Software, 3(29), 971. https://doi.org/10.21105/joss.00971
RNeXML
CRAN

Semantically Rich I/O for the NeXML Format

Carl Boettiger
Description

Provides access to phyloinformatic data in NeXML format. The package aims to add new functionality to R, such as manipulating NeXML objects in more varied and refined ways and compatibility with ape objects.

Scientific use cases
  1. Stöver, B. C., Wiechers, S., & Müller, K. F. (2019). JPhyloIO: a Java library for event-based reading and writing of different phylogenetic file formats through a common interface. BMC Bioinformatics, 20(1). https://doi.org/10.1186/s12859-019-2982-3

A universal client for depositing and accessing research data anywhere

Mark Padgham
Description

A universal client for depositing and accessing research data anywhere. The currently supported services are Zenodo and Figshare.

Generate CRediT Author Statements

Josep Pueyo-Ros
Description

A tiny package to generate CRediT author statements (https://credit.niso.org/). It provides three functions: one to create a template, one to read it back, and one to generate the CRediT author statement as a text file.

EML
CRAN

Read and Write Ecological Metadata Language Files

Carl Boettiger
Description

Work with Ecological Metadata Language (EML) files. EML is a widely used metadata standard in the ecological and environmental sciences, described in Jones et al. (2006), doi:10.1146/annurev.ecolsys.37.091305.110031.

dataspice

Create Lightweight Schema.org Descriptions of Data

Bryce Mecum
Description

The goal of dataspice is to make it easier for researchers to create basic, lightweight, and concise metadata files for their datasets. These basic files can then be used to make useful information available during analysis, create a helpful dataset “README” webpage, and produce more complex metadata formats to aid dataset discovery. Metadata fields are based on the Schema.org and Ecological Metadata Language standards.
