Wednesday, October 23, 2024 From rOpenSci (https://ropensci.org/blog/2024/10/23/news-october-2024/). Except where otherwise noted, content on this site is licensed under the CC-BY license.
Dear rOpenSci friends, it’s time for our monthly news roundup!
You can read this post on our blog. Now let’s dive into the activity at and around rOpenSci!
Exciting News! The Chan Zuckerberg Initiative has awarded rOpenSci a new grant to foster sustainable scientific software as a pillar of Open Science in Latin America by building capacity and community. With this $340K grant, we’re planning to launch a Spanish-language version of our Champions Program, along with other new initiatives to make sustainable software development more accessible to researchers across the region. Read more in our announcement.
Each package page on R-universe now features a guided tour built with intro.js, so that you might get an overview of all information presented on package pages. To start the tour, click on the “Website Tour” button on the navbar – note that it does not work on mobile devices. Example: xml2 package.
rOpenSci Community Manager Yanina Bellini Saibene will deliver a keynote talk at BioNT Community Event & CarpentryConnect-Heidelberg 2024, on November 14th in Heidelberg, Germany.
Several members of the rOpenSci community will be part of this year’s LatinR—Latin American Conference About the Use of R in R&D!
Will Landau and Julia Silge are the keynote speakers of the conference!
François Michonneau will teach the tutorial Working with larger than memory data in R with Arrow and DuckDB in English. Tuesday, November 19th. 10:00 to 12:00 UTC -3.
Beatriz Milz and Yani Bellini Saibene tutorial Introduccion a Github para Investigador(a|e)s in Spanish. Monday, November 18th. 14:00 to 17:00 UTC -3.
rOpenSci also organizes one of the satellite event Tradução + Hackathon = Traslaton rOpenSci with Pedro Faria, Francesca Palmeira, Beatriz Milz, and Yani Bellini Saibene: we will get together to learn about rOpenSci and how to contribute to the Portuguese translation project. Monday, November 18th. 10:00 to 12:00 UTC-3. (find your time)
Check the full list of talks and other events were rOpenSci members will be participating.
The eighth annual UK Research Software Engineering conference took place in Newcastle from September 3rd to 5th. Mark Padgham presented a talk on “rOpenSci statistical software peer-review: The importance and challenge of community engagement”. The talk was second in a two-part session devoted to software peer review, and followed a walk-through demonstration of the process of submissing to Journal of Open Source software. Find Mark’s slides.
A comunidade R fala português 🕓 Wednesday, 06 November 2024 13:00 UTC With Pedro Faria, Beatriz Milz, Francesca Belem Lopes Palmeira.
The R Community is carrying out several translations into Portuguese of different resources: books, packages and guides.
In this community call, three leaders from the Portuguese-speaking community will present their experiences leading and being part of these projects and share how we can participate in these efforts, including rOpenSci Developer Guide translation project.
This community call will be followed by a traslaton at LatinR, for those who want to contribute.
Making your first contribution to Open Source can be both empowering and intimidating. As such, we’re exited to announce a special series of activities to support first-time contributors to rOpenSci packages! 🎉
Are you an rOpenSci package maintainer who would like to help someone make their first contribution?
See our blog post for more details and how to sign up (by November 24).
Read all about coworking!
Join us for social coworking & office hours monthly on first Tuesdays! Hosted by Steffi LaZerte and various community hosts. Everyone welcome. No RSVP needed. Consult our Events page to find your local time and how to join.
Tuesday, November 5th, 09:00 Americas Pacific (17:00 UTC), “Getting to know Openscapes” with cohost Stefanie Butland and Steffi LaZerte.
Tuesday, December 3rd, 09:00 Australia Western (01:00 UTC), “Getting Involved in the Antarctic/Southern Ocean rOpenSci Community” with cohosts Michael Sumner and Ben Raymond, and Steffi LaZerte.
And remember, you can always cowork independently on work related to R, work on packages that tend to be neglected, or work on what ever you need to get done!
The following five packages recently became a part of our software suite:
pkgmatch, developed by Mark Padgham: Find R packages matching either descriptions or other R packages.
rsi, developed by Michael Mahoney: Downloads spatial data from spatiotemporal asset catalogs (STAC), computes standard spectral indices from the Awesome Spectral Indices project (Montero et al. (2023) doi:10.1038/s41597-023-02096-0) against raster data, and glues the outputs together into predictor bricks. Methods focus on interoperability with the broader spatial ecosystem; function arguments and outputs use classes from sf and terra, and data downloading functions support complex CQL2 queries using rstac. It is available on CRAN. It has been reviewed by Felipe Carvalho and Michael Sumner.
allcontributors, developed by Mark Padgham together with Chris Hartgerink: Acknowledge all contributors to a project via a single function call. The function appends to a README or other specified file(s) a table with names of all individuals who contributed via code or repository issues. The package also includes several additional functions to extract and quantify contributions to any repository. It is available on CRAN.
chopin, developed by Insang Song together with Kyle Messier: Geospatial data computation is parallelized by grid, hierarchy, or raster files. Based on future and mirai parallel backends, terra and sf functions as well as convenience functions in the package can be distributed over multiple threads. The simplest way of parallelizing generic geospatial computation is to start from par_pad_*
functions to par_grid
, par_hierarchy
, or par_multirasters
functions. Virtually any functions accepting classes in terra or sf packages can be used in the three parallelization functions. A common raster-vector overlay operation is provided as a function extract_at
, which uses exactextractr, with options for kernel weights for summarizing raster values at vector geometries. Other convenience functions for vector-vector operations including simple areal interpolation (summarize_aw
) and summation of exponentially decaying weights (summarize_sedc
) are also provided. It has been reviewed by Alec L. Robitaille and Eric R. Scott.
eDNAjoint, developed by Abigail G. Keller: Models integrate environmental DNA (eDNA) detection data and traditional survey data to jointly estimate species catch rate (see package vignette: https://ednajoint.netlify.app/). Models can be used with count data via traditional survey methods (i.e., trapping, electrofishing, visual) and replicated eDNA detection/nondetection data via polymerase chain reaction (i.e., PCR or qPCR) from multiple survey locations. Estimated parameters include probability of a false positive eDNA detection, a site-level covariates that scale the sensitivity of eDNA surveys relative to traditional surveys, and catchability coefficients for traditional gear types. Models are implemented with a Bayesian framework (Markov chain Monte Carlo) using the Stan probabilistic programming language. It is available on CRAN. It has been reviewed by Chitra M. Saraswati and Saras M. Windecker.
Discover more packages, read more about Software Peer Review.
The following eleven packages have had an update since the last newsletter: pkgmatch (v0.4.0
), rsi (v0.3.0
), charlatan (v0.6.1
), git2r (v0.35.0
), nodbi (v0.10.7
), rgbif (v3.8.1
), riem (v0.3.2
), ruODK (v1.5.0
), spatsoc (v0.2.6
), targets (1.8.0
), and tidyhydat (v0.7.0
).
There are thirteen recently closed and active submissions and 6 submissions on hold. Issues are at different stages:
One at ‘6/approved’:
One at ‘5/awaiting-reviewer(s)-response’:
Two at ‘4/review(s)-in-awaiting-changes’:
cancerprof, API Client for State Cancer Profiles. Submitted by Brian Park.
gigs, Assess Fetal, Newborn, and Child Growth with International Standards. Submitted by Simon Parker. (Stats).
Three at ‘3/reviewer(s)-assigned’:
emodnet.wfs, Access EMODnet Web Feature Service data through R. Submitted by Maëlle Salmon.
sits, Satellite Image Time Series Analysis for Earth Observation Data Cubes. Submitted by Gilberto Camara.
fwildclusterboot, Fast Wild Cluster Bootstrap Inference for Linear Models. Submitted by Alexander Fischer. (Stats).
Two at ‘2/seeking-reviewer(s)’:
QuadratiK, A Collection of Methods Using Kernel-Based Quadratic Distances for. Submitted by Giovanni Saraceno. (Stats).
galamm, Generalized Additive Latent and Mixed Models. Submitted by Øystein Sørensen. (Stats).
Four at ‘1/editor-checks’:
rredlist, IUCN Red List Client. Submitted by William Gearty.
forcis, An R Client to Access the FORCIS Database. Submitted by Nicolas Casajus.
fireexposuR, Compute and Visualize Wildfire Exposure. Submitted by Air Forbes.
mbquartR, Finding Manitoba Quarter Sections. Submitted by Alex Koiter.
Find out more about Software Peer Review and how to get involved.
Fostering Open Science in Latin America: CZI Awards Funds for Sustainable Research Software Development by Yanina Bellini Saibene. Foster sustainable scientific software as a pillar of open science in Latin America through building capacity and community. Other languages: Fomentando la Ciencia Abierta en América Latina: CZI otorga fondos para el desarrollo sostenible de software de investigación (es).
Looking for Maintainers to Support First-Time Contributors by Steffi LaZerte and Yanina Bellini Saibene. Announcing a Community Call and Coworking sessions to support first contributions.
If you’re interested in maintaining any of the R packages below, you might enjoy reading our blog post What Does It Mean to Maintain a Package?.
USAboundaries (and USAboundariesdata), historical and contemporary boundaries of the United States of America . Issue for volunteering.
historydata, datasets for historians. Issue for volunteering.
Also refer to our help wanted page – before opening a PR, we recommend asking in the issue whether help is still needed.
Some useful tips for R package developers. 👀
Want to add more variation to your test cases? Don’t miss Etienne Bacher’s post Using property-based testing in R where he explains “Property-based testing is a way of testing where we give random inputs to the function we want to test and we want to ensure that no matter the inputs, the output will respect some properties.”. The post includes examples.
If you need to compare versions in your code, for instance installed version and version at which a feature was added, these two functions might prove handy:
numeric_version()
from base R that, given a string representation of a version, creates an object you can for instance compare. Example usage in the greta R package.utils::compareVersion()
that compares string representations of versions. Example usage in the targets R package.In a blog post, Maëlle Salmon describes how to create and use a simple custom roxygen2 tag that adds a section to the Rd file.
Thanks for reading! If you want to get involved with rOpenSci, check out our Contributing Guide that can help direct you to the right place, whether you want to make code contributions, non-code contributions, or contribute in other ways like sharing use cases. You can also support our work through donations.
If you haven’t subscribed to our newsletter yet, you can do so via a form. Until it’s time for our next newsletter, you can keep in touch with us via our website and Mastodon account.