Thursday, March 27, 2025 From rOpenSci (https://ropensci.org/blog/2025/03/27/news-march-2025/). Except where otherwise noted, content on this site is licensed under the CC-BY license.
Dear rOpenSci friends, it’s time for our monthly news roundup! You can read this post on our blog. Now let’s dive into the activity at and around rOpenSci!
We have great news: The call for applications to be part of the new cohort of our 2025 Program is now open! And for the first time it will be in Spanish!
Our program seeks to identify, recognize and reward people who are leaders in an open science community, research software engineering and the R programming community.
This year’s program is focused on people from Latin America and for the first time will be conducted entirely in Spanish. The main goal is to foster sustainable research software as a pillar of Open Science in Latin America through capacity and community building.
Find out more in our call for applications open until Wednesday, April 30, 2025.
Thanks to funding by Google Season of Docs, we were able to start a new comprehensive documentation project for all users and developers of R-universe. We established a central place where we collect the various sources of information and describe examples and use cases, using popular authoring tools to support collective maintenance.
Read more in our blog post, read the documentation website.
The NumFOCUS Diversity & Inclusion in Scientific Computing (“DISC”) Unconf took place as a hybrid event in São Paulo, Brazil, from March 14 to 16, 2025. Our community manager, Yanina Bellini Saibene, and rOpenSci Champions, Andrea Gomez Vargas and Liz Hare, participated.
Read more in our report from the event.
This event also presented the opportunity to meet in person with other members of the rOpenSci community in Sao Paulo, in the picture we can see Francesca Palmeira (rOpenSci Champions), Andrea Gomez Vargas (rOpenSci Champions), Beatriz Milz (rOpenSci Editor and Mentor), Haydee Svab (rOpenSci Champions) and Yanina Bellini Saibene (rOpenSci Community Manager).
All the resources for the community call in Spanish Fomentando la Ciencia Abierta en América Latina - Programa de rOpenSci are now available on our website. Check the video and links to other resources related to speakers’ experience as mentors and mentees in the rOpenSci Champions Program.
Read all about coworking!
And remember, you can always cowork independently on work related to R, work on packages that tend to be neglected, or work on what ever you need to get done!
The following five packages recently became a part of our software suite:
pangoling, developed by Bruno Nicenboim: Provides access to word predictability estimates using large language models (LLMs) based on transformer architectures via integration with the Hugging Face ecosystem. The package interfaces with pre-trained neural networks and supports both causal/auto-regressive LLMs (e.g., GPT-2; Radford et al., 2019) and masked/bidirectional LLMs (e.g., BERT; Devlin et al., 2019, doi:10.48550/arXiv.1810.04805) to compute the probability of words, phrases, or tokens given their linguistic context. By enabling a straightforward estimation of word predictability, the package facilitates research in psycholinguistics, computational linguistics, and natural language processing (NLP). It has been reviewed by Lisa Levinson and Utku Turk.
mbquartR, developed by Alex Koiter: This package has four main functions: 1) download the Manitoba Original Survey Legal Descriptions data set; 2) find the coordinates of a quarter sections given the legal land description (e.g., “NE-11-33-29W”); 3) find the legal land description using coordinates (lat and long); and 4) plot these points on a map. It has been reviewed by Emily H Markowitz and Sheila Saia.
mapmetadata, developed by Rachael Stickland: Prior to gaining full access to health datasets, explore publicly available metadata and map metadata onto predefined research domains. This package uses structural metadata files downloaded from the Health Data Research Gateway (https://healthdatagateway.org/en). In theory, any metadata file with the same structure as the files downloaded from this gateway can be used with this package, but the package has been developed and tested on metadata files from this gateway only. It has been reviewed by Zoë Turner and Yohann Mansiaux.
geotargets, developed by Nicholas Tierney together with Eric Scott and Andrew Brown: Provides extensions for various geographic spatial file formats, such as shape files and rasters. Currently provides support for the terra geographic spatial formats. See the vignettes for worked examples, demonstrations, and explanations of how to use the various package extensions. It has been reviewed by Anthony Martinez and Denisse Fierro Arcos.
saperlipopette, developed by Maëlle Salmon: Holds functions creating Git messes, that users would then solve, to follow https://ohshitgit.com/.
Discover more packages, read more about Software Peer Review.
The following five packages have had an update since the last newsletter: ijtiff (v3.1.0
), mapmetadata (v4.0.2
), tidyhydat (0.7.1
), vcr (v1.7.0
), and waywiser (v0.6.2
).
There are twelve recently closed and active submissions and 6 submissions on hold. Issues are at different stages:
Four at ‘6/approved’:
butterfly, Verification For Continually Updating Timeseries Data. Submitted by Thomas Zwagerman.
geotargets, Targets Extensions for Geospatial Formats. Submitted by Nicholas Tierney.
mapmetadata, Map health metadata onto predefined research domains. Submitted by Rachael Stickland.
mbquartR, Finding Manitoba Quarter Sections. Submitted by Alex Koiter.
Two at ‘4/review(s)-in-awaiting-changes’:
gtexr, Query the GTEx Portal API. Submitted by Alasdair Warwick.
pkgmatch, Find R Packages Matching Either Descriptions or Other R Packages. Submitted by mark padgham.
Three at ‘3/reviewer(s)-assigned’:
forcis, An R Client to Access the FORCIS Database. Submitted by Nicolas Casajus.
sits, Satellite Image Time Series Analysis for Earth Observation Data Cubes. Submitted by Gilberto Camara.
fwildclusterboot, Fast Wild Cluster Bootstrap Inference for Linear Models. Submitted by Alexander Fischer. (Stats).
One at ‘2/seeking-reviewer(s)’:
Two at ‘1/editor-checks’:
read.abares, Provides simple downloading, parsing and importing of Australian. Submitted by Adam H. Sparks.
capybara, Fast and Memory Efficient Fitting of Linear Models With High-Dimensional. Submitted by Mauricio “Pachá” Vargas Sepúlveda.
Find out more about Software Peer Review and how to get involved.
Software Review, Perspectives from an Academic by Alex Koiter. Comparing rOpenSci peer review to traditional academic reviews. Other languages: Revisión del software, perspectivas de un académico (es).
Better documentation for R-universe! by Maëlle Salmon, Jeroen Ooms, and Noam Ross. We are excited to announce that R-Universe now has a documentation website thanks to Google Season of Docs.
rOpenSci Champions Program 2025: In Spanish! by Yanina Bellini Saibene. The call is open for a new cohort of our Champions Program with a focus on Latin America. Apply. Other languages: Nueva cohorte del Programa de Campeon(a|e)s rOpenSci (es).
Template To Handle Code of Conduct Incidents Reports by Natalia Morandeira and Yanina Bellini Saibene. Learn how rOpenSci’s template provides a structured approach to effectively and confidentially analyze Code of Conduct incident reports.
rOpenSci Code of Conduct Annual Review by Yanina Bellini Saibene, Mark Padgham, and Natalia Morandeira. Updates for version 2.5 of rOpenSci’s Code of Conduct.
rOpenSci Participation at NumFOCUS’ DISC Unconf by Yanina Bellini Saibene. Report of rOpenSci participation at the NumFOCUS hybrid unconference.
Three use cases of our packages and resources have been reported since we sent the last newsletter.
Visualizing Highways, Toll Booths with {ggplot2} in R with {osmextract}. Reported by Aditya Dahiya.
Combining with Population Density Rasters with OSM data in R with {sf}, {osmextract} and {ggplot2}. Reported by Aditya Dahiya.
Translating course to Spanish and French. Reported by Yann Say.
Explore other use cases and report your own!
If you’re interested in maintaining any of the R packages below, you might enjoy reading our blog post What Does It Mean to Maintain a Package?.
photosearcher, Searches Flickr for photographs and metadata. Issue for volunteering.
MODIStsp, automatic download and preprocessing of MODIS Land Products Time Series. Issue for volunteering.
hddtools, Hydrological Data Discovery Tools. Issue for volunteering.
USAboundaries (and USAboundariesdata), historical and contemporary boundaries of the United States of America . Issue for volunteering.
historydata, datasets for historians. Issue for volunteering.
Refer to our help wanted page – before opening a PR, we recommend asking in the issue whether help is still needed.
The bib2df package, for parsing BibTeX files into tibbles, would need some help! Issue for volunteering.
Some useful tips for R package developers. 👀
lintr::use_lintr()
If you use the lintr package for static code analysis, you might need to add a configuration file that will activate some linters, deactive others, exclude files from the linting. To create it, you can run lintr::use_lintr()
. Remember this is a function of lintr, not usethis!
Speaking of linting, you might remember about a package we mentioned in last August’s newsletter, that would not only find problems in your package like lintr, but also fix them. This package by Etienne Bacher is now called flir.
Have you ever used the styler R package to style your codebase? You might be interested in the new formatter Air. Read more in the post by Davis Vaughan and Lionel Henry on the tidyverse blog.
Compared to styler, Air offers much less customization, but is much faster. A section of the announcement is dedicated to the comparison with styler.
Have you heard of the beta IDE Positron developed by Posit (formerly RStudio)? It might be worth a try, for instance for being able to install Git extensions like GitLens, or for the nice UI for unit tests. Note that it’s still in beta.
Other IDEs popular in the R community are
Looking for a dataset for tests or docs, available in base R? The penguins dataset, originally from the palmerpenguins package, is coming to the base R datasets package! Thanks to to Ella Kaye, Heather Turner, and Kristen Gorman for their work on this.
Read all about Git Stash, and why and how you might use it, in a post by Meghan Harris.
Thanks for reading! If you want to get involved with rOpenSci, check out our Contributing Guide that can help direct you to the right place, whether you want to make code contributions, non-code contributions, or contribute in other ways like sharing use cases. You can also support our work through donations.
If you haven’t subscribed to our newsletter yet, you can do so via a form. Until it’s time for our next newsletter, you can keep in touch with us via our website and Mastodon account.