Friday, May 21, 2021 From rOpenSci (https://ropensci.org/blog/2021/05/21/ropensci-news-digest-may-2021/). Except where otherwise noted, content on this site is licensed under the CC-BY license.
Dear rOpenSci friends, it’s time for our monthly news roundup!
You can read this post on our blog. Now let’s dive into the activity at and around rOpenSci!
We have no less than three categories of (online :wink:) events coming up: social co-working events, a community call, a conference with contributions from four rOpenSci staff members. :sparkles:
Our next community call is about rOpenSci’s R-universe Project and will take place on Tuesday, 25 May 2021 17:00 UTC. The R-universe platform is a new umbrella project under which rOpenSci experiments with new ideas for improving publication and discovery of research software packages in R. In this 1-hour community call, Jeroen Ooms will explain the basic steps of setting up your own universe, and getting started with publishing packages (including experimental software, development versions, research compendia) and articles on your personal subdomain. We are interested in hearing your thoughts and ideas on how this infrastructure can improve how researchers publish and discover R packages and related content.
Video and resources from our past community call Set Up Your Package to Foster a Community were posted. rOpenSci puts ongoing effort into exploring and communicating how developers can best attract attention to their package (e.g. usage, citations, or feedback), or how to set up their repository to encourage the types of contributions they want. In this 1-hour community call, Maëlle Salmon, Hugo Gruson, and Steffi LaZerte shared tips and examples on how to do this! Sina Rüeger summarized the call with help from Mark Padgham.
We held our first two Social Co-Working and Label-athon events on April 29th and May 13th. These were truly international events with 14 participants from Canada, Germany, Switzerland, USA, the UK, France, Singapore, and Mexico. We had a blast socializing (scavenger hunts!) and co-working on our personal R projects including repository organization, creating/labelling issues, preliminary research, personal GitHub README files, and those packages you can never find time for. We had quiet and noisy rooms to work in, so everyone was able to find an environment that worked best for them. Sound like fun? We have two more coming up!
Thursday May 27th 9am Pacific (16:00 UTC) - Registration encouraged!
Thursday June 10th 5pm Pacific (Fri June 11th 00:00 UTC) - Registration encouraged!
Registration for the useR! 2021 conference has begun, with Early Bird Deadline until May 28th. useR! conferences are non-profit conferences organized by community volunteers for the community, supported by the R Foundation. useR! 2021 will take place online.
Get excited for four contributions by rOpenSci staff members:
Jeroen Ooms, rOpenSci Lead Infrastructure Engineer, will give a keynote talk about building a modern and scalable package build infrastructure, such as R-universe.
Maëlle Salmon, Research Software Engineer with rOpenSci, will give a tutorial, GET better at testing your R package! on Wednesday, July the 7th at 9-12AM GMT+2. This tutorial is about advanced testing of R packages, with HTTP testing as a case study.
Stefanie Butland, rOpenSci community manager, will present on rOpenSci’s Model for Managing a Federated Open Source Software Community.
Mark Padgham, rOpenSci Software Research Scientist, will also give a regular talk, about his original autotest package for automatic testing of packages.
Find out about more events.
Another great piece of news: censo2017 was the first package to pass rOpenSci software peer-review in Spanish! :tada: The censo2017 package by Pachá (aka Mauricio Vargas Sepúlveda) provides access to the 2017 Chilean Census. Editor Melina Vidoni was in charge of this review, the reviewers were Frans van Dunné and María Paula Caldas. Frans van Dunné most kindly translated our review template to Spanish!
Thank you to people in the Latin American R community for discussion and encouragement that helped make this happen.
The following two packages recently became a part of our software suite:
censo2017, developed by Mauricio Vargas: Provee un acceso conveniente a mas de 17 millones de registros de la base de datos del Censo 2017. Los datos fueron importados desde el DVD oficial del INE usando el Convertidor REDATAM creado por Pablo De Grande. Esta paquete esta documentado intencionalmente en castellano asciificado para que funcione sin problema en diferentes plataformas. (Provides convenient access to more than 17 million records from the Chilean Census 2017 database. The datasets were imported from the official DVD provided by the Chilean National Bureau of Statistics by using the REDATAM converter created by Pablo De Grande and in addition it includes the maps accompanying these datasets.). It is available on CRAN. It has been reviewed by Frans van Dunné, Maria Paula Caldas.
exoplanets, developed by Tyler Littlefield: The goal of exoplanets is to provide access to NASA’s Exoplanet Archive TAP Service. For more information regarding the API please read the documentation https://exoplanetarchive.ipac.caltech.edu/index.html. It is available on CRAN. It has been reviewed by Maëlle Salmon, Nichole Monhait.
Discover more packages, read more about Software Peer Review.
The following forty packages have had an update since the latest newsletter: gert (
v1.3.0), babette (
v2.3), BaseSet (
v0.0.16), beastier (
v2.3.1), beautier (
v2.5), bold (
v1.2.0), brranching (
v0.7.0), c14bazAAR (
2.3.0), circle (
v0.7.1), codemetar (
v0.3.0), drake (
7.13.2), exoplanets (
v0.2.1), fingertipsR (
v1.0.6), ijtiff (
v2.2.6), jqr (
v1.2.1), lingtypology (
v1.1.5), magick (
v2.7.2), mauricer (
v2.4), mcbette (
v1.14), MtreeRing (
v1.4.5), oai (
v0.3.2), opencv (
v0.2.1), pangaear (
v1.1.0), pathviewr (
v1.0.1), rbhl (
v0.9.2), rdhs (
0.7.2), rglobi (
v0.2.24), rnoaa (
v1.3.4), rzmq (
v0.9.8), solrium (
v1.2.0), ssh (
v0.8.0), stantargets (
0.0.2), tarchetypes (
0.2.0), targets (
0.4.2), terrainr (
v0.4.0), tidyhydat (
0.5.3), tracerer (
v2.2), UCSCXenaTools (
v1.4.4), workloopR (
v1.1.4), writexl (
There are sixteen recently closed and active submissions and 4 submissions on hold. Issues are at different stages:
Three at ‘6/approved’:
stantargets, Targets for Stan Workflows. Submitted by Will Landau.
dataspice, Create Lightweight Schema.org Descriptions of Data. Submitted by Bryce Mecum.
censo2017, Base de Datos de Facil Acceso del Censo 2017 de Chile. Submitted by Pachamaltese.
Three at ‘4/review(s)-in-awaiting-changes’:
tidytags, Simple Collection and Powerful Analysis of Twitter Data. Submitted by Bret Staudt Willet.
healthdatacsv, Access data in the healthdata.gov catalog. Submitted by iecastro.
chemspiderapi, R Wrapper for ChemSpider’s API Services. Submitted by Raoul Wolf.
Six at ‘3/reviewer(s)-assigned’:
gendercoder, Recodes Sex/Gender Descriptions Into A Standard Set. Submitted by Emily Kothe.
mctq, An R Package for the Munich ChronoType Questionnaire. Submitted by Daniel Vartanian.
ROriginStamp, Interface to OriginStamp API to Obtain Trusted Time Stamps. Submitted by Rainer M Krug.
awardFindR, QDR awardFindR. Submitted by Michael C. McCall.
slopes, Calculate Slopes of Roads, Rivers and Trajectories. Submitted by RFlx.
occCite, Querying and Managing Large Biodiversity Occurrence Datasets. Submitted by Hannah Owens.
Three at ‘2/seeking-reviewer(s)':
rsat, Tools for Downloading, Customizing, and Processing Time Series of Satellite Images from Landsat, MODIS, and Sentinel. Submitted by Unai Pérez-Goya.
allodb, Tree Biomass Estimation at Extratropical Forest Plots. Submitted by Erika Gonzalez-Akre.
jagstargets, Targets for JAGS Workflows. Submitted by Will Landau.
One at ‘1/editor-checks’:
Find out more about Software Peer Review and how to get involved.
Below are the citations recently added to our database of 1295 articles, that you can explore on our citations page. We found use of…
biomartr in Böttcher, A., Büttner, M., Tritschler, S., Sterr, M., Aliluev, A., Oppenländer, L., … Lickert, H. (2021). Non-canonical Wnt/PCP signalling regulates intestinal stem cell lineage priming towards enteroendocrine and Paneth cell fates. Nature Cell Biology, 23(1), 23–31. https://doi.org/10.1038/s41556-020-00617-2
bold, rentrez in Batovska, J., Piper, A., Valenzuela, I., Cunningham, J., & Blacket, M. (2020). Developing a Non-destructive Metabarcoding Protocol for Detection of Pest Insects in Bulk Trap Catches. https://doi.org/10.21203/rs.3.rs-125070/v1
comtradr in Chini, C. M., & Peer, R. A. M. (2021). The traded water footprint of global energy from 2010 to 2018. Scientific Data, 8(1). https://doi.org/10.1038/s41597-020-00795-6
CoordinateCleaner in Esquivel, D. A., Aya-Cuero, C., Penagos, A. P., Chacón-Pacheco, J., Agámez-López, C. J., Ochoa, A. V., … Bennett, D. (2020). Updating the distribution of Vampyrum spectrum (Chiroptera, Phyllostomidae) in Colombia: new localities, potential distribution and notes on its conservation. Neotropical Biology and Conservation, 15(4), 689–709. https://doi.org/10.3897/neotropical.15.e58383
CoordinateCleaner in Suissa, J. S., & Sundue, M. A. (2020). Diversity Patterns of Neotropical Ferns: Revisiting Tryon’s Centers of Richness and Endemism. American Fern Journal, 110(4). https://doi.org/10.1640/0002-8444-110.4.211
CoordinateCleaner, taxize in Tamme, R., Pärtel, M., Kõljalg, U., Laanisto, L., Liira, J., Mander, Ü., … Zobel, M. (2020). Global macroecology of nitrogen‐fixing plants. Global Ecology and Biogeography, 30(2), 514–526. https://doi.org/10.1111/geb.13236
FedData in Porter, W. T., Barrand, Z. A., Wachara, J., DaVall, K., Mihaljevic, J. R., Pearson, T., … Nieto, N. C. (2021). Predicting the current and future distribution of the western black-legged tick, Ixodes pacificus, across the Western US using citizen science collections. PLOS ONE, 16(1), e0244754. https://doi.org/10.1371/journal.pone.0244754
neotoma in Byun, E., Sato, H., Cowling, S. A., & Finkelstein, S. A. (2020). Extensive wetland development in mid-latitude North America during the Bølling–Allerød. Nature Geoscience, 14(1), 30–35. https://doi.org/10.1038/s41561-020-00670-4
NLMR in Thompson, P. R., Derocher, A. E., Edwards, M. A., & Lewis, M. A. (2021). Describing spatiotemporal memory patterns using animal movement modelling. arXiv preprint arXiv:2101.04183. https://arxiv.org/pdf/2101.04183
nlrx in Dahirel, M., Bertin, A., Haond, M., Blin, A., Lombaert, E., Calcagno, V., … Vercken, E. (2020). Shifts from pulled to pushed range expansions caused by reduction of landscape connectivity. https://doi.org/10.1101/2020.05.13.092775
osmdata in Moradi, M. (2020). Evaluating the quality of OSM roads and buildings in the Québec Province. https://corpus.ulaval.ca/jspui/bitstream/20.500.11794/67232/1/36576.pdf
osmdata in Wilkins, E.J. 2020. Using Social Media to Assess the Impact of Weather and Climate on Visitation to Outdoor Recreation Settings. https://digitalcommons.usu.edu/etd/7986
pdftools in Li, B., Trueman, B. F., Rahman, M. S., & Gagnon, G. A. (2021). Controlling lead release due to uniform and galvanic corrosion — An evaluation of silicate-based inhibitors. Journal of Hazardous Materials, 407, 124707. https://doi.org/10.1016/j.jhazmat.2020.124707
pdftools, tesseract in Hines, R. E., Grandage, A. J., & Willoughby, K. G. (2020). Staying Afloat: Planning and Managing Climate Change and Sea Level Rise Risk in Florida’s Coastal Counties. Urban Affairs Review, 107808742098052. https://doi.org/10.1177/1078087420980526
phylocomr, brranching in Perez, T. M., Socha, A., Tserej, O., & Feeley, K. J. (2021). Photosystem II heat tolerances characterize thermal generalists and the upper limit of carbon assimilation. Plant, Cell & Environment. https://doi.org/10.1111/pce.13990
rfishbase in Palacios-Abrantes, J., Reygondeau, G., Wabnitz, C. C. C., & Cheung, W. W. L. (2020). The transboundary nature of the world’s exploited marine species. Scientific Reports, 10(1). doi:10.1038/s41598-020-74644-2
rfishbase in Parravicini, V., Casey, J. M., Schiettekatte, N. M. D., Brandl, S. J., Pozas-Schacre, C., Carlot, J., … Stuart-Smith, R. D. (2020). Delineating reef fish trophic guilds with global gut content data synthesis and phylogeny. PLOS Biology, 18(12), e3000702. https://doi.org/10.1371/journal.pbio.3000702
rgbif in Ascensão, F., D’Amico, M., Martins, R. C., Rebelo, R., Barbosa, A. M., Bencatel, J., … Capinha, C. (2021). Distribution of alien tetrapods in the Iberian Peninsula. NeoBiota, 64, 1–21. https://doi.org/10.3897/neobiota.64.55597
rgbif in Freimuth, J., Bossdorf, O., Scheepens, J. F., & Willems, F. M. (2021). Climate warming changes synchrony of plants and pollinators. https://doi.org/10.1101/2021.01.10.425984
rgbif in Liao, H., Li, D., Zhou, T., Huang, B., Zhang, H., Chen, B., & Peng, S. (2020). The role of functional strategies in global plant distribution. Ecography, 44(4), 493–503. https://doi.org/10.1111/ecog.05476
rgbif in Pineda-Munoz, S., Wang, Y., Lyons, S. K., Tóth, A. B., & McGuire, J. L. (2021). Mammal species occupy different climates following the expansion of human impacts. Proceedings of the National Academy of Sciences, 118(2), e1922859118. https://doi.org/10.1073/pnas.1922859118
rgbif in Santos, G. N. dos, Silva, A. C. da, & Higuchi, P. (2020). IMPACT OF CLIMATE CHANGE ON THE GEOGRAPHICAL DISTRIBUTION OF A CLOUD FOREST INDICATOR TREE SPECIES. Revista Árvore, 44. https://doi.org/10.1590/1806-908820200000032
rnaturalearth in Ionela-Andreea, P., & Marian, N. (2020). Cluster Analysis of Regional Research and Development Disparities in Europe. Studies in Business and Economics, 15(3), 303–312. https://doi.org/10.2478/sbe-2020-0060
rnaturalearth in Mari, A., Roloff, T.-C., Stange, M., Soegaard, K. K., Asllanaj, E., Tauriello, G., … Egli, A. (2021). Global surveillance of potential antiviral drug resistance in SARS-CoV-2: proof of concept focussing on the RNA-dependent RNA polymerase. https://doi.org/10.1101/2020.12.28.20248663
rnaturalearth in Pogorevc, N., Simčič, M., Khayatzadeh, N., Soelkner, J., Berger, B., Bojkovski, D., … Horvat, S. (2020). Post-Genotyping Optimization of Dataset Formation Could Affect Genetic Diversity Parameters: An Example of Analyses with Alpine Goat Breeds. https://doi.org/10.21203/rs.3.rs-133590/v1
rnaturalearth in Shaw, E. C., Fowler, R., Ohadi, S., Bayly, M. J., Barrett, R. A., Tibbits, J., … Cousens, R. D. (2020). Explaining the worldwide distributions of two highly mobile species: Cakile edentula and Cakile maritima. Journal of Biogeography, 48(3), 603–615. https://doi.org/10.1111/jbi.14024
rotl in Sandoval-Herrera, N. I., Mastromonaco, G. F., Becker, D. J., Simmons, N. B., & Welch, K. C. (2021). Inter- and intra-specific variation in hair cortisol concentrations of Neotropical bats. https://doi.org/10.1101/2021.01.10.426004
rredlist, rfishbase in Murgier, J., McLean, M., Maire, A., Mouillot, D., Loiseau, N., Munoz, F., … Auber, A. (2021). Rebound in functional distinctiveness following warming and reduced fishing in the North Sea. Proceedings of the Royal Society B: Biological Sciences, 288(1942), 20201600. https://doi.org/10.1098/rspb.2020.1600
rtweet in Boehm, F. J., & Hanlon, B. M. (2021). What Is Happening on Twitter? A Framework for Student Research Projects With Tweets. Journal of Statistics and Data Science Education, 29(sup1), S95–S102. https://doi.org/10.1080/10691898.2020.1848486
rtweet in Gutiérrez García-Pardo, I., Guevara Gil, J. A., Gómez González, D., Castro Cantalejo, J., & Espínola Vílchez, R. (2021). Community Detection Problem Based on Polarization Measures. An application to Twitter: the COVID-19 case in Spain. https://doi.org/10.20944/preprints202101.0080.v1
rtweet in Lucas, B., & Landman, T. (2020). Social listening, modern slavery, and COVID-19. Journal of Risk Research, 1–21. https://doi.org/10.1080/13669877.2020.1864009
skimr in Behrendt, I., Fasshauer, M., & Eichner, G. (2020). Gluten Intake and All-Cause and Cause-Specific Mortality: Prospective Findings from the UK Biobank. The Journal of Nutrition, 151(3), 591–597. https://doi.org/10.1093/jn/nxaa387
taxize in Albery, G. F., Carlson, C. J., Cohen, L. E., Eskew, E. A., Gibb, R., Ryan, S. J., … Becker, D. J. (2021). Urban-adapted mammal species have more known pathogens. https://doi.org/10.1101/2021.01.02.425084
taxize in Janssen, J., & Gomez, L. (2021). An examination of the import of live reptiles from Indonesia by the United States from 2000 to 2015. Journal for Nature Conservation, 59, 125949. https://doi.org/10.1016/j.jnc.2020.125949
taxize in Meireles, J. E., Cavender‐Bares, J., Townsend, P. A., Ustin, S., Gamon, J. A., Schweiger, A. K., … O’Meara, B. C. (2020). Leaf reflectance spectra capture the evolutionary history of seed plants. New Phytologist, 228(2), 485–493. https://doi.org/10.1111/nph.16771
treeio in Gates, M. W., Zhang, Y. M., & Buffington, M. L. (2020). The great greenbriers gall mystery resolved? New species of Aprostocetus Westwood (Hymenoptera, Eulophidae) gall inducer and two new parasitoids (Hymenoptera, Eurytomidae) associated with Smilax L. in southern Florida, USA. Journal of Hymenoptera Research, 80, 71–98. https://doi.org/10.3897/jhr.80.59466
treeio in Maruyama, H., Masago, A., Nambu, T., Mashimo, C., Takahashi, K., & Okinaga, T. (2020). Inter-site and interpersonal diversity of salivary and tongue microbiomes, and the effect of oral care tablets. F1000Research, 9, 1477. https://doi.org/10.12688/f1000research.27502.1
treeio in Oswald, K. N., Lee, A. T. K., & Smit, B. (2021). Seasonal metabolic adjustments in an avian evolutionary relict restricted to mountain habitat. Journal of Thermal Biology, 95, 102815. https://doi.org/10.1016/j.jtherbio.2020.102815
treeio in Toparslan, E., Karabag, K., & Bilge, U. (2020). A workflow with R: Phylogenetic analyses and visualizations using mitochondrial cytochrome b gene sequences. PLOS ONE, 15(12), e0243927. https://doi.org/10.1371/journal.pone.0243927
UCSCXenaTools in Cui, Y., Hunt, A., Li, Z., Birkin, E., Lane, J., Ruge, F., & Jiang, W. G. (2021). Lead DEAD/H box helicase biomarkers with the therapeutic potential identified by integrated bioinformatic approaches in lung cancer. Computational and Structural Biotechnology Journal, 19, 261–278. https://doi.org/10.1016/j.csbj.2020.12.007
webchem in Hammoud, Z., & Kramer, F. (2020). Multipath: An R Package to Generate Integrated Reproducible Pathway Models. Biology, 9(12), 483. https://doi.org/10.3390/biology9120483
webchem in Su, Q.-Z., Vera, P., Nerín, C., Lin, Q.-B., & Zhong, H.-N. (2021). Safety concerns of recycling postconsumer polyolefins for food contact uses: Regarding (semi-)volatile migrants untargetedly screened. Resources, Conservation and Recycling, 167, 105365. https://doi.org/10.1016/j.resconrec.2020.105365
Thank you for citing our tools!
Three use cases of our packages and resources have been reported since we sent the last newsletter.
Scraping Google Play Reviews with RSelenium. Reported by jlaw.
Easily analyzing tweets using rtweet. Reported by Amit Arora.
Most tweeted words each month: a year snapshot. Reported by Soraya Campbell.
Explore other use cases and report your own!
There’s no open call for new maintainers at this point but you can refer to our contributing guide for finding ways to get involved!
As the maintainer of an rOpenSci package, feel free to contact us on Slack or email
[email protected] to get your call for maintainer featured in the next newsletter.
Some useful tips for R package developers. :eyes:
Sometimes you find yourself documenting internal functions with roxygen2 without really wanting everyone to get to see the docs.
Do you know the difference between the
@internalmeans there will be a manual page for the function but it won’t be in the documentation index. "
@keywords internalis useful because it removes the function from the documentation index; it’s useful for functions aimed primarily at other developers, not typical users of the package."
@NoRdmeans there won’t even be a manual page so the roxygen2 documentation of that function only lives in the R script. We at rOpenSci strongly encourage documentation of internal functions, and advise doing so within code itself rather than as separate manual pages – that is, using
@internal. It is also recommended in the tidyverse style guide.
Speaking of documentation, when writing examples, do you know the difference between
(Is this newsletter section turning into a quizz? :sweat_smile:)
The roxygen2 docs feature a table explaining when different types of examples are run depending on the context (in
R CMD check,
R CMD check –as-cran) which is very useful when you wonder how to safeguard your package examples on e.g. CRAN.
The same documentation section introduces the very handy
@examplesIf roxygen2 tag that you can use for finer control.
Now on to code archaeology! Do you want to “pinpoint the version at which a certain argument appeared in a specific function of a package (or that its default value changed)", e.g. to figure out the minimal version required? You might like the apicheck package by David Hugh-Jones, that helps you “explore the historical API of functions in CRAN packages”; and rcheology by the same author for data on base packages. Related to this are:
Thanks for reading! If you want to get involved with rOpenSci, check out our Contributing Guide that can help direct you to the right place, whether you want to make code contributions, non-code contributions, or contribute in other ways like sharing use cases.
If you haven’t subscribed to our newsletter yet, you can do so via a form. Until it’s time for our next newsletter, you can keep in touch with us via our website and Twitter account.