rOpenSci | Blog

All posts (Page 69 of 122)

Tuesday, December 11, 2018

Generating reasonable starting trees for complex phylogenetic analyses

I never really thought I would write an R package. I use R pretty casually. Then, this year, I was invited to participate during the last week of the Analytical Paleobiology short course, an intensive month-long experience in quantitative paleontology. I was thrilled to be invited. But I got a slight sinking feeling in my stomach when I realized all the materials were in R.

And so I, a Pythonista, decided I would spend some of my maternity leave writing R packages to try to blend in with students who had spent the month living and breathing R.

...

By April Wright

Wednesday, December 5, 2018

Community Call - Governance strategies for open source research software projects

🎤 Dan Sholler, rOpenSci Postdoctoral Fellow

🕘 Tuesday, December 18, 2018, 10-11AM PST; 7-8PM CET (find your timezone)

☎️ Details for joining the Community Call. Everyone is welcome. No RSVP needed.

Researchers use open source software for the capabilities it provides, such as streamlined data access and analysis and interoperability with other pieces of the scientific computing ecosystem. For most complex software, generating these technical capabilities requires building and governing a community via sound management practices, activities that are often less visible than code contributions and other software development work. And unless the initial developers commit to doing all the needed work for a long time, a community needs to develop to sustain the software, and in many cases, to determine where the software should go. In this call, we’ll pull back the cover on some of the non-technical work that goes into building and sustaining a software project by highlighting the governance challenges projects face and the strategies they use to overcome them.

...

By Dan Sholler, Stefanie Butland

Tuesday, December 4, 2018

rnoaa: new data sources and NCDC units

We’ve just released a new version of rnoaa with A LOT of changes. Check out the release notes for a complete list of changes.

We’ll highlight a few things in this post:

New data sources in the package
NCDC units added to the output of ncdc()

Links:

rnoaa source code: https://github.com/ropensci/rnoaa
rnoaa on CRAN: https://cran.rstudio.com/web/packages/rnoaa/

🔗
Installation

Install the lastest from CRAN

install.packages("rnoaa")

Some binaries are not up yet on CRAN - you can also install from GitHub:

...

By Scott Chamberlain

Tuesday, December 4, 2018

Detecting spatiotemporal groups in relocation data with spatsoc

spatsoc is an R package written by Alec Robitaille, Quinn Webber and Eric Vander Wal of the Wildlife Evolutionary Ecology Lab (WEEL) at Memorial University of Newfoundland. It is the lab’s first R package and was recently accepted through the rOpenSci onboarding process with a big thanks to reviewers Priscilla Minotti and Filipe Teixeira, and editor Lincoln Mullen.

spatsoc started as a single function (what would eventually become group_pts) written by Alec in 2017 to help answer some of the questions that Quinn and Eric were asking about how animal social structure is related to spatial processes. These ideas were originally proposed by Quinn and Eric in their recent review paper ¹. After our ideas were published, we began using this early function to determine when GPS collared caribou (Rangifer tarandus) in Newfoundland were recorded within 50 m of one another, within 5 minutes. This spatiotemporal grouping allowed us to build and analyze social association networks with asnipe and igraph. Using animal telemetry data with social network analysis allowed us to draw new insights from a long term movement dataset.

...

By Alec Robitaille, Quinn Webber, Eric Vander Wal

Monday, December 3, 2018

restez: Query GenBank locally

🔗
What is `restez`?

R packages for interacting with the National Center for Biotechnology Information (NCBI) have, to-date, depended on API query calls via NCBI’s Entrez. For computational analyses that require the automated look-up of reams of biological sequence data, piecemeal querying via bandwith-limited requests is evidently not ideal. These queries are not only slow, but they depend on network connections and the remote server’s consistent behaviour. Additionally, users who make very large requests over extended periods of time run the risk of being blocked.

...

By Dom Bennett