phylotaR: Retrieve Orthologous Sequences from GenBank

August 8, 2018

By: Dom Bennett

In this technote I will outline what phylotaR was developed for, how to install it, and how to run it with some simple examples. What is phylotaR? In any phylogenetic analysis it is important to identify orthologous sequences – homologous sequences separated by speciation events. This is often done by simply searching an online sequence repository using sequence labels. Relying solely on sequence labels, however, can miss sequences that have not been labelled, have unanticipated names, or have been mislabelled.

Extracting and Processing eBird Data

August 7, 2018

By: Matthew Strimas-Mackey

eBird is an online tool for recording bird observations. The eBird database currently contains over 500 million records of bird sightings, spanning every country and nearly every bird species, making it an extremely valuable resource for bird research and conservation. These data can be used to map the distribution and abundance of species, and assess how species’ ranges are changing over time. This dataset is available for download as a text file; however, this file is huge (over 180 GB!

A package for dimensionality reduction of large data

August 1, 2018

By: Sean Hughes  |  Angela Li  |  Ju Kim  |  Malisa Smith  |  Ted Laderas

Motivation. Note: Recently, two new UMAP R packages have appeared. These new packages provide more features than umapr does and they are more actively developed. The first is umap, which provides the same Python wrapping function as umapr and also an R implementation, removing the need for the Python version to be installed; it is available on CRAN. The second is uwot, which also provides an R implementation, removing the need for the Python version to be installed.

What's inside? pkginspector provides helpful tools for inspecting package contents

July 17, 2018

By: Sam Albers  |  Leonardo Collado-Torres  |  Mauro Lepore  |  Joyce Robbins  |  Noam Ross  |  Omayma Said

R packages are widely used in science, yet the code behind them often does not come under scrutiny. To address this lack, rOpenSci has been a pioneer in developing a peer review process for R packages. The goal of pkginspector is to help that process by providing a means to better understand the internal structure of R packages. It offers tools to analyze and visualize the relationship among functions within a package, and to report whether or not functions’ interfaces are consistent.

phylogram: dendrograms for evolutionary analysis

July 12, 2018

By: Shaun Wilkinson

Evolutionary biologists are increasingly using R for building, editing and visualizing phylogenetic trees. The reproducible code-based workflow and comprehensive array of tools available in packages such as ape, phangorn and phytools make R an ideal platform for phylogenetic analysis. Yet the many different tree formats are not well integrated, as pointed out in a recent post. The standard data structure for phylogenies in R is the “phylo” object, a memory-efficient, matrix-based tree representation.
