Rogue Scholar

Published August 12, 2023 in Syntaxus baccata

Working on translating a key to the European shield bug nymphs (Puchkov, 1961) I thought I would look for pictures of the earlier life stages (nymphs, Fig. 1) of shield bugs (Pentatomoidea) on iNaturalist and found few observations actually had the life stage annotation.

Identification ResourcesTaxonomyComputer and Information Sciences

On the modelling of the content of identification keys

https://doi.org/10.59350/h4ak9-k6d72

Published February 1, 2023 in Syntaxus baccata

Author Lars Willighagen

Last november I wrote a blog post about how to model the taxonomic coverage of identification keys. I wanted to model this coverage to be able to determine to what extent an identification key applies to a given observation or specimen, for use in my Library of Identification Resources project. For the same project I also find it useful to be able to archive identification keys.

Identification ResourcesTaxonomyComputer and Information Sciences

On the modelling and application of the taxonomic coverage of identification keys

https://doi.org/10.59350/sv0th-dqv28

Published November 9, 2022 in Syntaxus baccata

Author Lars Willighagen

The main feature of the Library of Identification Resources is the description of the identification key (or matrix, reference, etc.). This description should on its basis specify when the key can or should be used.

BiologyBiodiversityGenomicsQ&ATaxonomyBiological Sciences

Not all species are equal: Using the h-index to quantify taxonomic bias (author Q&A)

https://doi.org/10.59350/cjmkx-fze40

Published August 15, 2022 in GigaBlog

Author Hans Zauner

The h-index is a metric that was invented to summarise the publication output and impact of researchers. In a new GigaScience article, authors from the University of New South Wales (Australia) adopt the controversial metric for a completely different purpose: to explore systematic differences in research interest ( taxonomic bias ), using mammals as an example.

Identification ResourcesTaxonomyComputer and Information Sciences

Library of Identification Resources

https://doi.org/10.59350/h8qka-z4a05

Published August 6, 2022 in Syntaxus baccata

Author Lars Willighagen

Since around this time last year, I have been working on creating a library of identification resources. Here, “identification resources” are identification keys, multi-access (matrix) keys, other works that can aid in the identification of species. The project is managed on GitHub: https://github.com/identification-resources.

Google MapsGraphMammal Species Of The WorldMammalsTaxonomyComputer and Information Sciences

Large graph viewer experiments

https://doi.org/10.59350/7esgr-61v1

Published January 2, 2022 in iPhylo

Author Roderic Page

I keep returning to the problem of viewing large graphs and trees, which means my hard drive has accumulated lots of failed prototypes. Inspired by some recent discussions on comparing taxonomic classifications I decided to package one of these (wildly incomplete) prototypes up so that I can document the idea and put the code somewhere safe.

CommunityPackagesTaxonomyTaxizeEpidemiologyComputer and Information Sciences

Using Open-Access Tools (rentrez, taxize) to Find Coronaviruses, Their Genetic Sequences, and Their Hosts

https://doi.org/10.59350/qenh9-cyj40

Published November 10, 2020 in rOpenSci - open tools for open science

Author Liam Brierley

Emerging viruses might be on everyone’s mind right now, but as an epidemiologist and disease ecologist I’ve always been interested in how and why pathogens move from animal hosts to humans.The current pandemic of the disease we call COVID-19 is caused by Severe acute respiratory syndrome (SARS) coronavirus 2 (SARS-CoV-2), a virus that has emerged from wildlife like SARS coronavirus and Middle East respiratory syndrome (MERS) coronavirus

RTaxonomyDatabaseComputer and Information Sciences

stories behind archived packages

https://doi.org/10.59350/fdsdc-5y351

Published September 10, 2020 in recology

Author Scott Chamberlain

\ Code is often arranged in packages for any given language. Packages are often cataloged in a package registry of some kind: NPM for node, crates.io for Rust, etc. For R, that registry is either CRAN or Bioconductor (for the most part). CRAN has the concept of an archived package.

TaxonomyScientificGolangNamextRgnparserComputer and Information Sciences

Scientific Name Parsing: rgnparser and namext

https://doi.org/10.59350/15sa9-b4z64

Published August 25, 2020 in rOpenSci - open tools for open science

Author Scott Chamberlain

I’m starting to tackle a few hard packages (spplit and spenv) having to do with integrating disparate data sources. I’ll talk here about spplit. I haven’t worked on spplit in a few years; I thought I’d make another attempt with “fresh” eyes. There are many use cases I can imagine for spplit; I’ll highlight a few.

RTaxonomyDatabaseComputer and Information Sciences

taxizedb: an update

https://doi.org/10.59350/xghgv-67748

Published August 17, 2020 in recology

Author Scott Chamberlain

taxizedb arose from pain in using taxize when dealing with large amounts of data in a single request or doing a lot of requests of any data size. taxize works with remote data sources on the web, so there’s a number of issues that can slow the response down: internet speed, server response speed (was a response already cached or not; or do they even use caching), etc.

Rogue Scholar Posts

Finding shield bug nymphs on iNaturalist

On the modelling of the content of identification keys

On the modelling and application of the taxonomic coverage of identification keys

Not all species are equal: Using the h-index to quantify taxonomic bias (author Q&A)

Library of Identification Resources

Large graph viewer experiments

Using Open-Access Tools (rentrez, taxize) to Find Coronaviruses, Their Genetic Sequences, and Their Hosts

stories behind archived packages

Scientific Name Parsing: rgnparser and namext

taxizedb: an update