Scraping the bottom of the barrel: are rare high throughput sequences artifacts?

dc.citation.doi10.1016/j.funeco.2014.08.006en_US
dc.citation.epage225en_US
dc.citation.jtitleFungal Ecologyen_US
dc.citation.spage221en_US
dc.citation.volume13en_US
dc.contributor.authorBrown, Shawn Paul
dc.contributor.authorVeach, Allison M.
dc.contributor.authorRigdon-Huss, Anne R.
dc.contributor.authorGrond, Kirsten
dc.contributor.authorLickteig, Spencer K.
dc.contributor.authorLothamer, Kale M.
dc.contributor.authorOliver, Alena K.
dc.contributor.authorJumpponen, Ari M.
dc.contributor.authoreidspbrown1en_US
dc.contributor.authoreidamveachen_US
dc.contributor.authoreidannehussen_US
dc.contributor.authoreidkgronden_US
dc.contributor.authoreidlothameren_US
dc.contributor.authoreidakoliveren_US
dc.contributor.authoreidarien_US
dc.date.accessioned2015-09-22T15:49:39Z
dc.date.available2015-09-22T15:49:39Z
dc.date.published2015en_US
dc.description.abstractMetabarcoding data generated using next-generation sequencing (NGS) technologies are overwhelmed with rare taxa and skewed in Operational Taxonomic Unit (OTU) frequencies comprised of few dominant taxa. Low frequency OTUs comprise a rare biosphere of singleton and doubleton OTUs, which may include many artifacts. We present an in-depth analysis of global singletons across sixteen NGS libraries representing different ribosomal RNA gene regions, NGS technologies and chemistries. Our data indicate that many singletons (average of 38 % across gene regions) are likely artifacts or potential artifacts, but a large fraction can be assigned to lower taxonomic levels with very high bootstrap support (~32 % of sequences to genus with ≥90 % bootstrap cutoff). Further, many singletons clustered into rare OTUs from other datasets highlighting their overlap across datasets or the poor performance of clustering algorithms. These data emphasize a need for caution when discarding rare sequence data en masse: such practices may result in throwing the baby out with the bathwater, and underestimating the biodiversity. Yet, the rare sequences are unlikely to greatly affect ecological metrics. As a result, it may be prudent to err on the side of caution and omit rare OTUs prior to downstream analyses.en_US
dc.identifier.urihttp://hdl.handle.net/2097/20444
dc.language.isoen_USen_US
dc.relation.urihttp://www.sciencedirect.com/science/article/pii/S1754504814001068en_US
dc.rightsNOTICE: this is the author's version of a work that was accepted for publication in . Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Fungal Ecology, 13 (2015), DOI: 10.1016/j.funeco.2014.08.006en_US
dc.subjectFungien_US
dc.subjectHigh-throughput sequencingen_US
dc.subjectRare biosphereen_US
dc.subjectSingletonen_US
dc.titleScraping the bottom of the barrel: are rare high throughput sequences artifacts?en_US
dc.typeArticle (author version)en_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
JumpponenFungalEcol2015.pdf
Size:
493.5 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: