Separating homeologs by phasing in the tetraploid wheat transcriptome

dc.citationKrasileva, K., . . . & Dubcovsky, J. (2013). Separating homeologs by phasing in the tetraploid wheat transcriptome. Genome Biology, 14(6), 1-19. https://doi.org/10.1186/gb-2013-14-6-r66
dc.citation.doi10.1186/gb-2013-14-6-r66en_US
dc.citation.issn1474-760X
dc.citation.issue6en_US
dc.citation.jtitleGenome Biologyen_US
dc.citation.spageR66en_US
dc.citation.volume14en_US
dc.contributor.authorKrasileva, Ksenia V.
dc.contributor.authorBuffalo, Vince
dc.contributor.authorBailey, Paul
dc.contributor.authorPearce, Stephen
dc.contributor.authorAyling, Sarah
dc.contributor.authorTabbita, Facundo
dc.contributor.authorSoria, Marcelo
dc.contributor.authorWang, Shichen
dc.contributor.authorAkhunov, Eduard D.
dc.contributor.authorUauy, Cristobal
dc.contributor.authorDubcovsky, Jorge
dc.contributor.authorIWGS Consortium
dc.contributor.authoreidwangscen_US
dc.contributor.authoreideakhunoven_US
dc.date.accessioned2014-03-03T21:44:07Z
dc.date.available2014-03-03T21:44:07Z
dc.date.issued2013-06-25
dc.date.published2013en_US
dc.descriptionCitation: Krasileva, K., . . . & Dubcovsky, J. (2013). Separating homeologs by phasing in the tetraploid wheat transcriptome. Genome Biology, 14(6), 1-19. https://doi.org/10.1186/gb-2013-14-6-r66
dc.description.abstractBackground: The high level of identity among duplicated homoeologous genomes in tetraploid pasta wheat presents substantial challenges for de novo transcriptome assembly. To solve this problem, we develop a specialized bioinformatics workflow that optimizes transcriptome assembly and separation of merged homoeologs. To evaluate our strategy, we sequence and assemble the transcriptome of one of the diploid ancestors of pasta wheat, and compare both assemblies with a benchmark set of 13,472 full-length, non-redundant bread wheat cDNAs. Results: A total of 489 million 100 bp paired-end reads from tetraploid wheat assemble in 140,118 contigs, including 96% of the benchmark cDNAs. We used a comparative genomics approach to annotate 66,633 open reading frames. The multiple k-mer assembly strategy increases the proportion of cDNAs assembled full-length in a single contig by 22% relative to the best single k-mer size. Homoeologs are separated using a post-assembly pipeline that includes polymorphism identification, phasing of SNPs, read sorting, and re-assembly of phased reads. Using a reference set of genes, we determine that 98.7% of SNPs analyzed are correctly separated by phasing. Conclusions: Our study shows that de novo transcriptome assembly of tetraploid wheat benefit from multiple k-mer assembly strategies more than diploid wheat. Our results also demonstrate that phasing approaches originally designed for heterozygous diploid organisms can be used to separate the close homoeologous genomes of tetraploid wheat. The predicted tetraploid wheat proteome and gene models provide a valuable tool for the wheat research community and for those interested in comparative genomic studies.en_US
dc.description.versionArticle: Version of Record
dc.identifier.urihttp://hdl.handle.net/2097/17202
dc.language.isoen_USen_US
dc.relation.urihttps://doi.org/10.1186/gb-2013-14-6-r66en_US
dc.rights© 2013 Krasileva et al.; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
dc.rights.urihttps://creativecommons.org/licenses/by/2.0/
dc.subjectTranscriptome assemblyen_US
dc.subjectMultiple k-mer assemblyen_US
dc.subjectWheaten_US
dc.subjectPolyploiden_US
dc.subjectTriticum urartuen_US
dc.subjectTriticum turgidumen_US
dc.subjectPseudogenesen_US
dc.titleSeparating homeologs by phasing in the tetraploid wheat transcriptomeen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
AkhunovGenomeBiol2013.pdf
Size:
852.43 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: