O.S. Kozlova*, Z.I. Abramova**
Kazan Federal University, Kazan, 420008 Russia
E-mail: *olga-sphinx@yandex.ru, **ziabramova@mail.ru
Received March 6, 2018
Full text PDF
Abstract
A completely new version of assembly of the African anhydrobiotic midge Polypedilum vanderplanki genome derived by deep DNA sequencing of the Pv11 cell line has been discussed. The input data include paired-end and mate-paired reads with various insert sizes supplemented with ultra-long reads of Pacific Biosciences platform sequencing. We have shown that the resulting set of scaffolds has higher continuity and completeness metrics and, besides, can provide more correct predictions of coding sequences as compared to the previous assembly version, which has been demonstrated based on heat-shock proteins HSP20 and HSP70.
Keywords: Polypedilum vanderplanki, anhydrobiosis, DNA sequencing, genome assembly
Acknowledgments. The work is performed according to the Russian Government Program of Competitive Growth of Kazan Federal University.
Figure Captions
Fig. 1. Factor distribution histogram of 21-mers based on all Illumina data. Red line – approximation of the entire statistical model (erroneous k-mers and genomic k-mers), blue and green lines – approximation of the models for heterozygous and homozygous k-mers, respectively.
References
- Cornette R, Kikawada T. The induction of anhydrobiosis in the sleeping chironomid: Current status of our knowledge. IUBMB Life, 2011, vol. 63, no. 6, pp. 419–429. doi: 10.1002/iub.463.
- Nakahara Y, Watanabe M, Fujita A, Kanamori Y, Tanaka D, Iwata K, Furuki T, Sakurai M, Kikawada T, Okuda T. Effects of dehydration rate on physiological responses and survival after rehydration in larvae of the anhydrobiotic chironomid. J. Insect Physiol., 2008, vol. 54, no. 8, pp. 1220–1225. doi: 10.1016/j.jinsphys.2008.05.007.
- Watanabe M., Kikawada T, Okuda T. Increase of internal ion concentration triggers trehalose synthesis associated with cryptobiosis in larvae of Polypedilum vanderplanki. J. Exp. Biol., 2003, vol. 206, pt. 13, pp. 2281–2286. doi: 10.1242/jeb.00418.
- Ryabova A., Mukae K., Cherkasov A., Cornette R., Shagimardanova E., Sakashita T., Okuda T., Kikawada T., Gusev O. Genetic background of enhanced radioresistance in an anhydrobiotic insect: Transcriptional response to ionizing radiations and desiccation. Extremophiles, 2017, vol. 21, no. 1, pp. 109–120. doi: 10.1007/s00792-016-0888-9.
- Gusev O., Suetsugu Y., Cornette R., Kawashima T., Logacheva M.D., Kondrashov A.S., Penin A.A., Hatanaka R., Kikuta S., Shimura S., Kanamori H., Katayose Y., Matsumoto T., Shagimardanova E., Alexeev D., Govorun V., Wisecaver J., Mikheyev A., Koyanagi R., Fujie M., Nishiyama T., Shigenobu S., Shibata T.F., Golygina V., Hasebe M., Okuda T., Satoh N., Kikawada T. Comparative genome sequencing reveals genomic signature of extreme desiccation tolerance in the anhydrobiotic midge. Nat. Commun., 2014, vol. 5, art. 4784, pp. 1–9. doi: 10.1038/ncomms5784.
- Eid J., Fehr A., Gray J., Luong K., Lyle J., Otto G., Peluso P., Rank D., Baybayan P., Bettman B., Bibillo A., Bjornson K., Chaudhuri B., Christians F., Cicero R., Clark S., Dalal R., Dewinter A., Dixon J., Foquet M., Gaertner A., Hardenbol P., Heiner C., Hester K., Holden D., Kearns G., Kong X., Kuse R., Lacroix Y., Lin S., Lundquist P., Ma C., Marks P., Maxham M., Murphy D., Park I., Pham T., Phillips M., Roy J., Sebra R., Shen G., Sorenson J., Tomaney A., Travers K., Trulson M., Vieceli J., Wegener J., Wu D., Yang A., Zaccarin D., Zhao P., Zhong F., Korlach J., Turner S. Real-time DNA sequencing from single polymerase molecules. Science, 2009, vol. 323, no. 5910, pp. 133–138. doi: 10.1126/science.1162986.
- Bolger A.M., Lohse M., Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics, 2014, vol. 30, no. 15, pp. 2114–2120. doi: 10.1093/bioinformatics/btu170.
- O'Connell J., Schulz-Trieglaff O., Carlson E., Hims M.M., Gormley N.A., Cox A.J. NxTrim: Optimized trimming of Illumina mate pair reads. Bioinformatics, 2015, vol. 31, no. 12, pp. 2035–2037. doi: 10.1093/bioinformatics/btv057.
- Chikhi R., Medvedev P. Informed and automated k-mer size selection for genome assembly. Bioinformatics, 2014, vol. 30, no. 1, pp. 31–37. doi: 10.1093/bioinformatics/btt310.
- Chin C.S., Alexander D.H., Marks P., Klammer A.A., Drake J., Heiner C., Clum A., Copeland A., Huddleston J., Eichler E.E., Turner S.W., Korlach J. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods, 2013, vol. 10, no. 6, pp. 563–569. doi: 10.1038/nmeth.2474.
- Ye Ch., Hill C.M., Wu Sh., Ruan J., Ma Zh. (Sam). DBG2OLC: Efficient assembly of large genomes using long erroneous reads of the third generation sequencing technologies. Sci. Rep., 2016, vol. 6, art. 31900, pp. 1–9. doi: 10.1038/srep31900.
- Kajitani R., Toshimoto K., Noguchi H., Toyoda A., Ogura Y., Okuno M., Yabana M., Harada M., Nagayasu E., Maruyama H., Kohara Y., Fujiyama A., Itoh T. Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads. Genome Res., 2014, vol. 24, no. 8, pp. 1384–1395. doi: 10.1101/gr.170720.113.
- English A.C., Richards S., Han Y., Wang M., Vee V., Qu J., Qin X., Muzny D.M., Reid J.G., Worley K.C., Gibbs R.A. Mind the gap: Upgrading genomes with Pacific Biosciences RS long-read sequencing technology. PLoS ONE, 2012, vol. 7, no. 11, art. e47768, pp. 1–12. doi: 10.1371/journal.pone.0047768.
- Wences A.H., Schatz M.C. Metassembler: Merging and optimizing de novo genome assemblies. Genome Biol., 2015, vol. 16, art. 207, pp. 1–10. doi: 10.1186/s13059-015-0764-4.
- Boetzer M., Henkel C.V., Jansen H.J., Butler D., Pirovano W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics, 2011, vol. 27, no. 4, pp. 578–579. doi: 10.1093/bioinformatics/btq683.
- Nadalin F., Vezzi F., Policriti A. GapFiller: A de novo assembly approach to fill the gap within paired reads. BMC Bioinf., 2012, vol. 13, suppl. 14, art. S8, pp. 1–16. doi: 10.1186/1471-2105-13-S14-S8.
- Grabherr M.G., Haas B.J., Yassour M., Levin J.Z., Thompson D.A., Amit I., Adiconis X., Fan L., Raychowdhury R., Zeng Q., Chen Z., Mauceli E., Hacohen N., Gnirke A., Rhind N., di Palma F., Birren B.W., Nusbaum C., Lindblad-Toh K., Friedman N., Regev A. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat. Biotechnol., 2011, vol. 29, no. 7, pp. 44–52. doi: 10.1038/nbt.1883.
- Kent W.J. BLAT – the BLAST-like alignment tool. Genome Res., 2002, vol. 12, no. 4, pp. 656–664. doi: 10.1101/gr.229202.
- Xue W., Li J.T., Zhu Y.P., Hou G.Y., Kong X.F., Kuang Y.Y., Sun X.W. L_RNA_scaffolder: Scaffolding genomes with transcripts. BMC Genomics, 2013, vol. 14, art. 604, pp. 1–14. doi: 10.1186/1471-2164-14-604.
- Gurevich A., Saveliev V., Vyahhi N., Tesler G. QUAST: Quality assessment tool for genome assemblies. Bioinformatics, 2013, vol. 29, no. 8, pp. 1072–1075. doi: 10.1093/bioinformatics/btt086.
- Simão F.A., Waterhouse R.M., Ioannidis P., Kriventseva E.V., Zdobnov E.M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics, 2015, vol. 31, no. 19, pp. 3210–3212. doi: 10.1093/bioinformatics/btv351.
- Li X., Waterman M.S. Estimating the repeat structure and length of DNA sequences using L-tuples. Genome Res., 2003, vol. 13, no. 8, pp. 1916–1922. doi: 10.1101/gr.1251803.
- Mazin P.V., Shagimardanova E., Kozlova O., Cherkasov A., Sutormin R., Stepanova V.V., Stupnikov A., Logacheva M., Penin A., Sogame Y., Cornette R., Tokumoto S., Miyata Y., Kikawada T., Gelfand M.S., Gusev O. Cooption of heat shock regulatory system for anhydrobiosis in the sleeping chironomid Polypedilum vanderplanki. Proc Natl. Acad. Sci. U S A., 2018, vol. 115, no. 10, pp. E2477–E2486. doi: 10.1073/pnas.1719493115.
- Kozlova O., Cherkasov A., Przhiboro A., Shagimardanova E. Complexity of expression control of HSP70 genes in extremophilic midges. BioNanoScience, 2016, vol. 6, no. 4, pp. 388–391. doi: 10.1007/s12668-016-0256-3.
For citation: Kozlova O.S., Abramova Z.I. Assembly of anhydrobiotic midge Polypedilum vanderplanki genome using Illumina and PacBio data. Uchenye Zapiski Kazanskogo Universiteta. Seriya Estestvennye Nauki, 2018, vol. 160, no. 2, pp. 214–226. (In Russian)
The content is available under the license Creative Commons Attribution 4.0 License.