CHAPTER 2 ASSESSMENT OF POPULATION DIFFERENTIATION AND
2.4 D ISCUSSION
2.4.3 More markers are required to cover through the genome of S
pimpinellifolium
The observed and expected heterozygosity of this population were 0.0761 and 0.2786, respectively, slightly higher than those in previous researches (Blanca et al., 2012; Blanca et al., 2015). Since S. pimpinellifolium was detected with up to a 40%
outcrossing rate (Rick et al., 1977) and demonstrated high genetic variation, it is expected to cause rapid LD decay. In this study, LD decay was within 18 Kb throughout the genome, which was much shorter than cultivated tomatoes (Bauchet et al., 2017;
Sim et al., 2012). However, to put at least one SNP marker within each of 18 Kb intervals in this genome, the 900-Mb tomato genome would require at least 50,000 markers to fulfill QTL detection in GWAS. Therefore, acquiring many SNPs using different methods is essential to conduct a GWAS in the S. pimpinellifolium population.
Here, we proposed three possible approaches to increase markers. One is to increase the sample size evenly for each subpopulation (Brachi, Morris, & Borevitz, 2011). Since approximately 64% of alleles were rare in this population, the augmentation of the subpopulation size may adjust rare alleles to common alleles, potentially increasing the SNPs without extending coverage. One is to construct DNA libraries with a frequently cutting restriction enzyme. This approach can be simulated and optimized in silico to
balance sequencing resource between sample sizes and sequencing coverage (Shirasawa et al., 2016). Another is exome sequencing, a selective genome sequencing technology that selects desired sequencing regions by the hybridization of designed probes (Kaur &
Gaikwad, 2017). Based on tomato genome sequence information, such as the gene model or EST database, one could design different sets of probes to limit sequencing regions (Ruggieri et al., 2017). Given the approximately 110 Mb total gene length in the ITAG2.4 gene model, the potential coverage could reach 12% and all target the gene region. This exome sequencing strategy may be able to increase SNPs without increasing population size.
2.5 Reference
Alexander, D. H., Novembre, J., & Lange, K. (2009). Fast model-based estimation of ancestry in unrelated individuals. Genome Research.
https://doi.org/10.1101/gr.094052.109
Bauchet, G., Grenier, S., Samson, N., Bonnet, J., Grivet, L., & Causse, M. (2017). Use of modern tomato breeding germplasm for deciphering the genetic control of agronomical traits by Genome Wide Association study. Theoretical and Applied Genetics. https://doi.org/10.1007/s00122-017-2857-9
Blanca, J., Cañizares, J., Cordero, L., Pascual, L., Diez, M. J., & Nuez, F. (2012).
Variation Revealed by SNP Genotyping and Morphology Provides Insight into the Origin of the Tomato. PLoS ONE. https://doi.org/10.1371/journal.pone.0048198 Blanca, J., Montero-Pau, J., Sauvage, C., Bauchet, G., Illa, E., Díez, M. J., … Cañizares,
J. (2015). Genomic variation in tomato, from wild ancestors to contemporary breeding accessions. BMC Genomics. https://doi.org/10.1186/s12864-015-1444-1 Brachi, B., Morris, G. P., & Borevitz, J. O. (2011). Genome-wide association studies in
plants: The missing heritability is in the field. Genome Biology.
https://doi.org/10.1186/gb-2011-12-10-232
Bradbury, P. J., Zhang, Z., Kroon, D. E., Casstevens, T. M., Ramdoss, Y., & Buckler, E.
S. (2007). TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics. https://doi.org/10.1093/bioinformatics/btm308
Catchen, J., Hohenlohe, P. A., Bassham, S., Amores, A., & Cresko, W. A. (2013).
Stacks: An analysis tool set for population genomics. Molecular Ecology.
https://doi.org/10.1111/mec.12354
Dobritsa, A. P., & Dobritsa, S. V. (1980). DNA protection with the DNA methylase M · BbvI from Bacillus brevis var. GB against cleavage by the restriction
endonucleases PstI and PvuII. Gene.
https://doi.org/10.1016/0378-1119(80)90128-6
Etter, P. D., Bassham, S., Hohenlohe, P. A., Johnson, E. A., & Cresko, W. A. (2011).
SNP Discovery and Genotyping for Evoultionary Genetics Using RAD Sequencing. Molecular Methods for Evolutionary Genetics, 772(2), 1–19.
https://doi.org/10.1007/978-1-61779-228-1
Excoffier, L., Smouse, P. E., & Quattro, J. M. (1992). Analysis of molecular variance inferred from metric distances among DNA haplotypes: Application to human mitochondrial DNA restriction data. Genetics.
Fernandez-Pozo, N., Menda, N., Edwards, J. D., Saha, S., Tecle, I. Y., Strickler, S.
R., … Mueller, L. A. (2015). The Sol Genomics Network (SGN)-from genotype to phenotype to breeding. Nucleic Acids Research.
https://doi.org/10.1093/nar/gku1195
Fulton, T. M., Chunwongse, J., & Tanksley, S. D. (1995). Microprep protocol for extraction of DNA from tomato and other herbaceous plants. Plant Molecular
Biology Reporter, 13(3), 207–209. https://doi.org/10.1007/BF02670897 Gaunt, T. R., Rodríguez, S., & Day, I. N. M. (2007). Cubic exact solutions for the
estimation of pairwise haplotype frequencies: Implications for linkage disequilibrium analyses and a web tool “CubeX.” BMC Bioinformatics.
https://doi.org/10.1186/1471-2105-8-428
Goudet, J., & Jombart, T. (2015). hierfstat: Estimation and Tests of Hierarchical F-Statistics.
Hijmans, R. J. (2016). geosphere: Spherical Trigonometry. R package version 1.5-5.
Illumina. (2014). Infinium ® Genotyping Data Analysis. Technical Note, 10p.
https://doi.org/10.1111/j.1532-950X.2005.00092.x
Jombart, T. (2008). Adegenet: A R package for the multivariate analysis of genetic markers. Bioinformatics. https://doi.org/10.1093/bioinformatics/btn129
Kaur, P., & Gaikwad, K. (2017). From Genomes to GENE-omes: Exome Sequencing Concept and Applications in Crop Improvement. Frontiers in Plant Science.
https://doi.org/10.3389/fpls.2017.02164
Meirmans, P. G. (2012). The trouble with isolation by distance. Molecular Ecology.
https://doi.org/10.1111/j.1365-294X.2012.05578.x
Pembleton, L. W., Cogan, N. O. I., & Forster, J. W. (2013). StAMPP: An R package for calculation of genetic differentiation and structure of mixed-ploidy level
populations. Molecular Ecology Resources.
https://doi.org/10.1111/1755-0998.12129
Rao, E. S., Kadirvel, P., Symonds, R. C., Geethanjali, S., & Ebert, A. W. (2012). Using SSR markers to map genetic diversity and population structure of Solanum
pimpinellifolium for development of a core collection. Plant Genetic Resources:
Characterisation and Utilisation. https://doi.org/10.1017/S1479262111000955
Remington, D. L., Thornsberry, J. M., Matsuoka, Y., Wilson, L. M., Whitt, S. R., Doebley, J. F., … Buckler, E. S. (2001). Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proceedings of the National Academy of Sciences. https://doi.org/10.1073/pnas.201394398
Rick, C. M., Fobes, J. F., & Holle, M. (1977). Genetic variation in Lycopersicon pimpinellifolium: Evidence of evolutionary change in mating systems. Plant Systematics and Evolution. https://doi.org/10.1007/BF00984147
Rogers, J. S. (1972). Measures of similarity and genetic distance. In In Studies in Genetics VII (pp. 145–153). Austin, Texas: University of Texas Publication 7213.
Ruggieri, V., Anzar, I., Paytuvi, A., Calafiore, R., Cigliano, R. A., Sanseverino, W., &
Barone, A. (2017). Exploiting the great potential of Sequence Capture data by a new tool, SUPER-CAP. DNA Research : An International Journal for Rapid Publication of Reports on Genes and Genomes.
https://doi.org/10.1093/dnares/dsw050
Shirasawa, K., Hirakawa, H., & Isobe, S. (2016). Analytical workflow of double-digest restriction site-associated DNA sequencing based on empirical and in silico optimization in tomato. DNA Research. https://doi.org/10.1093/dnares/dsw004 Sim, S. C., Durstewitz, G., Plieske, J., Wieseke, R., Ganal, M. W., van Deynze, A., …
Francis, D. M. (2012). Development of a large SNP genotyping array and generation of high-density genetic maps in tomato. PLoS ONE.
https://doi.org/10.1371/journal.pone.0040563
Sim, S. C., van Deynze, A., Stoffel, K., Douches, D. S., Zarka, D., Ganal, M. W., … Francis, D. M. (2012). High-Density SNP Genotyping of Tomato (Solanum lycopersicum L.) Reveals Patterns of Genetic Variation Due to Breeding. PLoS ONE. https://doi.org/10.1371/journal.pone.0045520
Weir, B. S., & Cockerham, C. C. (1984). Estimating F-Statistics for the Analysis of Population Structure. Evolution, 38(6), 1358–1370.
Wright, S. (1943). Isolation by distance. Genetics, 28, 114–138.
https://doi.org/10.5194/isprs-Archives-XLII-5-W1-419-2017
Zuriaga, E., Blanca, J. M., Cordero, L., Sifres, A., Blas-Cerdán, W. G., Morales, R., &
Nuez, F. (2009). Genetic and bioclimatic variation in Solanum pimpinellifolium.
Genetic Resources and Crop Evolution.
https://doi.org/10.1007/s10722-008-9340-z