- Open Access
Evaluation of genetic diversity in short duration cotton (Gossypium hirsutum L.)
© The Author(s) 2019
- Received: 3 September 2018
- Accepted: 5 December 2018
- Published: 11 January 2019
Cotton (Gossypium hirsutum L.) is an important fiber crop in Bangladesh. Genetic diversity among the genotypes of a germplasm collection is of great importance for cotton breeding. An experiment was carried out at the experimental field of the Cotton Research, Training and Seed Multiplication Farm, Sreepur, Gazipur, during the 2015–2016 cropping season with 100 genotypes to evaluate the genetic diversity of cotton genotypes for short duration on the basis of field performance.
The genotypes under study were grouped into ten clusters through multivariate analysis using GENSTAT-5. Cluster III contained the largest number of genotypes (16), while cluster X contained the fewest (7). The inter-cluster distances were larger than the intra-cluster distances in all cases, suggesting wider genetic diversity among the genotypes of different clusters. The maximum and minimum inter-cluster distances were observed between clusters II and V (10.78) and between clusters VIII and IX (3.30), respectively, indicating a diverse relationship among the genotypes of the former pair and a close relationship among those of the latter. Earliness index, single boll weight and days to boll opening made the highest contributions to the genetic divergence among the 19 characters.
Based on the results of genetic diversity and earliness index, the genotypes from cluster II could be used as parents in a hybridization program for the development of short duration cotton varieties.
- Cotton (Gossypium hirsutum L.)
- Genetic diversity
- Earliness index
Cotton is the most important natural fiber in the world for textile manufacture, accounting for about 50% of all fibers used in the textile industry (Fryxell 1992). It is a member of the family Malvaceae and the genus Gossypium. Four species in the genus Gossypium (G. hirsutum L., G. barbadense L., G. arboreum L. and G. herbaceum L.) were domesticated independently as sources of textile fibre (Brubaker et al. 1999). Gossypium hirsutum L., known as New World or upland cotton, is a tetraploid (2n = 4x = 52) with the genome AADD (Brubaker et al. 1999). The place of origin of the genus is not known; however, the primary centres of diversity for the genus are west-central and southern Mexico (18 species), north-east Africa and Arabia (14 species) and Australia (17 species). Cotton is currently the leading fibre crop worldwide and is grown commercially in the temperate and tropical regions of more than 50 countries (Smith 1999). The major countries/regions of cotton production include the USA, India, China, the Middle East and Australia.
Cotton is one of the important cash crops and the main raw material of the textile industry in Bangladesh. It is commonly known as ‘Kapas tula’ in Bangladesh. It is primarily cultivated for its lint, which is spun into yarn. Yarn is used for textiles and several industrial purposes. Raw cotton is also used for medical and surgical purposes. Around 4%–5% of the national requirement is met through local production, and the remaining 95%–96% is met by importing raw cotton from the USA (40%), the Commonwealth of Independent States (CIS) (35%), and Australia, Pakistan, South Africa and other cotton producing countries/regions (25%) (Hamjah and Chowdhury 2014). The demand for cotton textile products is increasing day by day due to the increasing global population. Bangladesh’s cotton consumption is expected to almost double by 2022, firmly retaining its position as the world’s second largest cotton importer (Anonymous 2016).
In Bangladesh, cotton is generally grown as a rainfed crop in the northern and central regions, covering more than 32 of the 61 plain districts. Cotton is mostly cultivated as a sole crop, but farmers intend to grow three or more crops on the same land. Because of its long duration (6–7 months), cotton cannot be fitted into the existing cropping pattern. Short duration cotton varieties would enhance cotton production by increasing the cotton acreage. The achievement of earliness is a basic breeding objective in upland cotton (Egamberdiev 1996; Braden and Smith 2004). Earliness in cotton can avoid yield losses due to seasonal biotic and abiotic stresses and can increase economic return by reducing input costs, such as those of fertilizer, irrigation and labor (Ali et al. 2003). It is therefore necessary to develop short duration cotton varieties to increase both farmers’ interest and cotton yield. Cotton earliness is a quantitative trait (Kassianenko et al. 2003). Various plant characteristics have been used to determine earliness in cotton. A decrease of one node in the position of the first sympodial branch advances crop maturity by approximately 4 to 7 days (Ahmad et al. 2008; Baloch et al. 2014). Kairon and Singh (1996) determined that short duration cottons set fruits at the 4th or the 5th node, while long duration varieties set them at the 8th or the 9th node. Several other researchers (Kerby et al. 1990; Kairon and Singh 1996; Baloch and Baloch 2004) have reported a strong relationship between early maturity and lower sympodial branch node number and sympodial branch length.
Genetic diversity is the foundation for the development of new varieties. A better understanding of genetic diversity helps in selecting diverse parents for a hybridization program. There is therefore a need to characterize the available cotton genotypes using statistical tools and to utilize them in the breeding program. Cultivated cotton genotypes have a narrow genetic base (Abdukarimov et al. 2003; McCarty et al. 2005). To broaden the genetic base through a breeding program, the quantification of genetic divergence among the available germplasm is a prerequisite and a major goal in plant breeding. Information on genetic diversity within and among closely related genotypes is essential for a cogent use of germplasm (Govindaraj et al. 2015). A successful breeding program depends on thorough knowledge and understanding of the genetic diversity within and among the elite genetic materials of the existing germplasm. Such knowledge enables the plant breeder to identify promising genotypes as parental sources that will generate diverse populations for selection and for the development of improved cotton varieties. In view of the above, the current study aimed to assess the genetic diversity of short duration cotton genotypes using various plant characteristics related to earliness in cotton.
The experiment was carried out at the experimental field of the Cotton Research, Training and Seed Multiplication Farm, Cotton Development Board (CDB), Sreepur, Gazipur, during the 2015–2016 cropping season. Healthy, disease-free seeds of 100 genotypes from the Gene Bank of CDB were used as experimental materials. Conventionally, cotton is sown in Bangladesh from 15 July to 15 August and harvested from December to January. The seeds were sown on 30 July 2015. The experiment was laid out in a Randomized Complete Block Design (RCBD) with three replications. The plot size was 12.15 m². Row to row distance was 90 cm and plant to plant distance was 45 cm. There was a 90 cm gap between two replications. The whole amount of compost, 10% of the urea, 50% of the TSP and 25% of the MoP were applied as a basal dose during land preparation, and the remaining fertilizers were applied in three installments. All intercultural practices were done according to the CDB standard. Data were recorded on the following characters: days to emergence, node number bearing the first sympodial branch (NFB), number of monopodial branches per plant, number of sympodial branches per plant, number of secondary fruiting branches per plant, leaf shape and color, plant height, days to squaring, days to flowering, days to first boll opening, number of flowers per plant, number of bolls per plant, single boll weight, first picking percentage, earliness index [calculated based on Bartlett (1973)], Ginning Out Turn (GOT %) [(weight of lint/weight of seed) × 100] and seed cotton yield per plant. The collected data were statistically analyzed. Analysis of variance was performed using the general linear model procedure of the computer package SAS (2000). Mean data for each character were subjected to multivariate analysis methods, viz. Principal component analysis (PCA), Principal coordinate analysis (PCO), Canonical variate analysis (CVA) and Cluster analysis (CLSA), using GENSTAT-5.
Principal components were computed from the correlation matrix, and genotypic scores were obtained for the first component and for succeeding components with latent roots greater than unity (Mahalanobis 1936; Jeger et al. 1983). Inter-genotype distances were calculated by Principal coordinate analysis (Digby et al. 1989). The clustering was done using non-hierarchical classification. The average intra-cluster distance for each cluster was calculated from all possible D² values among the members of that cluster, obtained from the PCO after the clusters were formed. The formula used was ∑D²/n, where ∑D² is the sum of the squared distances over all possible pairwise combinations of the genotypes included in a cluster and n is the number of such combinations. The square root of the average D² value represents the distance (D) within the cluster.
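The intra-cluster distance formula above (∑D²/n followed by a square root) can be illustrated with a minimal Python sketch. The function name `intra_cluster_distance` and the small D² table are hypothetical and chosen only for illustration; they are not the study's data, and the study itself used GENSTAT-5 rather than Python.

```python
from itertools import combinations
from math import sqrt


def intra_cluster_distance(d2, members):
    """Return (average D^2, D) for one cluster.

    d2      : square symmetric table (list of lists) of pairwise D^2 values
    members : indices of the genotypes assigned to the cluster
    """
    pairs = list(combinations(members, 2))  # all possible pairwise combinations (n)
    if not pairs:                           # a single-member cluster has no pairs
        return 0.0, 0.0
    sum_d2 = sum(d2[i][j] for i, j in pairs)
    mean_d2 = sum_d2 / len(pairs)           # sum(D^2) / n
    return mean_d2, sqrt(mean_d2)           # D is the square root of the average D^2


# Hypothetical D^2 values for four genotypes (illustrative only, not study data).
d2 = [
    [0.0, 0.4, 0.9, 6.0],
    [0.4, 0.0, 0.5, 5.5],
    [0.9, 0.5, 0.0, 7.0],
    [6.0, 5.5, 7.0, 0.0],
]

# Treat genotypes 0, 1 and 2 as one cluster.
mean_d2, d = intra_cluster_distance(d2, [0, 1, 2])
print(f"average D^2 = {mean_d2:.3f}, D = {d:.3f}")
```

In this toy case the three within-cluster pairs give D² values of 0.4, 0.9 and 0.5, so n = 3 and the average D² is their mean.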
Cotton earliness is a quantitative trait that is mainly affected by environment and crop genotype (Kassianenko et al. 2003). The development of early maturing cotton varieties has become one of the important objectives of cotton breeders for several reasons; for example, short duration cotton cultivars can avoid yield losses caused by diseases, insect pests (particularly bollworms) and unfavorable weather conditions (Singh 2004). Growing early maturing cotton cultivars also leaves adequate time for rotation with other crops, allowing timely sowing of wheat in the cotton-wheat-cotton cropping system in different countries (Ali et al. 2003). Late maturity of cotton also causes poor fiber quality (Salam et al. 1993). Moreover, short duration cotton genotypes are economical with regard to cost of production because early maturing cultivars escape biotic and abiotic risks (Anderson et al. 1976; Anjum et al. 2001).
Principal component analysis (PCA)
Table: Eigenvalues and percentage of variation for 19 principal component axes in 100 cotton genotypes
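The component-selection rule described in the methods (principal components computed from the correlation matrix, retaining those with latent roots greater than unity) can be sketched in Python with NumPy. This is only an illustration under stated assumptions: the function name `pca_from_correlation` is hypothetical, the input is random numbers rather than the study's measurements, and the original analysis was run in GENSTAT-5.

```python
import numpy as np


def pca_from_correlation(X):
    """PCA on the correlation matrix of X (rows = genotypes, columns = characters)."""
    R = np.corrcoef(X, rowvar=False)           # character-by-character correlation matrix
    eigvals, eigvecs = np.linalg.eigh(R)       # eigh is appropriate because R is symmetric
    order = np.argsort(eigvals)[::-1]          # largest latent root first
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    pct = 100.0 * eigvals / eigvals.sum()      # percentage of variation per axis
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)  # standardized characters
    scores = Z @ eigvecs                       # genotype scores on every axis
    keep = eigvals > 1.0                       # latent roots greater than unity
    return eigvals, pct, scores[:, keep]


# Hypothetical input: 100 genotypes x 19 characters of random numbers (not study data).
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 19))

eigvals, pct, scores = pca_from_correlation(X)
print("latent roots:", np.round(eigvals, 2))
print("cumulative % variation:", np.round(pct.cumsum(), 1))
print("axes retained:", scores.shape[1])
```

Because the analysis starts from a correlation matrix, the latent roots sum to the number of characters (19 here), which is why a root above 1 marks an axis that explains more than a single standardized character would.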
Table: Distribution of 100 cotton genotypes into ten clusters
Cluster | Number of genotypes | Name of genotypes
I | 10 | BC-0073, BC-0074, BC-0075, BC-0211, Win all 5, SR/L-17, SR/L-26, SR/L-30, SR/L-47, SR/L-55
II | 8 | BC-0113, BC-0292, BC-0293, BC-0319, BC-0332, BC-0335, BC-0349, BC-0353
III | 16 | BC-0002, BC-0168, BC-0232, BC-0236, BC-0244, BC-0259, BC-0270, BC-0272, BC-0273, BC-0281, BC-0283, BC-0289, BC-0291, BC-0303, BC-0337, BC-0372
IV | 10 | BC-0231, BC-0295, BC-0301, BC-3004, BC-0305, BC-0358, CB-9, CB-10, CB-12, CB-13
V | 9 | BC-0112, BC-0276, BC-0312, BC-0316, BC-0318, BC-0331, BC-0369, BC-0374, SR/L-42
VI | 9 | BC-0333, BC-0354, BC-0355, BC-0359, BC-0362, BC-0366, BC-0378, BC-0383, CB-14
VII | 10 | BC-0111, BC-0119, BC-0278, BC-0279, BC-0286, BC-0308, BC-0314, BC-0322, BC-0376, BC-0382
VIII | 8 | BC-0386, BC-0390, BC-0469, BC-0476, BC-0480, BC-0481, BC-0495, SR/L-51
IX | 13 | BC-0037, BC-0306, BC-0470, BC-0475, BC-0482, BC-0483, BC-0492, BC-0496, BC-0501, BC-0505, Win all 6, SR/L-36, SR/L-56
X | 7 | BC-0294, BC-0375, BC-0493, BC-0497, BC-0502, SR-15, SR/L-14
Canonical variate analysis (CVA)
Table: Average intra- and inter-cluster distance (D²) values for 100 cotton genotypes
The maximum inter-cluster distance, observed between clusters II and V (10.78), indicated that the genotypes in these clusters were far more diverse than those of other clusters. The minimum inter-cluster distance was observed between clusters VIII and IX (3.30), indicating a close relationship among the genotypes of those clusters. The highest intra-cluster distance was found in cluster III (0.76), followed by cluster IX (0.63). The lowest intra-cluster distance was noted for cluster X (0.16). These results revealed that the genotypes in cluster III were distantly related, whereas the genotypes in cluster X were closely related. The genotypes belonging to the distant clusters (II and V) could be used in a hybridization program to obtain a wide spectrum of variation among the segregants (Ali et al. 2012).
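The way such a distance table is read (the most distant pair of clusters as candidate parents, the closest pair as near relatives, and inter-cluster distances compared with intra-cluster ones) can be sketched as follows. All cluster labels and distance values here are hypothetical and purely illustrative; they do not reproduce the study's table.

```python
# Hypothetical inter-cluster distances for four clusters labelled A-D
# (illustrative values only, not the study's distance table).
inter = {
    ("A", "B"): 4.2,
    ("A", "C"): 9.1,
    ("A", "D"): 5.0,
    ("B", "C"): 6.3,
    ("B", "D"): 2.8,
    ("C", "D"): 7.7,
}

# Hypothetical intra-cluster distances for the same clusters.
intra = {"A": 0.5, "B": 0.3, "C": 0.7, "D": 0.2}

# Most divergent pair: candidate source of diverse parents for crossing.
farthest = max(inter, key=inter.get)

# Least divergent pair: clusters whose genotypes are closely related.
closest = min(inter, key=inter.get)

# Check whether every inter-cluster distance exceeds both intra-cluster distances.
inter_exceeds_intra = all(
    dist > max(intra[a], intra[b]) for (a, b), dist in inter.items()
)

print("farthest pair:", farthest, inter[farthest])
print("closest pair:", closest, inter[closest])
print("inter > intra in all cases:", inter_exceeds_intra)
```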
Cluster mean for the characters
Table: Cluster mean for 19 characters in 100 cotton genotypes
Characters (table rows): days to emergence; number of plants at harvest; plant height (cm); days to squaring (d); days to flowering (d); days to boll opening (d); node number bearing first sympodial branch (NFB); number of monopodial branches; number of sympodial branches; number of secondary fruiting branches; number of flowers per plant; number of bolls per plant; percent boll retention (%); single boll weight (g); percent first pick (%); seed cotton yield (kg)
In cluster VI, the highest mean value was found for number of plants at harvest (26.44) and the lowest mean value for days to boll opening (105.22 d). Cluster VII showed the highest mean for number of sympodial branches per plant (18.23). Cluster VIII had the maximum cluster mean for germination percentage (95.83%) and the minimum cluster mean for number of monopodial branches per plant (0.60). Cluster IX had the lowest cluster means for days to squaring (39.54 d), days to flowering (53.15 d) and single boll weight (5.49 g). None of the genotypes included in cluster X had high mean values for any important character.
Contribution of characters towards divergence of the genotypes
Table: Relative contribution of 19 characters of cotton genotypes to the total divergence
Characters (table rows): days to emergence; number of plants at harvest; plant height (cm); days to squaring; days to flowering; days to boll opening; node number bearing first sympodial branch (NFB); number of monopodial branches; number of sympodial branches; number of secondary fruiting branches; number of flowers per plant; number of bolls per plant; percent boll retention; single boll weight; seed cotton yield; percent first pick
In Vector I (the major axis of differentiation), the important characters for genetic divergence were days to emergence, germination percentage and GOT, all of which had positive vector values, while in Vector II (the second axis of differentiation), plant height, node number bearing the first sympodial branch (NFB), number of sympodial branches per plant, number of flowers per plant, number of bolls per plant and percent boll retention were important.
The genetic divergence of 100 upland cotton genotypes was investigated for short duration, yield related attributes and seed cotton yield on the basis of field performance. Principal component analysis showed that 10 components played a major role in the total diversity, and cluster analysis helped to identify superior genotypes for further utilization in breeding programs. Cluster analysis showed that cluster III contained the largest number of genotypes (16), while cluster X contained the fewest (7). The maximum and minimum inter-cluster distances were observed between clusters II and V (10.78) and between clusters VIII and IX (3.30), respectively. Based on the genetic diversity analysis, the genotypes from clusters II (BC-0349), VI (BC-0378, CB-14) and IX (Win all 6) could be used as parents in a hybridization program.
The authors acknowledge the support of the Cotton Development Board (CDB), Dhaka, Bangladesh, for providing all research inputs and bearing the cost of the field experiment. The authors also gratefully acknowledge the support of the Bangabandhu Sheikh Mujibur Rahman Agricultural University authority.
Availability of data and materials
The data generated and analyzed during the current study are available from the corresponding author(s) on reasonable request.
Data sharing not applicable for this article as all datasets were presented in the manuscript.
Akter T is an MS student who generated the data of the current study and wrote the manuscript. Islam AKMA, Rasul MG and Ahmed JU are the members of her advisory committee who designed the experiments, analysed and interpreted the data and critically reviewed the manuscript. Kundu S provided the cotton germplasm, and Khalequzzaman M provided facilities and monitored the field experiment. All authors read and approved the final manuscript and have made substantive intellectual contributions to the manuscript.
Ethics approval and consent to participate
Consent for publication
All co-authors have consented to the submission of the manuscript.
The authors declare that they have no competing interests.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Abdukarimov AS, Djataev S, Abdukarimov I. Cotton research in Uzbekistan: elite varieties and future cotton breeding. Proceeding World Cotton Research Conference, Cape Town, South Africa. 9–13 March 2003; p. 5–15.
- Ahmad SAE, Ahmad S, Ashraf MUH, et al. Assessment of yield related morphological measures for earliness in upland cotton (Gossypium hirsutum L.). Pak J Bot. 2008;40(3):1201–7.
- Ali CR, Arshad M, Khan MI, Fzal M. Study of earliness in commercial cotton (G. hirsutum L.) genotypes. J Res Sci. 2003;14(2):153–7.
- Ali M, Mian MAK, Rasul MG, et al. Genetic diversity in local aromatic rice (Oryza sativa L.) genotypes. Bangladesh J Plant Breed Genet. 2012;25(2):33–40.
- Anderson JM, Bridge RR, Heagler AM, Tupper GR. The economic impact of recently developed early season cotton strains on firm and regional cropping system and income. Proceed. Beltwide Cotton Prod Res Conf. Memphis: National Cotton Council of America; 1976. p. 98–100.
- Anjum R, Soomro AR, Chang MA. Measurement of earliness in upland cotton. Pak J Biol Sci. 2001;4(4):462–3.
- Anonymous. Increasing cotton production for sustainable RMG. Bangladesh Knitwear Manufacturers & Exporters Association, The independent, 2016: http://m.theindependentbd.com/home/printnews/55950. Accessed 23 Oct 2018.
- Baloch MJ, Baloch QB. Plant characters in relation to earliness in cotton (Gossypium hirsutum). Proc Pakistan Acad Sci. 2004;41(2):103–8.
- Baloch MJ, Khan NU, Rajput MA, et al. Yield related morphological measures of short duration cotton genotypes. J Anim Plant Sci. 2014;24:1198–211.
- Bartlett MS. Some examples of statistical methods of research in agriculture and applied botany. Int J R Stat Soc B. 1973;4:37–70.
- Braden CA, Smith CW. Phenotypic measurements of fiber associations of near-long staple upland cotton. Crop Sci. 2004;44:2032–7.
- Brubaker CL, Bourland FM, Wendel JE. The origin and domestication of cotton. Chapter 1.1. In: Smith CW, Cothren JT, editors. Cotton: Origin, History, Technology, and Production. New York: Wiley; 1999. p. 3–31.
- Digby P, Galway N, Lane P. GENSTAT 5: a second course. Oxford: Oxford Science Publications; 1989. p. 103–8.
- Egamberdiev AE. Breeding for early maturing varieties of cotton. In: Proceedings of 55th plenary meeting of the ICAC, Tashkent, Uzbekistan; 1996. p. 9–12.
- Fryxell PA. A revised taxonomic interpretation of Gossypium L. (Malvaceae). Rheedea. 1992;2:108–68.
- Govindaraj MM, Vetriventhan M, Srinivasan M. Importance of genetic diversity assessment in crop plants and its recent advances: an overview of its analytical perspectives. Genet Res Int. 2015:1–14. https://doi.org/10.1155/2015/431487.
- Hamjah MA, Chowdhury MAK. Measuring climatic and hydrological effects on cash crop production and production forecasting in Bangladesh using ARIMAX model. Math Theory Model. 2014;4(6):138–52.
- Jeger MI, Garethojies D, Griffiths E. Components of partial resistance of wheat seedling of Septoria nodorum. Euphytica. 1983;32:575–84.
- Kairon MS, Singh VV. Genetic diversity of short duration cottons. In: Proceedings of 55th Plenary Meeting of the ICAC, Tashkent, Uzbekistan; 1996. p. 5–9.
- Kassianenko VA, Dragavtsev VA, Razorenov GI, et al. Variability of cotton (Gossypium hirsutum L.) with regard to earliness. Genet Resour Crop Evol. 2003;50(2):157–63.
- Kerby TA, Cassman KG, Keeley M. Genotypes and plant densities for narrow-row cotton systems. I. Height, nodes, earliness, and location of yield. Crop Sci. 1990;30(3):644–9.
- Mahalanobis PC. On the generalized distance in statistics, vol. 2. India: Proceeding of National Institute of Sciences; 1936. p. 49–55.
- McCarty JC, Jenkins JN, Wu J. Primitive accession derived germplasm by cultivar crosses as sources for cotton improvement. Crop Sci. 2005;44:1231–5.
- Salam CA, Arshad M, Afzal M. Effect of picking dates on fiber character of different commercial cotton varieties of G. hirsutum L. Pak Cotton. 1993;37(2):67–74.
- SAS Institute. The SAS system for Windows version 8. Cary: SAS Institute; 2000.
- Siddique MA, Rashid ESMH, Khalequzzaman M, et al. Genetic diversity of local rainfed rice (Oryza sativa L.). Bangladesh J Plant Breed Genet. 2010;23(2):41–6. https://doi.org/10.3329/bjpbg.v23i2.9324.
- Singh P. Cotton breeding. 2nd ed. New Delhi: Kalyani Pub; 2004. p. 118.
- Smith WC. Production statistics. Chapter 3.1. In: Smith WC, Cothren JT, editors. Cotton: Origin, History, Technology and Production. New York: Wiley; 1999. p. 435–49.