Skip to main content

Evaluation of genetic diversity in short duration cotton (Gossypium hirsutum L.)



Cotton (Gossypium hirsutum L.) is an important fiber crop in Bangladesh. Genetic diversity among the genotypes of a germplasm has a great importance for cotton breeding. An experiment was carried out at the experimental field of Cotton Research, Training and Seed Multiplication Farm, Sreepur, Gazipur during the cropping season of 2015–2016 with 100 genotypes to evaluate genetic diversity of cotton genotypes for short duration using field performance.


The genotypes under study were grouped into ten clusters through multivariate analysis using GENSTAT-5. Cluster III contained maximum number of genotypes (16) while cluster X contained the least number of genotypes (7). The inter cluster distances were larger than intra cluster distances in all cases suggesting wider genetic diversity among the genotypes of different clusters. The maximum and minimum inter cluster distances were observed between clusters II and V (10.78) and clusters VIII and IX (3.30), respectively. The results indicated diverse and close relationship among the genotypes of those clusters. Earliness index, single boll weight and days to boll opening showed the higher contribution to the genetic divergence among 19 characters.


Based on the results of genetic diversity and earliness index, the genotypes from cluster II could be used as parent in hybridization program for the development of short duration cotton variety.


Cotton is the most important natural fiber in the world for textile manufacture, accounting for about 50% of all fibers used in the textile industry (Fryxell 1992). It is the member of Malvaceae family and genus Gossypium. There are four species in the genus GossypiumG. hirsutum L., G. barbadense L., G. arboreum L. and G. herbaceum L.— that were domesticated independently as source of textile fibre (Brubaker et al. 1999). Gossypium hirsutum L. is known as New World or upland cotton having tetraploid (2n = 4x = 52) with the genome AADD (Brubaker et al. 1999). The place of origin of the genus is not known, however, the primary centres of diversity for the genus are west-central and southern Mexico (18 species), north-east Africa and Arabia (14 species) and Australia (17 species). Cotton is currently the leading fibre crop worldwide and is grown commercially in the temperate and tropical regions of more than 50 countries (Smith 1999). The major countries/regions of cotton production include USA, India, China, the Middle East and Australia.

Cotton is one of the important cash crops and the main raw materials of textile industry in Bangladesh. It is commonly known as ‘Kapas tula’ in Bangladesh. It is primarily cultivated for its lint, which is spun into yarn. Yarn is used for textile and several industrial uses. Raw cotton is also used for medical and surgical purposes. Around 4%–5% of the national requirement is fulfilled through the local production and remaining 95%–96% is fulfilled by importing raw cotton from USA (40%), Commonwealth of Independent States (CIS) (35%), Australia, Pakistan, South Africa and other cotton producing countries/regions (25%) (Hamjah and Chowdhury 2014). The demand of textile products of cotton is increasing day by day due to increasing global population. Bangladesh’s cotton consumption is expected to almost double by 2022, strongly retaining its position of world’s second largest cotton importers (Anonymous 2016).

In Bangladesh, cotton is generally grown as a rain fed crop in northern region and central region covering more than 32 districts out of 61 plain districts. Cotton is mostly cultivated as sole crop, but farmers are intended to grow three or more crops on the same land. Due to long duration (6–7 months) cotton can’t be fixed in the exiting cropping pattern. Short duration cotton varieties will enhance production of cotton by enhancing acreage of cotton. The achievement of earliness is a basic breeding objective in upland cotton (Egamberdiev 1996; Braden and Smith 2004). Earliness in cotton can avoid yield losses due to seasonal threat of biotic and abiotic stresses and increase in economic return by reducing input cost like less input of fertilizer and irrigation and also less labor cost (Ali et al. 2003). So it is necessary to develop short duration cotton varieties to increase the interest of the farmer and cotton yield. Cotton earliness is a quantitative trait (Kassianenko et al. 2003). Various plant characteristics have been used to determine earliness in cotton. One node decrease in the first sympodial branch matures the cotton crop by approximately 4 to 7 days earlier (Ahmad et al. 2008; Baloch et al. 2014). Kairon and Singh (1996) determined that short duration cottons set fruits at the 4th or the 5th node while long duration varieties set them at the 8th or the 9th node. Several other researchers (Kerby et al. 1990; Kairon and Singh 1996; Baloch and Baloch 2004) have reported strong relationship between early maturity and lower sympodial branch node number and sympodial branch length.

Genetic diversity is the foundation for the development of new varieties. Better understanding of genetic diversity will help to select diverse parents for hybridization program. So there is a need to characterize available cotton genotypes using statistical tools and to utilize them in the breeding program. Cultivated cotton genotypes have narrow genetic base (Abdukarimov et al. 2003; McCarty et al. 2005). To broaden the genetic base through breeding program, the quantification of genetic divergence among available germplasm is prerequisite and a major goal in plant breeding. Information on genetic diversity within and among closely related genotypes is essential for a cogent use of germplasm (Govindaraj et al. 2015). Successful breeding program depends on the inclusive knowledge and understanding of the genetic diversity within and among the elite genetic materials of the existing germplasm. It enables plant breeder to identify promising genotypes as parental sources that will generate diverse populations for selection and for the development of improved cotton varieties. Keeping in view the above stated informations, the current study was aimed to assess genetic diversity of short duration cotton genotypes using various plant characteristics related to earliness in cotton.

Materials and methods

The experiment was carried out at the experimental field of Cotton Research, Training and Seed Multiplication Farm, Cotton Development Board (CDB), Sreepur, Gazipur during the cropping season 2015–2016. The healthy and disease–free seeds of 100 genotypes from the Gene Bank of CDB were used as experimental materials. Conventionally the time of cotton sowing in Bangladesh is from 15 July to 15 August, while harvest during December to January. The seeds were sown on 30th July 2015. The experiment was laid out in a Randomized Complete Block Design (RCBD) with three replications. The plot size was 12.15 m2. Row to row distance was 90 cm and plant to plant distance was 45 cm. There was 90 cm gap between two replications. Whole amount of compost, 10% urea, 50% TSP and 25% MoP were applied as basal dose during land preparation and remaining fertilizer were applied at three installments. All intercultural practices were done according to CDB standard. Data were recorded on the following characters: days to emergence, node number bearing first sympodial branch (NFB), number of monopodial branches per plant, number of sympodial branches per plant, number of secondary fruiting branches per plant, leaf shape and color, plant height, days to squaring, days to flowering, days to first boll opening, number of flowers per plant, number of bolls per plant, single boll weight, first picking percentage, earliness index [calculated based on Bartlett (1973)], Ginning Out Turn (GOT %) [(weight of lint/weight of seed)*100] and seed cotton yield per plant. The collected data were statistically analyzed. Analysis of variance was performed by using the help of general linear model procedure of computer package SAS (2000). Mean data for each character were subjected to multivariate analysis methods viz, Principal component analysis (PCA), Principal coordinate analysis (PCO), Canonical variate analysis (CVA) and Cluster analysis (CLSA) using GENSTAT-5. Principal components were computed from the correlation matrix and genotypic scores obtained for the first component and succeeding components with latent roots greater than unity (Mahalonobis 1936; Jeger et al. 1983). Inter-distances between genotypes were calculated by Principal coordinate analysis (Digby et al. 1989). The clustering was done using non-hierarchical classification. Computation of average intra-cluster distance for each cluster was calculated by taking possible D2 values within the members of a cluster obtained from the PCO after the clusters are formed. The utilized formula was ∑D2/n, where ∑D2 is the sum of distances between all possible combinations (n) of the genotypes included in a cluster. The square root of the average D2 value represents the distance (D) within cluster.

Results and discussion

Cotton earliness is a quantitative trait which is mainly affected by environment and crop genotype (Kassianenko et al. 2003). Development of early maturing cotton varieties nowadays has become one of the important objectives of cotton breeders because of many reasons, such as short duration cotton cultivars can avoid yield losses that occur due to diseases, insect-pest (particularly bollworms) unfavorable and weather conditions (Singh 2004). The growing of early maturing cotton cultivars has an advantage of proper time for rotation of other crops allowing timely sowing of wheat in cotton-wheat-cotton cropping system in different countries (Ali et al. 2003). Late maturity of cotton also causes poor fiber quality (Salam et al. 1993). Moreover, the short duration cotton genotypes are economical regarding cost of production because early maturing cultivars evade from biotic and abiotic risks (Anderson et al. 1976; Anjum et al. 2001).

Principal component analysis (PCA)

The principal component analysis (PCA) provided eigen values and percent for 19 principal component axes in 100 cotton genotypes (Table 1). The results showed that the first four principal components accounted for 53.23% of the total variation among the 19 component axes of the total genotypes, whereas the eigen values were found as 83.16% in the first seven component axes.

Table 1 Eigen values and percentage of variation for 19 principal component axes in 100 cotton genotypes

Non-heirachical clustering

Based on non-heirachical clustering, the 100 cotton genotypes were grouped into ten different clusters using co-variance matrix. Distribution of genotypes into different clusters is presented in Table 2. Cluster III contained maximum number of genotypes (16) while cluster IX contained 13 genotypes and cluster I, IV, VII contained ten genotypes, respectively (Table 2). Cluster X contained the least number of genotypes (seven). BC-0358 was grouped in cluster IV that was resulted in maximum yield with minimum days to boll opening in this study.

Table 2 Distribution of 100 cotton genotypes into ten clusters

Conical variate analysis (CVA)

Intra and inter cluster distances (D2) values for 100 cotton genotypes grouped into 10 clusters are presented in Table 3. The more inter and intra cluster distances, the more variability among the genotypes between and within cluster, respectively, and vice versa. The results obtained from the present study indicated that the inter cluster distances were larger than intra cluster distances in all cases suggesting wider genetic diversity among the genotypes of different groups. The results are in agreement with Ali et al. (2012). The maximum inter cluster distance was observed between clusters II and V (10.78) followed by IV and V (9.37).

Table 3 Average intra and inter cluster distances (D2) values for 100 cotton genotypes

The maximum inter-cluster distance indicated that the genotypes in these clusters was far diverse than those of other clusters. The minimum inter-cluster distance was observed between clusters VIII and IX (3.30) indicating a close relationship among the genotypes of those clusters. The highest intra-cluster distance was found in cluster III (0.76) followed by cluster IX (0.63). The lowest intra-cluster distance was noticed for cluster X (0.16). These results revealed that the genotypes in cluster III were distantly related; on the other hand the genotypes in cluster X were closely related. The genotypes belonging to the distant clusters (II and V) could be used in hybridization program for obtaining a wide spectrum of variation among the segregates (Ali et al. 2012).

Cluster mean for the characters

The mean performances of 21 characters in six clusters are shown in Table 4. Most of the characters showed distinct difference among the clusters. In cluster I, it revealed the highest values for the character percent boll retention (74.25) and GOT (40.03%). Cluster II produced the highest cluster mean for the character number of secondary fruiting branches per plant (5.62) and earliness index (0.90) and the lowest cluster mean for plant height (115.88 cm) and node number of bearing first sympodial branch (NFB) (6.53). Cluster III had the minimum cluster mean values observed for the character days to emergence (3.56 days). Cluster IV comprised the highest mean for seed cotton yield (2.12 t/ha). Cluster V had the maximum mean values for the character number of flowers per plant (46.38) and number of bolls per plant (30.56).

Table 4 Cluster mean for 19 characters in 100 cotton genotypes

In cluster VI, the highest mean value was found for number of plant at harvest (26.44) and the lowest mean value for the character days to boll opening (105.22 d). Cluster VII showed the highest mean for the character number of sympodial branches per plant (18.23). Cluster VIII had the maximum cluster mean value for the character germination percentage (95.83%) and minimum cluster mean value for the character number of monopodial branches per plant (0.60). Cluster IX had the lowest cluster mean values for the character days to squaring (39.54 d), days to flowering (53.15 d) and single boll weight (5.49 g). None of the genotypes included in the cluster X having high mean values for any important characters.

Contribution of characters towards divergence of the genotypes

The characters contributing maximum to the divergence are given greater emphasis for deciding of the cluster for the purpose of further selection and the choice of parents for hybridization (Siddique et al. 2010). The relative contribution of 19 characters of the 100 genotypes towards the total divergence is presented in Table 5. Earliness index, single boll weight and days to boll opening had positive values in both vectors which indicated that they were the important component characters having higher contribution to the genetic divergence among genotypes studied. On the other hand, negative values in both vectors for number of plant at harvest, number of monopodial branch, number of secondary fruiting branch, days to flowering, days to squaring, percent first picks and seed cotton yield indicated the lowest contribution of these characters towards the divergence among 19 characters.

Table 5 Relative contribution of 19 characters of cotton genotypes to the total divergence

In Vector I (major axis of differentiation), important characters for genetic divergence were days to emergence, germination percentage and GOT having positive vector values, while in Vector II, the second axis of differentiation, plant height, node number of bearing first sympodial branch (NFB), number of sympodial branches per plant, number of flowers per plant, number of bolls per plant and percent boll retention were important.


Genetic divergence of 100 upland cotton genotypes were investigated for short duration, yield related attributes and seed cotton yield using field performance. Principal component analysis showed that 10 components showed major role in total diversity and cluster analysis helped in the identification of superior genotypes for further utilization in breeding program. Cluster analysis showed that cluster III contained maximum number of genotypes (16) while cluster X contained the least number of genotypes(7). The maximum and minimum inter-cluster distances were observed between clusters II and V (10.78) and VIII and IX (3.30), respectively. Based on genetic diversity analysis, the genotypes from cluster II (BC 349), VI (BC 0378, CB 14) and IX (Win all 6) could be used as parents in hybridization program.


  1. Abdukarimov AS, Djataev S, Abdukarimov I. Cotton research in Uzbekistan: elite varieties and future cotton breeding. Proceeding World Cotton Research Conference, Cape Town, South Africa. 9–13 March 2003; p. 5–15.

  2. Ahmad SAE, Ahmad S, Ashraf MUH, et al. Assessment of yield related morhological measures for earliness in upland cotton (Gossypium hirsutum L.). Pak J Bot. 2008;40(3):1201–7.

    Google Scholar 

  3. Ali CR, Arshad M, Khan MI, Fzal M. Study of earliness in commercial cotton (G. hirsutum L.) genotypes. J Res Sci. 2003;14(2):153–7.

    Google Scholar 

  4. Ali M, Mian MAK, Rasul MG, et al. Genetic diversity in local aromatic rice (Oryza sativa L.) genotypes. Bangladesh J Plant Breed Genet. 2012;25(2):33–40.

  5. Anderson JM, Bridge RR, Heagler AM, Tupper GR. The economic impact of recently developed early season cotton strains on firm and regional cropping system and income. Proceed. Beltwide Cotton Prod Res Conf. Memphis: National Cotton Council of America; 1976. p. 98–100.

    Google Scholar 

  6. Anjum R, Soomro AR, Chang MA. Measurement of earliness in upland cotton. Pak J Biol Sci. 2001;4(4):462–3.

    Article  Google Scholar 

  7. Anonymous. Increasing cotton production for sustainable RMG. Bangladesh Knitwear Manufacturers & Exporters Association, The independent, 2016: Access on 23 Oct 2018.

  8. Baloch MJ, Baloch QB. Plant characters in relation to earliness in cotton (Gossypium hirsutum). Proc Pakistan Acad Sci. 2004;41(2):103–8.

    Google Scholar 

  9. Baloch MJ, Khan NU, Rajput MA, et al. Yield related morphological measures of short duration cotton genotypes. J Anim Plant Sci. 2014;24:1198–211.

    Google Scholar 

  10. Bartlett MS. Some examples of statistical methods of research in agriculture and applied botany. Int J R Stat Soc B. 1973;4:37–70.

    Google Scholar 

  11. Braden CA, Smith CW. Phenotypic measurements of fiber associations of near-long staple upland cotton. Crop Sci. 2004;44:2032–7.

    Article  Google Scholar 

  12. Brubaker CL, Bourland FM, Wendel JE. The origin and domestication of cotton. Chapter 1.1. In: Smith CW, Cothren JT, editors. Cotton: Origin, History, Technology, and Production. New York: Wiley; 1999. p. 3–31.

    Google Scholar 

  13. Digby P, Galway N, Lane P. GENSTAT 5: a second course. Oxford: Oxford Science Publications; 1989. p. 103–8.

    Google Scholar 

  14. Egamberdiev AE. Breeding for early maturing varieties of cotton. In: Proceedings of 55th plenary meeting of the ICAC, Tashkent, Uzbekistan; 1996. p. 9–12.

    Google Scholar 

  15. Fryxell PA. A revised taxonomic interpretation of Gossypium L. (Malvacea). Rheedae. 1992;2:108–68.

    Google Scholar 

  16. Govindaraj MM, Vetriventhan M, Srinivasan M. Importance of genetic diversity assessment in crop plants and its recent advances: an overview of its analytical perspectives. Genet Res Int. 2015:1–14.

  17. Hamjah MA, Chowdhury MAK. Measuring climatic and hydrological effects on cash crop production and production forecasting in Bangladesh using ARIMAX model. Math Theory Model. 2014;4(6):138–52.

    Google Scholar 

  18. Jeger MI, Garethojies D, Griffiths E. Components of partial resistance of wheat seedling of Septoria nodorum. Euphytica. 1983;32:575–84.

    Article  Google Scholar 

  19. Kairon MS, Singh VV. Genetic diversity of short duration cottons. In: Proceedings of 55th Plenary Meeting of the ICAC, Tashkent, Uzbekistan; 1996. p. 5–9.

  20. Kassianenko VA, Dragavtsev VA, Razorenov GI, et al. Variability of cotton (Gossypium hirsutum L.) with regard to earliness. Genet Resour Crop Evol. 2003;50(2):157–63.

    Article  Google Scholar 

  21. Kerby TA, Cassman KG, Keeley M. Genotypes and plant densities for narrow-row cotton systems. I. Height, nodes, earliness, and location of yield. Crop Sci. 1990;30(3):644–9.

    Article  Google Scholar 

  22. Mahalonobis PC. On the generalized distance in statistics, vol. 2. India: Proceeding of National Institute of Sciences; 1936. p. 49–55.

    Google Scholar 

  23. McCarty JC, Jenkins JN, Wu J. Primitive accession derived germplasm by cultivar crosses as sources for cotton improvement. Crop Sci. 2005;44:1231–5.

    Article  Google Scholar 

  24. Salam CA, Arshad M, Afzal M. Effect of picking dates on fiber character of different commercial cotton varieties of G. hirsutum L. Pak Cotton. 1993;37(2):67–74.

    Google Scholar 

  25. SAS Institute. The SAS system for Windows version 8. Cary: SAS Institute; 2000.

  26. Siddique MA, Rashid ESMH, Khalequzzaman M, et al. Genetic diversity of local rainfed rice (Oryza sativa L.). Bangladesh J Plant Breed Genet. 2010;23(2):41–6.

  27. Singh P. Cotton breeding. 2nd ed. New Delhi: Kalyani Pub; 2004. p. 118.

    Google Scholar 

  28. Smith WC. Production statistics. Chapter 3.1. In: Smith WC, Cothren JT, editors. Cotton: Origin, History, Technology and Production. New York: Wiley; 1999. p. 435–49.

    Google Scholar 

Download references


The author acknowledges the support of the Cotton Development Board (CDB), Dhaka, Bangladesh for providing all research inputs and bearing the cost of field experiment. The authors would like to thank CDB authority for their support. The authors also would like to acknowledge their gratitude towards Bangabandhu Sheikh Mujibur Rahman Agricultural University authority for their support. 

Availability of data and materials

The data generated and analyzed during the current study are available from the corresponding author(s) on reasonable request.

Data sharing not applicable for this article as all datasets were presented in the manuscript.

Author information




Akter T is a MS student who generated data of the current study and wrote the manuscript. Islam AKMA, Rasul MG and Ahmed JU are the member of her advisory committee who designed the experiments, analysed and interpreted the data and critically reviewed the manuscript. Kundu S provided germplasm of cotton and Khalequzzaman M provided facilities and monitored field experiment. All authors read and approved the final manuscript and have made substantive intellectual contributions to the manuscript.

Corresponding author

Correspondence to A. K. M. A. ISLAM.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

All co-authors has consent for submission of manuscript.

Competing interests

The authors declare that there is no competing interests.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

AKTER, T., ISLAM, A.K.M.A., RASUL, M.G. et al. Evaluation of genetic diversity in short duration cotton (Gossypium hirsutum L.). J Cotton Res 2, 1 (2019).

Download citation


  • Cotton (Gossypium hirsutum L.)
  • Genetic diverity
  • Earliness index
  • GOT
  • Yield