Pharmacophore an International Research Journal
Pharmacophore
Submit Manuscript
Open Access | Published: 2020 - Issue 6

Application of SNP Marker For the Identification of Gene Variation Relating to Flowering Time in Soybean

 

 Truong Trong Ngon1, Tran Thi Thanh Thuy2*, Suk-Ha Lee3

 

 

  1. Biotechnology R & D Insititute, Can Tho University, Can Tho, Vietnam.
  2. College of Agriculture, Can Tho University, Can Tho, Vietnam.
  3. Department of Crop Science and Biotechnology, Seoul National University, Seoul, Republic of Korea.

ABSTRACT

Flowering at the right time requires the perception and processing of a diverse range of environmental and internal signals. Ninety-five soybean cultivars were used for this study. Eight ESTs relating flowering genes were selected from the soybean cDNA library in NCBI and TIGR. Primer 3 was used to design EST fragments of 500-800bp. Seven ESTs detected 39 different SNP positions. All genes contained exon region except gene Cry 02. Besides, Cop 01 ESTs contained both exon and intron regions. The same ESTs gave different LD among cultivar groups. The LD decay showed that it varied from one cultivar group to another, and from one EST to another. The flowering time of soybean varies a lot from one place to another. Therefore, gene variation at the nucleotide level will be useful for breeders to make a strategy for soybean improvement in the future.

Keywords: Gene diversity; Linkage group; Single Nucleotide Polymorphisms (SNPs).


Introduction

The switch development from vegetative to reproductive stages is the major transition in flowering plants [1-3]. The transition time from vegetative to reproductive phases of the plant is always an interaction between genotype and environment. Flowering depends on the season and growing place and it plays an important role in adaptation to the environment.   Thus, the flowering mechanism in higher plants is not only induced by environmental factors like light, and temperature but also controlled by polygenes [4]. Photoperiodism is a factor helping plant response to the length of day and night. Most soybean cultivars belong to the short-day plant group [5]. They flower when photoperiodism is shorter than a critical photoperiod [6]. Cober et al. (1996) [7] reported that there are seven genes affected flowering time and maturity. Under a long day, the dominant alleles suppress and delay flowering time as well as maturity; however, each gene can affect differently [7]. In their study, Tasma & Shoemaker (2003) [8] determined ten gene candidates influencing the flowering time of soybean; these genes were located on ten linkage groups.

Single nucleotide polymorphisms (SNPs) are a useful tool for discovering and exploiting genes more effectively in plant breeding. The nature and frequency of SNP in plants are beginning to receive significant attention. Moreover, SNP markers associated with traits have been discovered in onion, soybean, and rice [9]. Until now, SNP application is useful in plant breeding as it can be automated by a machine called Next Generation Sequence (NGS) [10-12]. The aim of this study was to discover gene variation at the nucleotide level among different soybean cultivar groups.

Materials and Methods

Soybean plant materials

Ninety-five soybean cultivars from Vietnam, China, Japan, and Korea were used to discover SNP in soybean. These cultivars were originated from different ecological regions. They had diverse flowering time. Besides, maturity for each cultivar group was also different. These materials were good for researching gene variation.

DNA extraction

DNA extraction can be carried out by fresh leaves from young seedlings, from 15 to 20 days old, as described by Shure et al. (1983) [13].  DNA concentration was measured by using a Hoechst dye-base protocol for fluorescence spectrophotometer (Model F-4500, Hitachi Lt., Ibaragi, Japan). The DNA solutions were diluted to working concentration with Tris-EDTA buffer (pH 8.0) and stored at –20oC until use.

Design of PCR Primers and Extension Primers

Seven sequences were selected from the soybean cDNA library derived from clones of the NCBI gene bank. One gene of Tentative Consensus sequences (TCs) was chosen from the soybean cDNA library available from The Institute for Genomic Research (TIGR) databases (http://www.tigr.org) (Table 1).

Table 1.  Soybean ESTs selected from GeneBank and primer sequence information

Accession no.

Description

Forward

Reverse

AI900864

COL02 (Constans like 2)

ATGTCGTATTCGAGTGGGATTG

GTTCGTACCGATTCGATTCTTC

AI900211

GT02

GAGACAATGGCTTTGCTCAATA

GTTGTTGTTGTTGTCGGTTGTT

AW309100

CRY02(Cryptochrome2)

TTGTTTCAACTTTCCCTTCACC

GGTAGCCATTGCCTCACATATT

AW099970

GAI

ATCCAATAGGTTCGGGCATC

AAGCATCCTTGATTCTCTTCCA

AI495592

CCA1

GATCAAATGCTCAAACGGTACT

AGATGTTCGGTAGAGGCCAAT

AW186405

COP1

TCCTTGGTGCACATTAGCTG

AATTGGCACAGGCAGTGATT

AW200732

PHYB (Phytochrome B)

CAGACAAAGCACATGTCACTCA

GCAGCAAGAAATCAGACAACAC

 

Primers were designed using Primer 3 software [14]. The PCR primers were designed to have melting temperatures from 58oC to 68oC, lengths between 20 and 25 bases, and GC content between 40% and 60%.

Initial examination of PCR Primers

All PCR primers were used to amplify the genomic DNA of 95 soybean cultivars. Amplification reaction used standard PCR reagents including 200ng of genomic DNA, 0.8mM of each forward and reverse primer, 200mM of each nucleotide, 10x reaction buffer (750mM Tris-HCl pH 8.5, 200mM (NH4)2SO4, 25mM MgCl2). The total volume of Taq polymerase (Vivagen, Korea) reaction was 50ml. PCR cycling conditions were as follows: 4-minute initial denaturation at 94oC, 30-second denaturation at 94oC, 30-second annealing at 58oC-68oC (depending upon optimal annealing indicated by Primer3), and 1-minute extension at 72oC for 30 cycles on a PTC-225 Peltier Thermal Cycler (MJ Research, Inc. Watertown, MA, USA). PCR products were resolved on 1.0% Ethidium-stained agarose gels. Only PCR products amplified as a single fragment were used for further sequencing reactions. Those produced no or mismatch products were further tested using lower or higher annealing temperature until producing a single amplicon of the predicted length.

Direct Sequencing of PCR products

PCR products producing a single discrete band were purified by AccuPrepâ PCR purification kit (Bioneer, Korea). The purified PCR product was directly sequenced using one of the PCR primers with the BigDye Terminator Cycle Sequencing Kit (Applied Biosystem, Foster City, CA, USA). The reaction reagents including 50ng of template DNA, 0.64mM of primer, 1x reaction buffer (400mM Tris-HCl pH 9.0. 10mM MgCl2), 0.3ml of BigDye Terminator in a total volume of 5ml. PCR cycling conditions were as follows: 4 min initial denaturation at 94oC, 10 seconds denaturation at 90oC, 5 seconds annealing at 50oC, and 1 min extension at 72oC for 40 cycles on an MJ Tetrad thermocycler (MA, USA). The labeling reaction mixture was ethanol-precipitated and resuspended in 10ml of distilled water. ABI 3700 sequencer (Applied Biosystems, Foster City, CA, USA) was used to analyze sequences. Those primers that produced high-quality sequences were used to amplify and sequence the genomic DNA of 95 soybean cultivars as described above.

SNPs Discovery

The sequence data from 95 soybean cultivars were analyzed with SeqScapeâ Software (Applied Biosystem, Foster City, CA, USA). SeqScape is a sequence comparison tool for variant identification, SNP discovery, and validation. It considers alignment depth, the base appears in each of the sequences with base high quality. Putative SNPs were only accepted as true sequence variants if the quality value exceeded 20. It means a 1% chance basecall is incorrect.

Nucleotide Diversity (q )

Nucleotide diversity (q ) was calculated by the method described by Halushka et al. (1999) [15].

q  = K/aL                         a = 

where K is the number of SNPs identified in an alignment length, n is alleles and L is the total length of the sequence (bp).

Results and Discussion

Characterization of SNP marker

The amplified consensus sequence length varied from 270bp to 794bp (Table 2). All SNP markers contained exon, except Col 02. Besides, Cop 01 consists of both exon and intron.

Table 2. Characterization of eight ESTs

Genes

Consensus sequence length

Sequence region

No. of SNP

 

(bp)

Exon

Intron

 

Col 02

301

1-301

 

2

GT 02

359

1-359

 

3

Cry 02

545

 

1-545

7

GAI

247

1-247

 

2

CCA 1

270

1-270

 

1

Cop 01

701

559-652

1-558; 653-701

5

Phy B

331

1-331

 

1

 

Our SNP discovery research in soybean focused on genes designed from EST using 95 cultivars. Exon had the shortest length of 63 bp for Cop 01, and the longest length of 359 bp for GT 02. The more SNP positions, the more diverse in observed genes. The diversity of gene controlling flowering time affected phenotypic variation in the soybean cultivar group because gene function codes RNA messenger and then controls the synthesis of protein.

The diversity of gene controlling flowering time affected phenotypic variation in the soybean cultivar group because gene function codes RNA messenger and then controls the synthesis of protein. Protein and function of seven ESTs related to flowering time were shown in Table 3. The function of eight ESTs responses to light, temperature, and circadian clock regulation, respectively to promote flowering. Of seven EST-encoded proteins, only the protein of two ESTs (GT 2 and Cop 1) was not yet classified (Table 3).

 

Table 3. Function of proteins encoded by genes related to the flowering time

Genes

Protein encoded

Function

References

Col 2

Constant like protein

Promote flowering in response to inductive photoperiod

Puterill et al., 1995.

GT 2

Not yet classified

Light regulation

Smalle et al., 1998.

Cry 2

Flavoprotein

Blue-light Photoreceptor

Lin et al., 1996.

GAI

Member of a novel family with putative transcription factor

Promote flowering in response to non-inductive photoperiod

Peng et al., 1997.

CCA 1

MYB-related transcriptor

Circadian clock Regulation

Wang & Tobin, 1998.

Cop 1

Not yet classified

Light regulation

Deng et al., 1992.

Phy B

Apoprotein

R-FR light photoreceptor

Sharrock & Quail, 1989.

 

Variation in seven ESTs

Gene variations contained three types: indels, transversions, and transitions, but transversion types were common (Table 4). For each position, the Chinese cultivar group gave many gene variations as compared with the other cultivar groups. Our results contrast with the ratio of transitions to transversions among 95 genotypes. This was similar to the results of Zhu et al. (2003) [16]. A summary of multiple- and single-nucleotide substations, as well as multiple- and single-base indels, is presented in Table 4.

Table 4.  Gene variation types of seven ESTs

ESTs

SNP

Mutant types

Number of cultivars

 

position

 

 

China

Japan

Korea

Vietnam

Total

Col 02

104

Transversion

T

22

15

17

24

78

 

 

 

A

2

8

7

0

17

 

267

Transition

A

22

14

17

24

77

 

 

 

G

2

9

7

0

18

GT 02

240

Transition

G

24

22

23

24

93

 

 

 

A

0

0

1

0

1

 

284

Transversion

G

0

0

1

0

1

 

 

 

T

24

22

23

24

93

 

290

Transversion

C

24

22

23

24

93

 

 

 

A

0

0

1

0

1

Cry 02

137

Insertion

A

23

23

24

24

94

 

 

Deletion

-

1

0

0

0

1

 

146;1-

Insertion

CTTCT

2

0

0

6

2

 

146;5

Deletion

-----

22

23

24

18

87

 

193

Transversion

A

24

21

23

24

92

 

 

 

C

0

2

1

0

3

 

211

Transversion

A

23

23

24

24

94

 

 

 

C

1

0

0

0

1

 

268

Transversion

A

23

23

24

24

94

 

 

 

T

1

0

0

0

1

 

298

Transversion

G

23

23

24

24

94

 

 

 

C

1

0

0

0

1

 

320

Transition

G

23

23

24

24

94

 

 

 

A

1

0

0

0

1

GAI

229

Transition

G

24

16

18

21

79

 

 

 

A

0

6

6

1

13

CCA 1

69:1

Insertion

A

1

1

1

1

4

 

 

Deletion

-

23

22

23

21

89

 

150

Transition

G

20

23

23

22

88

 

 

 

A

4

0

1

0

5

Cop 01

266

Tranversion

T

22

23

21

20

86

 

 

 

A

2

0

2

4

8

 

468

Transition

T

22

23

21

20

86

 

 

 

C

2

0

2

4

8

 

516

Transversion

T

0

1

0

5

6

 

 

 

A

24

22

23

19

88

 

530

Transversion

C

22

23

21

21

87

 

 

 

A

2

0

2

3

7

 

588

Transition

G

22

23

21

20

86

 

 

 

T

2

0

2

4

8

Phy B

254

Insertion

AG

24

23

24

22

93

 

 

Deletion

--

0

0

0

2

2

Seven observed ESTs detected about 39 SNP. Among them, Cry 02 gave high SNP positions (Table 4). Chinese, Japanese, and Korean cultivar groups had more variation than the Vietnamese cultivar group through seven ESTs observed. This demonstrated that the Vietnamese cultivar group had narrow genetic variation. This indicated that selection affected the Vietnamese cultivar group, or the mutation rate was low. It is difficult to get a genetic advance with a narrow genetic variation source in plant breeding.

DNA Polymorphism at seven ESTs

Nucleotide polymorphism can be measured by many methods, for example, haplotype (gene) diversity, nucleotide diversity, Pi (p) [17], Theta (q) (per site), etc.

In our study, nucleotide diversity was estimated as theta (q), the number of segregating sites [18], and its standard deviation (Sq). These parameters were estimated by DNA Sequence polymorphism software version 4.0 [19]. Theta value explained the nucleotide diversity of sequences for each gene. The higher the values, the more the diversity. Chinese cultivar group had the highest theta values in Col 02, GAI, and Cop 01. Other cultivar groups gave high theta values one ESTs, for example, the Japanese group for Cry 02, Korean group for GT 02, and Vietnamese group for CCA 1 (Table 5). Chinese cultivar group had the highest theta values in three of the eight markers (Table 5).

Table 5. Nucleotide diversity (q) values of eight ESTs using the program DnaSP 4.0 (Rozas et al. 2005)

ESTs

China

Japan

Korea

Vietnam

Col 02

0.677

0.659

0.651

0.556

GT 02

0.323

0.533

0.630

0.610

Cry 02

0.589

0.622

0.554

0.614

GAI

0.559

0.463

0.370

0.462

CCA 1

0.532

0.625

0.661

0.720

Cop 01

0.561

0.449

0.516

0.480

Phy B

0.309

0.511

0.428

0.436

Mean ± SD

0.518 ± 0.13

0.556 ± 0.08

0.549 ± 0.11

0.542 ± 0.10

CV%

25.46

14.12

19.27

18.48

In summary, one of the most important environmental factors affecting flowering time is the daily duration of light, the photoperiod, which was first discovered by Garner & Allard in the 1920s [18]. Recent molecular genetic studies in a facultative long-day plant, Arabidopsis, have made notable progress in identifying genetic pathways and molecular components associated with the control of flowering time and the function of the circadian clock, which have been discussed in two recent Updates [20, 21]. From Arabidopsis alone, at least 80 loci were reported to affect the timing of flowering [22]. For soybean, at least seven genes have been reported to affect flowering time and maturity [7]. In our study, eight ESTs detected 39 SNPs. Chinese cultivar gave more diverse than others. Vietnamese group had a narrow variation. SNPs are the only new generation molecular markers for individual genotyping needed for molecular marker-assisted selection (MAS). There is some evidence that the stability of SNPs and, therefore, the fidelity of their inheritance is higher than that of the other marker systems like SSRs and AFLPs [9].

Conclusions

Seven observed ESTs detected about 39 SNP. Among them, Cry 02 gave high SNP positions. Chinese, Japanese, and Korean cultivar groups had more variation compare to the Vietnamese cultivar group through seven observed ESTs. This demonstrated that the Vietnamese cultivar group had narrow genetic variation. This indicated that selection affected the Vietnamese cultivar group, or the mutation rate was low. Chinese cultivar groups were more diverse than other groups. All observed genes contained exon region; except Cry 02. Furthermore, Cop 01 genes contained both exon and intron regions. With same primer different LDs among cultivar groups were observed. The longer distance, the lower r2. In conclusions, there was wide variation in flowering time of soybean. Gene diversity at nucleotide level will be useful for breeders to make strategy for soybean improvements in the future.

References

  1. Mouradov A, Cremer F, Coupland G. Control of flowering time: interacting pathways as a basis for diversity. The Plant Cell. 2002 May 1;14(suppl 1):S111-30.
  2. Simpson GG, Dean C. Arabidopsis, the Rosetta stone of flowering time?. Science. 2002 Apr 12;296(5566):285-9.
  3. Khalaj MA, Suresh Kumar P, Roosta HR. Evaluation of Nutrient Uptake and Flowering of Gerbera in Response of Various Growing Media. World. 2019;8(4):12-8.
  4. Watterson GA. On the number of segregating sites in genetical models without recombination. Theoretical population biology. 1975 Apr 1;7(2):256-76.
  5. Nizkii S, Kodirova G, Kubankova G. Determining the Amino Acid Composition of Soybean Proteins Using IR Scanners. International Journal of Pharmaceutical Research & Allied Sciences. 2020 Apr 1;9(2).
  6. Abe J, Komatsu K, Shimamoto Y. A gene for insensitivity of flowering to incandescent long daylength (ILD) is located in the linkage group 4. Soybean genetics newsletter (USA). 1998.
  7. Cober ER, Tanner JW, Voldeng HD. Genetic control of photoperiod response in early‐maturing, near‐isogenic soybean lines. Crop science. 1996 May;36(3):601-5.
  8. Tasma IM, Shoemaker RC. Mapping flowering time gene homologs in soybean and their association with maturity (E) loci. Crop Science. 2003 Jan;43(1):319-2
  9. Gupta PK, Roy JK, Prasad M. Single nucleotide polymorphisms: a new paradigm for molecular marker technology and DNA polymorphism detection with emphasis on their use in plants. Curr Sci. 2001 Feb 25;80(4):524-35.
  10. Giordano M, Oefner PJ, Underhill PA, Sforza LL, Tosi R, Richiardi PM. Identification by denaturing high-performance liquid chromatography of numerous polymorphisms in a candidate region for multiple sclerosis susceptibility. Genomics. 1999 Mar 15;56(3):247-53.
  11. Mashizi M.K., Golestan A.T. Study of the Effect of HBX Gene of Hepatitis B Virus on Liver Cancer Progress. Entomol. Appl. Sci. Let. 2020; 7 (3): 55-65.
  12. Alshehri MA. Identification of Algae Species Using Advanced Molecular Techniques. Int. J. Pharm. Res. Allied Sci. 2020;9(1):142-59.                                 .
  13. Shure M, Wessler S, Fedoroff N. Molecular identification and isolation of the Waxy locus in maize. Cell. 1983 Nov 1;35(1):225-33.
  14. Rozen S, Skaletsky H. Primer3 on the WWW for general users and for biologist programmers. InBioinformatics methods and protocols 2000 (pp. 365-386). Humana Press, Totowa, NJ.
  15. Halushka MK, Fan JB, Bentley K, Hsie L, Shen N, Weder A, Cooper R, Lipshutz R, Chakravarti A. Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis. Nature genetics. 1999 Jul;22(3):239-47.
  16. Zhu YL, Hyatt S, Quigley C, Song QJ, Grimm D, Young N, Cregan P. Single nucleotide polymorphisms (snps) in soybean genes, cdnas, and random genomic sequence. InPlant and Animal Genome IX Conference 2001 Jan 13.
  17. Nei M. Estimation of average heterozygosity and genetic distance from a small number of individuals. Genetics. 1978 Jul 20;89(3):583-90.
  18. Thomas B, Vince-Prue D. Photoperiodism in Plants, Academic Press. New York, NY. 1997:1-26.
  19. Rozas J., Rozas R. DnaSP version 4.1. An integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics. 2005;15 (2): 174-175.
  20. Pineiro M, Coupland G. The control of flowering time and floral identity in Arabidopsis. Plant Physiology. 1998 May 1;117(1):1-8.
  21. Somers DE. The physiology and molecular bases of the plant circadian clock. Plant Physiology. 1999 Sep 1;121(1):9-20.
  22. Levy YY, Dean C. The transition to flowering. The plant cell. 1998 Dec 1;10(12):1973-89.
QR code:

Short Link:
Quick Access

Pharmacophore
ISSN: 2229-5402

Pharmacophore
© 2024 All rights reserved
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.