
Published November 15, 2018
The Plant Genome, Original Research
Association Analysis of Three Diverse Rice (Oryza sativa L.) Germplasm Collections for Loci Regulating Grain Quality Traits
Trevis D. Huggins, Ming-Hsuan Chen, Robert G. Fjellstrom, Aaron K. Jackson,
Anna M. McClung, and Jeremy D. Edwards*
USDA-ARS, Dale Bumpers National Rice Research Center, 2890 Hwy. 130 East, Stuttgart, AR 72160.
Abstract
Rice (Oryza sativa L.) end-use cooking quality is vital for producers and billions of consumers worldwide. Grain quality is a complex trait with interacting genetic and environmental factors. Deciphering the complex genetic architecture associated with grain quality provides essential information for improved breeding strategies to enhance desirable traits that are stable across variable climatic and environmental conditions. In this study, genome-wide association (GWA) analysis of three rice diversity panels, the USDA rice core subset (1364 accessions), the minicore (MC) (173 accessions after removing non-sativa), and the high density rice array–MC (HDMC) (383 accessions), with simple sequence repeats, single nucleotide polymorphic markers, or both, revealed large- and small-effect loci associated with known genes and previously uncharacterized genomic regions. Clustering of the significant regions in the GWA results suggests that multiple grain quality traits are inherited together. The 11 novel candidate loci for grain quality traits and the seven candidates for grain chalk identified are involved in the starch biosynthesis pathway. This study highlights the intricate pleiotropic relationships that exist in complex genotype–phenotypic associations and gives a greater insight into effective breeding strategies for grain quality improvement.

Core Ideas
• We characterized core and minicore subsets of the USDA National Small Grains Collection of global rice accessions for grain quality.
• We identified loci and candidate genes for grain quality and grain chalk traits in rice diversity panels.
• We detected loci with pleiotropic effects across multiple grain quality and agronomic traits.
• We demonstrated the utility for genome-wide association (GWA) discovery in a minicore selected to maximize diversity with a minimal panel size.

Abbreviations: AAC, apparent amylose content; ASV, alkali spreading value; BrL, brown rice grain length; BrW, brown rice grain width; Chk, grain chalk; FDR, false discovery rate; FNP, functional nucleotide polymorphisms; GBSS, granule bound starch synthase; GWA, genome-wide association; HD, days to heading; HDRA, high-density rice array; HDMC, HDRA–minicore; indel, insertion–deletion; MAF, minor allele frequency; MC, minicore; MC-pub, MC with phenotype data published prior to 2009; MC09, MC with phenotype data from 2009; MLM, mixed linear model; MSU7, Michigan State University Rice Genome Annotation Project Release version 7; NPBR, Nipponbare marker designed at the functional nucleotide polymorphism; PC, principal component; PHt, plant height; QTL, quantitative trait locus; RCS, USDA rice core subset collection; RDP1, Rice Diversity Panel 1; RDP2, Rice Diversity Panel 2; RgL, seed length; RgW, seed width; SNP, single nucleotide polymorphism; SS, soluble starch synthase; SSR, simple sequence repeat; WxIn1, Waxy Intron 1.

Citation: Huggins, T.D., M.-H. Chen, R.G. Fjellstrom, A.K. Jackson, A.M. McClung, and J.D. Edwards. 2019. Association Analysis of Three Diverse Rice (Oryza sativa L.) Germplasm Collections for Loci Regulating Grain Quality Traits. Plant Genome 12:170085. doi: 10.3835/plantgenome2017.09.0085

Received 30 Sept. 2017. Accepted 3 Apr. 2018.
*Corresponding author (jeremy.edwards@ars.usda.gov).

This is an open access article distributed under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Copyright © Crop Science Society of America
5585 Guilford Rd., Madison, WI 53711 USA

Crop germplasm collections preserve and provide access to useful genetic diversity that is critical for continued crop improvement (McCouch et al., 2013). These ex situ collections often comprise tens of thousands of plant accessions (Bockelman et al., 2003), and it is impractical to exhaustively explore the entire collection for most traits. For this reason, core collections are developed that represent phenotypic, genotypic, and geographical diversity with minimal redundancy. To enable more intensive phenotyping and genotyping, a subset
(or MC) of maximally diverse accessions may be selected from the core collection. Phenotypic and genotypic characterization of core and MC diversity panels facilitates the discovery of new useful alleles and the introduction of new diversity into breeding programs.

Large international germplasm collections target staple crops that are globally important for food security. Rice is one such staple grain and is consumed by billions of people worldwide (Maclean et al., 2002; Sweeney and McCouch, 2007). Rice is grown in over 100 countries and, through breeding, has become adapted to a wide range of climatic zones, environments, and cultural management practices (Muthayya et al., 2014). O. sativa has been subdivided into two distinct subspecies groups, JAPONICA and INDICA, on the basis of numerous studies of phylogeny, morphology, and genetics (Sweeney and McCouch, 2007). The JAPONICA group is further subdivided into aromatic, temperate japonica, and tropical japonica subpopulations, and the INDICA group is divided into aus and indica subpopulations.

There have been significant global efforts to collect, preserve, and characterize rice germplasm collections. The USDA-ARS National Small Grains Collection of rice consists of approximately 19,000 accessions collected over a century from 116 countries and serves as a diverse genetic resource. A USDA rice core subset (RCS) collection representative of the genetic diversity of the entire collection was selected from 114 countries and consists of 1794 accessions of the Oryza genus and includes the species O. sativa, Oryza glaberrima Steud., Oryza rufipogon Griff., and Oryza nivara S.D.Sharma & Shastry. The accessions were chosen by random stratification to maintain genetic diversity (Yan et al., 2003a, 2007). The genetic diversity and population structure of the RCS collection were analyzed with 71 simple sequence repeat (SSR) markers and one insertion–deletion (indel) marker covering the entire genome with genetic distances of approximately 30 cM between each marker (Agrama et al., 2009). The USDA MC collection consists of 217 accessions that represent the genotypic and phenotypic diversity of the RCS (Agrama et al., 2009). The MC has been evaluated with SSR markers in numerous studies, identifying quantitative trait loci (QTLs) associated with agronomic traits (Agrama et al., 2009; Li et al., 2010, 2011), grain quality (Agrama et al., 2009), yield components and harvest index (Li et al., 2012), sheath blight resistance (Jia et al., 2012), hull silica content (Bryant et al., 2011), grain protein concentration (Bryant et al., 2013), cold tolerance (Schläppi et al., 2017), and starch biosynthesis (Li et al., 2017). More recently, a Rice Diversity Panel was developed that consists of different collections: Rice Diversity Panel 1 (RDP1), Rice Diversity Panel 2 (RDP2), and a collection from the National Institute of Agrobiological Sciences (McCouch et al., 2016). The accessions originated from approximately 92 countries, represent the five subpopulations of rice, and were genotyped with a fixed array of 700,000 single nucleotide polymorphisms (SNPs) (Liakat Ali et al., 2011; Zhao et al., 2011; Eizenga et al., 2014; McCouch et al., 2016).

Genome-wide association studies are an analytical tool that can decipher the relationships of a trait and its genomic causal region. The diverse phenotypic and genetic variation present in large sets of unrelated accessions can be studied to uncover the genetics underlying complex traits (Zhu et al., 2008; McCouch et al., 2016). Recently, an increase in efficient genotyping techniques has led to large high-quality SNP datasets. The manageable size and genetic diversity in the MC make it ideal for GWA analysis, and QTLs for pericarp color, amylose content, and seed length have been identified (Wang et al., 2016).

Genome-wide association, QTL mapping, and marker discovery present the opportunity to increase the efficiency of breeding improved varieties through marker-assisted selection. Increasing the economic value of the crop requires varieties that have high yield potential and superior grain quality. Rice grain quality encompasses a broad range of traits including grain shape, translucency, milling yield, cooking characteristics, sensory traits, and nutritional aspects (Fitzgerald and Resurreccion, 2009). Standard market classes of rice include short, medium, and long grains, which are determined by both grain dimension and other specified physicochemical properties required for conventional markets. Translucent grains are desired for essentially all market classes except for opaque waxy (sweet) rice or the chalky rice used for risotto or paella (Calingacion et al., 2014). Chalky grains are considered to be low quality because of their poor grain appearance and the negative impact they have on rice cooking (Lisle et al., 2000) and milling quality (Khush et al., 1978; Kadan et al., 2008).

The market value of a rice variety is ultimately dependent on the end user, whether that is an industrial processor or a consumer. Preference studies have shown that there is tremendous global diversity in what are considered to be desirable sensory quality traits (Calingacion et al., 2014). Amylose content, which is predominantly controlled by the Waxy gene, granule bound starch synthase 1 (GBSS 1), is considered the most important determinant of cooking and sensory (texture) quality (Fitzgerald and Resurreccion, 2009). Single nucleotide polymorphisms within the GBSS 1 gene are associated with amylose content and starch paste viscosity curves, which are predictors of suitability for parboiling and canning processes (Chen et al., 2008a, 2008b). Soluble starch synthase IIa (SSIIa) controls gelatinization temperature of starch granules in the rice grain, which is important in large scale industrial processing. Single nucleotide polymorphisms in this gene (Alk) have been shown to differentiate varieties with high or intermediate gelatinization temperatures from those with low gelatinization temperatures (Umemoto and Aoki, 2005; Bao et al., 2006). The chalky endosperm is a result of disordered starch granules and small, rounded, loosely packed amyloplasts (Lisle et al., 2000; Chun et al., 2009). Grain milling yield, palatability, and texture are negatively affected by chalky endosperms (Lisle et al., 2000; Chun et al., 2009). Grain chalk is complexly inherited with 10 QTL being reported thus far,
The Plant Genome, vol. 12, no. 1, March 2019

some overlapping with grain dimension QTLs (Wan et al., 2005; Song et al., 2007; Bian et al., 2013; Gao et al., 2013, 2016; Edwards et al., 2017). Storage protein is the second major constituent in rice next to starch (Champagne et al., 2004). It is divided into four main groups: glutelin, prolamin, globulin, and albumin. Multiple genes are responsible for the synthesis of each of the four groups (Bryant et al., 2013). Besides providing nutritional value, storage protein affects the rice texture and processing quality (Martin and Fitzgerald, 2002; Champagne et al., 2009). Having genomic markers linked with various cooking, processing, sensory, and nutritional traits in rice will help breeders combine crop productivity with desirable end-use quality traits targeted for specific markets.

In this study, we present a GWA analysis of eight grain quality and two agronomic traits with three rice diversity panels representing the genetic variation present in the O. sativa species. These panels, though they are all subsets of the RCS, differ in the type and number of markers used, as well as the number of accessions. All agronomic and grain quality data were previously published except for grain chalk, which is included in this analysis. Grain chalk has been studied in numerous biparental populations (Wan et al., 2005; Chun et al., 2009; Sun et al., 2015; Zhao et al., 2016; Edwards et al., 2017; Li et al., 2017), and recently has been evaluated in a hybrid rice collection (Gong et al., 2017) and a subset of the 3k Rice Genome Project accessions (Wang et al., 2017). The publication of high-resolution genotypic datasets of 3.3 million SNPs in the MC (Wang et al., 2016) and 700,000 SNPs in the High Density Rice Array (HDRA) (McCouch et al., 2016) presents a unique opportunity to further unravel the underlying genetic complexity of grain quality. The goals of this study were (i) to discover genetic loci associated with grain cooking and nutritional quality traits, (ii) to identify the associated candidate genes, and (iii) to determine the effectiveness of using small diversity panels to detect novel genetic loci.

Materials and Methods

Plant Material and Phenotyping
The RCS has previously been evaluated for days to heading, seed yield, plant height (PHt), lodging, plant type, panicle type, awn type, and cooking quality parameters (Yan et al., 2005a, 2005b). It was also examined for sheath blight resistance (Jia et al., 2011), straighthead resistance (Yan et al., 2003b), agronomic traits (Yan et al., 2003a, 2007, 2009; Agrama et al., 2009), and grain physicochemical traits (Yan et al., 2007). We reduced the RCS to 1364 accessions by removal of accessions that were non-sativa species, were collected from an unknown origin (Agrama et al., 2009), were missing phenotypic data, or failed to produce seed (Supplemental File S1).

A description of the MC has been given by Agrama et al. (2009) and Li et al. (2010). Of the 217 MC accessions, 203 accessions were genotyped by Wang et al. (2016) but the panel was reduced to 173 after the removal of non-sativa and ungenotyped accessions (Supplemental File S1) (Agrama et al., 2009; Yan et al., 2014b). Two sets of phenotypic data for the MC lines were used in the analysis. One set was obtained from experiments with the RCS, of which the MC is a subset, as described and published by Yan et al. (2007), hereafter designated as MC-pub. The second set was evaluated in two field locations during the 2009 growing season at the USDA-ARS Dale Bumpers National Rice Research Center near Stuttgart, AR, and the USDA-ARS Rice Research Unit near Beaumont, TX, and is designated here as MC09. The Rice Research Unit near Beaumont, TX, is located at 29°57´N, 94°30´W and has a League clay soil (fine, smectitic, hyperthermic Oxyaquic Hapluderts) and the Dale Bumpers National Rice Research Center near Stuttgart, AR, is located at 34°30´N, 91°33´W and has a Dewitt silt loam soil (fine, smectitic, thermic, Typic Albaqualf). Fourteen agronomic traits were evaluated as described by Agrama et al. (2009) and Li et al. (2010, 2011, 2012), of which apparent amylose content (AAC), alkali spreading value (ASV, an indicator of gelatinization temperature), length and width of brown and rough rice, PHt, and days to heading (HD) were used in this analysis (Table 1). Data on the percentage of protein (brown rice) for the MC have previously been summarized by Bryant et al. (2013). Here, we introduce an analysis of an additional quality trait, grain chalk (Chk), in the MC09. Grain chalk was evaluated in brown rice according to the procedures described by Edwards et al. (2017). In this study, the Chk phenotype is analyzed along with previously published grain quality and agronomic traits (days to heading and plant height) using a set of newly available high-density genotypic data (Table 1).

A set of 189 accessions genotyped by the HDRA and 194 accessions genotyped by next-generation sequencing in the MC were chosen to form the 383-accession HDMC diversity panel. The 383 accessions were previously evaluated by Yan et al. (2003a, 2005a, 2005b, 2007, 2009), since they are a subset of the RCS. Therefore, the phenotypic trait data assessed in the HDMC were taken from the RCS data and can be found in Table 1.

Table 1. Summary of traits and units of measure used in this study

Trait                     Unit of measurement   Panel(s) with the trait‡
Amylose content           %                     RCS, HDMC, MC09, MC-pub
Alkali spreading value    score†                RCS, HDMC, MC09, MC-pub
Brown rice grain length   mm                    RCS, HDMC, MC09, MC-pub
Brown rice grain width    mm                    RCS, HDMC, MC09, MC-pub
Days to heading           d                     RCS, HDMC, MC09, MC-pub
Plant height              cm                    RCS, HDMC, MC09, MC-pub
Rough rice grain length   mm                    RCS, HDMC, MC-pub
Rough rice grain width    mm                    RCS, HDMC, MC-pub
Grain chalk               %                     MC09
Protein                   %                     MC09

† 2: gelatinization temperature >74°C; 7: gelatinization temperature <70°C.
‡ RCS, rice core subset; HDMC, high-density rice array–minicore panel; MC09, minicore (MC) with phenotype data from 2009; MC-pub, MC with previously published data. The MC09 data were collected in 2009 in Stuttgart, AR, and Beaumont, TX. The RCS, MC-pub, and HDMC data were collected in 1998 and 2002 in Stuttgart, AR.
Genetic Data

USDA Rice Core Collection Markers
The RCS was previously genotyped with 71 SSR markers and an indel as described by Agrama et al. (2009, 2010). One SSR marker was discarded because it had excessive polymorphism, whereas nine additional markers associated with amylose content and major blast (Magnaporthe oryzae) disease resistance genes were used to genotype 1364 accessions of the RCS. Four markers are specific to the Waxy gene of rice: three are SNPs in Waxy Intron 1 (WxIn1), Waxy Exon 6, and Waxy Exon 10; one is an indel in the WxIn1 region that is scored as a 3-bp variation in fragment size (Chen et al., 2010). Two functional markers specific to the Alk gene were developed: the ALK marker targets a region containing two SNPs at positions 6,752,887 and 6,752,888 of Nipponbare [Michigan State University Rice Genome Annotation Project Release version 7 (MSU7); http://rice.plantbiology.msu.edu/, accessed 14 Sept. 2018] that change from GC to TT (TGCCGCGCACCTGGAGC, forward wild-type; CATGCCGCGCACCTGGAAA, forward mutant; CGCCGAGCCGCACAAGC, reverse) (Note: the ALK marker amplifies the functional nucleotide polymorphism in the Alk gene) and the marker NPBR (Nipponbare marker designed at the functional nucleotide polymorphism) targets a single A to G SNP at position 6,752,756 of Nipponbare (CGGGTCGAACGCCGAAAC, forward wild-type; AACGGGTCGAACGCCGAAAT, forward mutant; GGCCTCAACCAGCTCTACGC, reverse). The NPBR marker contains a 1-nucleotide mismatch (A instead of C) added at the third base from the 3´ end of the allele-specific primers to aid in SNP detection. Polymerase chain reaction conditions for the ALK and NPBR markers were the same as those described in Costanzo et al. (2011) and used a 67°C annealing temperature. Individual markers were combined to generate haplotypes for Waxy and Alk and were scored as shown in Supplemental Table S1.

Minicore Resequencing
The MC collection was sequenced to an average depth of 1.5× by Wang et al. (2016). Because additional SNP resources are now available for O. sativa that can be used to improve SNP calling and because nonimputed SNPs were desired for later steps, the SNP genotypes called from Wang et al. (2016) were not used. Instead, raw reads were downloaded from the sequence read archive (BioProject PRJNA301661). The SNPs were then called against the Nipponbare reference genome according to the Genome Analysis Toolkit best practices (McKenna et al., 2010). The HDRA SNP dataset (McCouch et al., 2016) was used for variant recalibration. The resulting SNP calls for the MC were then filtered to exclude SNPs with a minor allele frequency (MAF) below 0.05 and those with >60% missing data. In addition, because these are inbred lines and heterozygosity is expected to be low, SNPs with heterozygosity > 5% were removed. Finally, because the generation of seeds that was phenotyped is not the same as the genotyped individuals, any heterozygous sites were converted to missing data.

The HDRA Dataset
A genotypic dataset of 700,000 SNPs was generated by the HDRA technology as detailed by McCouch et al. (2016). The HDRA genotypic data of 1554 accessions, referred to as the HDRA panel, is the first of its kind in rice and captures much of the genetic variation in rice. This high-density SNP set afforded much higher resolution than what was previously available and has the ability to reveal genetic regions of both minor and major effects (McCouch et al., 2016). Single nucleotide polymorphisms from the HDRA dataset for RDP1 and RDP2 were obtained from ricediversity.org (http://ricediversity.org/proj/germplasm/index.cfm, accessed 14 Sept. 2018). These SNPs were filtered for MAF, the percentage missing data, and the percentage of heterozygosity across accessions as was done for the MC. Additionally, any SNPs in the HDRA dataset were removed when non-Nipponbare reference alleles were detected in the Nipponbare controls.

Overlap between the HDRA and the RCS
The MC collection is a subset of the RCS, whereas the RDP1 and RDP2 collections genotyped by the HDRA partially overlap with the RCS collection (Fig. 1a). Because the HDRA and MC datasets are both based on SNPs called against the same reference genome sequence, it was possible to match SNP data from the RCS accessions in the HDRA dataset and generate a merged dataset between the resequencing-derived MC SNPs and the fixed-array-based HDRA SNPs, called the HDMC. The intersection of SNPs shared between the MC and HDRA genotype data was found with VCFTools (Danecek et al., 2011) on the basis of the SNP pseudomolecule coordinates. Only SNPs present in both datasets (HDRA and the sequenced MC) were used for the HDRA genotyped collection. The combined SNPs were filtered, as was done for the MC. The compatibility of the two SNP datasets was validated by generating a neighbor-joining tree and verifying that the 23 lines that are shared between the MC and the HDRA genotyped collections appeared as nearest neighbors on the tree. Following this quality control step, when MC resequencing and HDRA SNP data were available for an accession, the MC SNPs were used and the HDRA SNPs were discarded.

Population Structure of the HDMC
The compiled panel resulted in a total of 122,102 high-quality SNPs after filtering for MAF (0.05) and removing heterozygous sites. The resulting genotypic data for the 383 diverse accessions were analyzed for population structure with fastSTRUCTURE (Raj et al., 2014). The ‘structure.py’ option was used to infer the number of populations (k). By using the admixture model for analysis, k was assigned values from 4 to 10 to infer populations. The number of components that explain the structure was determined with the ‘chooseK.py’ option by parsing through the output for each assigned k value. The expected admixture proportions thus inferred were visualized in a distruct plot generated in fastSTRUCTURE (Raj et al., 2014) (Fig. 1b). Principal components (PCs) were calculated from the 122,102 SNPs by applying the principal component function in TASSEL version 5 (Bradbury et al., 2007).

Fig. 1. Population structure of the high-density rice array (HDRA)–minicore (MC) merged rice diversity panel (HDMC). (a) A Venn diagram showing the intersections of four rice collections (the three sets, the light blue, blue, and pink circles, with unbroken lines were evaluated in this study). The numbers in the Venn diagram in (a) represent the total number of accessions in each panel. Fewer accessions were used for analysis because of missing genotypic or phenotypic data. (b) Population structure of the 1762 HDMC rice accessions (HDRA and MC). Each accession is represented by a single vertical line and each color represents a cluster as designated by fastSTRUCTURE. (c) The five subpopulations are visualized with the first three PCA components. The five subgroups, indica (IND), aus (AUS), aromatic (ARO), temperate japonica (TEJ), tropical japonica (TRJ), and admixed groups (mixed subpopulation assignment) are colored as indicated.

Genome-Wide Association Mapping

The RCS Panel
A total of 83 markers (72 SSRs, 2 indels, 7 SNPs, and 2 haplotypes) were used to perform GWA analysis on the 1364 accessions for grain quality traits, HD, and PHt in the TASSEL version 5 pipeline (Bradbury et al., 2007; Zhang et al., 2010). Before GWA analysis, the markers were converted to ACGT± allele groups as specified for formatting in the TASSEL version 5 manual. The converted genotypic data were filtered with a MAF of 0.05. The filtered data were then used to calculate principal components and a kinship matrix with the centered identity-by-state. A MLM analysis was performed with the first three PCs and the kinship matrix as covariates to correct for population structure and the relatedness present in the RCS panel. The ‘Eachmarker’ and ‘no-compression’ options in TASSEL version 5 (Bradbury et al., 2007), which calculate an association test for each marker and trait combination, were used in the MLM analysis. The p-values returned from the MLM analysis were subjected to false discovery rate (FDR) testing to reduce the likelihood of false positives in the R package “qvalue” (Storey and Tibshirani, 2003). For the RCS panel, a significance threshold of 10⁻³ was calculated from the FDR correction.

The MC Panel
The 3.3 million SNPs from the MC resequencing were used for GWA analysis in TASSEL version 5 (Bradbury et al., 2007). The genotype data were filtered for SNPs with a MAF of >0.05 and less than 30% missing sites. After filtering, 3.2 million SNPs remained. The remaining 173 accessions were used in the calculation of the kinship matrix and PCs. A centered identity-by-state kinship matrix and the first three PCs were used as covariates to account for relatedness and subpopulation structure in a MLM model. Associations for each marker and trait were calculated using the ‘Eachmarker’ and ‘no-compression’ options in TASSEL version 5 (Bradbury et al., 2007) in the MLM analysis.
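The per-SNP quality filters described under Minicore Resequencing (MAF below 0.05, more than 60% missing data, heterozygosity above 5%, and heterozygous calls set to missing) can be illustrated with a short sketch. This is not the authors' pipeline, which relied on the Genome Analysis Toolkit, VCFTools, and TASSEL. The 0/1/2/None genotype coding and the function names `snp_stats`, `passes_qc`, and `mask_heterozygotes` are assumptions made for illustration only.

```python
# Illustrative sketch only; genotype coding and function names are assumptions.
# Each SNP is a list with one call per accession:
#   0 = homozygous reference, 1 = heterozygous, 2 = homozygous alternate,
#   None = missing call.

def snp_stats(calls):
    """Return (missing_rate, heterozygosity, minor_allele_frequency) for one SNP."""
    called = [g for g in calls if g is not None]
    missing = 1 - len(called) / len(calls)
    if not called:
        return missing, 0.0, 0.0
    het = sum(1 for g in called if g == 1) / len(called)
    alt_freq = sum(called) / (2 * len(called))  # alternate alleles / total alleles
    maf = min(alt_freq, 1 - alt_freq)
    return missing, het, maf

def passes_qc(calls, min_maf=0.05, max_missing=0.60, max_het=0.05):
    """Keep a SNP unless MAF < min_maf, missing > max_missing, or het > max_het."""
    missing, het, maf = snp_stats(calls)
    return maf >= min_maf and missing <= max_missing and het <= max_het

def mask_heterozygotes(calls):
    """Convert heterozygous calls to missing, as described for the inbred lines."""
    return [None if g == 1 else g for g in calls]

# Hypothetical SNP across 100 accessions: 90 reference, 10 alternate homozygotes.
example = [0] * 90 + [2] * 10
keep = passes_qc(example)
```

Passing `max_missing=0.30` gives the stricter missing-data cutoff reported for the MC panel before GWA analysis. The thresholds are keyword arguments so the same check can be reused for each panel.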
The HDMC Panel
In total, 123,121 SNPs for the 383 accessions were generated after merging and filtering the MC and the HDRA. The SNPs with MAF < 0.05 and > 30% missing sites were removed via the filtering options available in TASSEL version 5. After filtering, 122,102 SNPs remained for the 383 accessions. A centered identity-by-state kinship matrix and PCs were calculated with the kinship and PCs options in TASSEL version 5. The GWA analysis was performed with a MLM, incorporating the kinship matrix and the first three PCs as covariates in the model. The options ‘no-compression’ and ‘EachMarker’ in TASSEL version 5 (Bradbury et al., 2007) were used in the analysis.

Post-Processing of GWA Results

Significant Regions
The raw output from TASSEL containing p-values for each SNP was processed with R scripts via the qqman package (Turner, 2014) to generate Manhattan plots and quantile–quantile plots. Q-values were calculated with the R package qvalue (Storey and Tibshirani, 2003). Because multiple significant SNPs can be found within close proximity of each other, a Perl script was used to process the TASSEL output with a threshold p-value for declaring a significant region and rules for determining the borders of the region based on the pseudomolecule distance separating adjacent significant SNPs. The distance threshold used was 50,000 bp and the p-value threshold used was –log10(p) > 6 except in cases of excessive significant SNPs where the p-value threshold was increased. The start, end, and position of the most significant SNP (peak SNP) within each region were reported by the script.

Candidate Gene Identification
To facilitate analysis of candidate genes, a Perl script was used to extract all annotated genes within each significant region from the MSU7 (Ouyang et al., 2007) and the Rice Annotation Project (RAP1; http://rapdb.dna.affrc.go.jp/, accessed 14 Sept. 2018) (Sakai et al., 2013) rice gene annotations. The lists of genes generated by the script were then inspected to identify probable candidate genes on the basis of the annotated gene functions. Candidate genes were inspected via the gene annotation tracks found in the Ricebase genome browser (https://ricebase.org/, accessed 11 June 2018) (Edwards et al., 2016). Genes within 250 kb of the significant SNP position were reported as candidates.

Overlapping Regions
Significant regions were identified for each trait in each panel via the output generated from the Perl script mentioned above. An additional Perl script was used to compare significant regions for overlap across traits and to report clusters.

Results

Genome-Wide Association Analysis
In the RCS, GWA analysis identified three significant markers for AAC, six markers for ASV, two for brown rice grain length (BrL), one for brown rice grain width (BrW), six for HD, five for PHt, three for rice grain seed length (RgL), and one for seed width (RgW) in the combined (aus + indica + aromatic + temperate japonica + tropical japonica + admixed) group analysis (Table 2). Three significant markers were detected for AAC, two for ASV, one for HD, and one for PHt in the INDICA (aus + indica + admixed aus–indica) group analysis (Table 2). In the JAPONICA (aromatic + temperate japonica + tropical japonica + admixed temperate–tropical–aromatic) group analysis, three significant markers were detected for AAC, five for ASV, two for HD, one for PHt, and one for RgW (Table 2). The three markers identified for AAC in the combined group were also identified in the INDICA and JAPONICA group analyses. Two markers for ASV (Alk_hap and ALK) were detected across all three groups, two additional markers were detected in the combined and JAPONICA group only, one in the combined group, and one was detected in the JAPONICA group only (Table 2). For grain dimension traits, the markers were detected only in the combined group (one for BrW and three for RgL). Four of the detected HD markers were identified only in the combined group, one in the combined and INDICA groups, one in the combined and JAPONICA groups, and one in JAPONICA only, but none were common between INDICA and JAPONICA (Table 2). Three of the five detected PHt markers were identified in the combined group, one in the combined and INDICA groups, and one in the combined and JAPONICA groups. The lone marker identified for RgW was detected in the combined and JAPONICA groups.

Genome-wide association analysis was conducted on the MC-pub data with the MC high-density resequenced genotypic dataset to investigate the genetic basis of grain quality. The analysis of AAC identified 51 significant genomic segments in the combined group, 60 in the INDICA group (–log10(p) > 8), and 13 in the JAPONICA group (Note: this excludes aromatics for MC09, HDMC, and MC), whereas 21 segments were identified in the combined group, 35 in the INDICA group, and 12 in the JAPONICA group for ASV (Fig. 2a, 2b; Supplemental Table S2, Supplemental Table S3, Supplemental Table S4; Supplemental Fig. S1, Supplemental Fig. S2). Analysis of BrL identified 12 segments in the combined group, seven in the INDICA group, and two in the JAPONICA group (Fig. 2c; Supplemental Table S4). The other grain length trait, RgL, had 10 segments in the combined group, three in the INDICA group, and three in the JAPONICA group (Supplemental Fig. S7; Supplemental Table S4). Brown rice width segments were identified in six regions in the combined group, five in the INDICA group, and two in the JAPONICA group (Fig. 2d; Supplemental Table S4). Five segments were identified in the combined group, five
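The region-calling rule described under Significant Regions (SNPs with –log10(p) > 6, adjacent significant SNPs joined when separated by no more than 50,000 bp, and the start, end, and peak SNP reported) can be sketched as follows. The authors used a Perl script that is not reproduced here, so this Python version is an approximation under stated assumptions. The `Region` class, the `call_regions` function, the inclusive treatment of the 50,000-bp boundary, and the example p-values are all hypothetical.

```python
# Illustrative sketch only; not the authors' Perl script.
from dataclasses import dataclass

@dataclass
class Region:
    chrom: str
    start: int      # position of the first significant SNP in the region
    end: int        # position of the last significant SNP in the region
    peak_pos: int   # position of the most significant SNP (peak SNP)
    peak_p: float   # p-value of the peak SNP

def call_regions(snps, min_neg_log10_p=6.0, max_gap_bp=50_000):
    """Group significant SNPs into regions.

    snps: iterable of (chromosome, position, p_value) tuples.
    A SNP is significant when -log10(p) > min_neg_log10_p, i.e. p < 10**-threshold.
    Adjacent significant SNPs on the same chromosome are merged when the
    distance between them is at most max_gap_bp (boundary handling is assumed).
    """
    p_cutoff = 10.0 ** (-min_neg_log10_p)
    significant = sorted((c, pos, p) for c, pos, p in snps if p < p_cutoff)
    regions = []
    for chrom, pos, p in significant:
        last = regions[-1] if regions else None
        if last and last.chrom == chrom and pos - last.end <= max_gap_bp:
            last.end = pos                      # extend the current region
            if p < last.peak_p:                 # track the peak SNP
                last.peak_pos, last.peak_p = pos, p
        else:
            regions.append(Region(chrom, pos, pos, pos, p))
    return regions

# Hypothetical p-values, chosen only to exercise the merging rule.
example_snps = [
    ("6", 1_765_740, 1e-20),
    ("6", 1_770_000, 1e-3),   # not significant, ignored
    ("6", 1_790_000, 1e-8),   # within 50 kb of the previous significant SNP
    ("6", 1_900_000, 1e-7),   # more than 50 kb away, starts a new region
    ("7", 1_910_000, 1e-9),   # different chromosome, starts a new region
]
regions = call_regions(example_snps)
```

Raising `min_neg_log10_p` corresponds to the stricter threshold the text mentions for traits with an excessive number of significant SNPs.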

Table 2. Summary of simple sequence repeats and single nucleotide polymorphisms significant in the genome-wide association analysis for grain quality and agronomic traits using the rice core subset (RCS) panel. The analysis consisted of 83 markers and 1364 accessions. Significant markers were determined by the q-value of the false discovery rate threshold at 5%.

Trait   Marker     Chromosome   Position     P-value         Group†
AAC‡    RM190      6            1,765,637    1.15 × 10⁻⁴     A, I, J
AAC     WxIn1      6            1,765,740    3.31 × 10⁻⁶³    A, I, J
AAC     Waxy_hap   6            1,767,999    2.93 × 10⁻⁹⁸    A, I, J
ASV     WxIn1      6            1,765,740    4.69 × 10⁻⁵     A, J
ASV     Waxy_hap   6            1,767,999    1.69 × 10⁻³     A
ASV     NPBR       6            6,752,681    1.40 × 10⁻¹⁸    A, J
ASV     Alk_hap    6            6,752,700    7.59 × 10⁻⁹⁷    A, I, J
ASV     ALK        6            6,752,809    6.30 × 10⁻¹⁴    A, I, J
ASV     RM125      7            5,480,474    1.23 × 10⁻³     J
BrL     RM489      3            4,334,678    6.06 × 10⁻⁴     A
BrL     RM474      10           1,819,823    3.26 × 10⁻⁴     A
BrW     RM145      2            7,706,894    1.68 × 10⁻⁴     A
HD      RM495      1            216,973      1.33 × 10⁻³     A
HD      RM302      1            32,988,278   3.07 × 10⁻⁴     J
HD      RM208      2            35,141,668   1.86 × 10⁻³     A

in the JAPONICA group (Fig. 2b; Supplemental Fig. S18; Supplemental Table S3, Supplemental Table S4). Eleven segments were detected for BrL in the combined group, two in the INDICA group, and two in the JAPONICA group; seven segments were detected in the combined group, four in the INDICA group, and nine in the JAPONICA group for BrW (Fig. 2c, 2d; Supplemental Fig. S19, Supplemental Fig. S20; Supplemental Table S4). Significant segments for PHt were detected in six regions in the combined group, 19 in the INDICA group, and four in the JAPONICA group (Supplemental Table S4; Supplemental Fig. S22). Thirty-one segments were detected for HD in the combined group, 46 in the INDICA group, and 59 in the JAPONICA group (Supplemental Table S4; Supplemental Fig. S21).

The HDMC was analyzed for population structure with the 122,102 SNPs and 383 accessions. The accessions were classified into the five rice subpopulations or admixture groups: aus, aromatic, indica, temperate japonica, and tropical japonica with a cutoff of 60% similarity to determine ancestry (Fig. 1b). The subpopulations were further combined into the INDICA group (aus–indica), the JAPONICA group (temperate japonica–tropi-
HD RM489 3 4,334,678 4.25 × 10–3 A cal japonica), and admixed (indica–japonica). Principal
HD RM510 6 2,832,512 1.91 × 10–3 A component analysis separated the HDMC into distinct
HD Rid12 7 6,068,021 3.50 × 10–3 A, I clusters that corresponded to the five rice subpopula-
HD RM44 8 11,759,316 8.84 × 10–4 A, J tions, with the first three PCs explaining about 50% of
PHt RM302 1 32,988,278 9.92 × 10–5 A, the total genetic variation (Fig. 1c).
PHt RM1339 1 38,197,901 4.89 × 10–5 A, J The GWA analysis of the HDMC detected four
PHt RM431 1 38,895,037 3.43 × 10–6 A, I genomic segments in the combined group, 21 in the
PHt RM555 2 4,305,688 3.67 × 10–3 A INDICA group, and 16 in the JAPONICA group for AAC.
PHt Rid12 7 6,068,021 1.30 × 10 –3 A (Fig. 2a; Supplemental Fig. S9; Supplemental Table S2;
RgL RM489 3 4,334,678 1.34 × 10–5 A Supplemental Table S4). For ASV, two significant seg-
RgL RM161 5 20,902,648 8.27 × 10 –4 A ments were detected for the combined group, four for the
RgL RM474 10 1,819,823 3.19 × 10 –4 A INDICA group, and one for the JAPONICA group (Fig.
RgW RM408 8 126,282 4.57 × 10 –4 A, J 2b; Supplemental Fig. S10; Supplemental Table S3; Supple-
† The letters represent the group where the markers were identified as significant: (A) entire RCS, (I) mental Table S4). Seven significant segments were detected
RCS INDICA, and (J) RCS JAPONICA. in the combined group, one in the INDICA group, and
‡ AAC, amylose content; ASV, alkali spreading value; BrL, brown rice grain length; BrW, brown rice 20 in the JAPONICA group for RgL; three segments were
grain width; HD, days to heading; PHt, plant height; RgL, rough rice grain length; RgW, rough rice detected in the combined group, one in the INDICA
grain width.
group, and two in the JAPONICA group for BrL (Fig. 2c;
Supplemental Table S4). The analysis of grain width traits
in the INDICA group, and two in the JAPONICA group detected one significant segment each in the combined
for RgW (Supplemental Fig. S8; Supplemental Table S4). group, the INDICA group, and the JAPONICA group for
Plant height analysis identified six segments in the com- both BrW and RgW (Supplemental Table S4). The analysis
bined group, 21 for the INDICA group, and zero for the of PHt detected seven significant segments in the com-
JAPONICA group (Supplemental Fig. S6; Supplemental bined group, nine in the INDICA group, and zero in the
Table S4). Forty-seven significant segments were identi- JAPONICA group. Two significant segments were detected
fied in the combined group, 40 in the INDICA group, and in the combined group, 12 in the INDICA group, and one
nine in the JAPONICA group for the HD trait (Supple- in the JAPONICA group for HD (Supplemental Table S4).
mental Fig. S5; Supplemental Table S4).
Several of the traits present in the MC-pub were also Genome-Wide Association Analysis of Grain Chalk
evaluated in the MC09 experiment and were analyzed for Genome-wide association analysis using the MC09 for
comparison. Significant segments for AAC were detected grain chalk identified significant genomic segments
in 55 regions in the combined group, 59 in the INDICA on all chromosomes except for chromosome 9 in the
group, and 20 in the JAPONICA group analyses (Fig. 2a; combined analysis. Thirty-five segments were detected
Supplemental Fig. S17; Supplemental Table S2, Supplemen- in the combined group, 50 in the INDICA group, and
tal Table S4). Forty-eight segments were detected for ASV 11 in JAPONICA (aromatics were excluded) (Fig. 3;
in the combined group, 31 in the INDICA group, and three Table 3; Supplemental Table 4). Thirteen segments in the
Fig. 2. Genome-wide association analysis for amylose content, alkali spreading value, and grain length and width for the pre-2009 minicore (MC) (MC-pub) (top), the combined high-density rice array–MC (HDMC) (middle), and the MC with phenotype data from 2009 (MC09) (bottom) diversity panels. Manhattan plots of amylose content (a), alkali spreading value (b), brown rice grain length (c), and brown rice grain width (d). Manhattan plots illustrate the p-values obtained from a mixed linear model with high-quality single nucleotide polymorphisms (SNPs) for each trait evaluated. The x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10−6 were classified as significant.

Fig. 3. Genome-wide association analysis for grain chalk in the minicore rice diversity panel with phenotype data from 2009 (MC09). Manhattan plots (a,b,c) and quantile–quantile plots (d,e,f) of the p-values obtained from a mixed linear model with next-generation sequencing of single nucleotide polymorphisms (SNPs). For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10−6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of associations across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots represent the five subpopulations of O. sativa (a,d), the INDICA varietal group (b,e) consisting of indica, aus, and combined indica–aus and the JAPONICA varietal group (c,f) containing temperate japonica, tropical japonica, and combined temperate japonica–tropical japonica (the aromatics were not included in the JAPONICA group).
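The thresholding described in the captions, together with the 50,000-bp flanks reported for each grain chalk segment in Table 3, can be illustrated with a short sketch. It is not the Perl script used in the study. The function name `significant_segments`, the rule that padded windows on the same chromosome are merged when they touch, and the second SNP in the usage example are assumptions made only for illustration.

```python
import math

def significant_segments(snps, threshold=6.0, pad=50_000):
    """Group significant SNPs into padded genomic segments.

    snps: iterable of (chromosome, position_bp, p_value) tuples.
    A SNP is kept when -log10(p) >= threshold. Each kept SNP is padded by
    `pad` bp on both sides, and padded windows that touch on the same
    chromosome are merged (the merge rule is an assumption of this sketch).
    """
    hits = sorted(
        (chrom, pos, -math.log10(p))
        for chrom, pos, p in snps
        if p > 0 and -math.log10(p) >= threshold
    )
    segments = []
    for chrom, pos, score in hits:
        start, stop = max(0, pos - pad), pos + pad
        last = segments[-1] if segments else None
        if last and last["chrom"] == chrom and start <= last["stop"]:
            # Extend the open segment and keep track of its most significant SNP.
            last["stop"] = max(last["stop"], stop)
            if score > last["peak_score"]:
                last["peak_pos"], last["peak_score"] = pos, score
        else:
            segments.append({"chrom": chrom, "start": start, "stop": stop,
                             "peak_pos": pos, "peak_score": score})
    return segments

# Two peak SNPs taken from Table 3, one hypothetical neighbouring SNP,
# and one hypothetical non-significant SNP.
example = [
    (4, 4_804_567, 2.53e-9),
    (4, 4_850_000, 5.0e-7),   # hypothetical
    (4, 9_000_000, 1.0e-3),   # hypothetical, below threshold
    (5, 3_091_626, 5.07e-7),
]
segments = significant_segments(example)
```

With these inputs the chromosome 5 segment spans 3,041,626 to 3,141,626 bp, matching the corresponding row of Table 3, while the two nearby chromosome 4 SNPs collapse into a single wider segment whose peak remains at 4,804,567 bp.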
combined group, 10 in INDICA and five in JAPONICA were located in previously reported grain chalk regions in the biparental populations, Lemont/TeQing (Zhao et al., 2016) and KBNT lpa/Zhe733 (Edwards et al., 2017) (Table 3). Analysis of grain protein content identified significant segments on all chromosomes. Thirty-five segments were detected in the combined group, seven in the INDICA group, and 39 in the JAPONICA group (Supplemental Table S4; Supplemental Table S6).

Overlapping Genomic Segments
Significant chromosomal regions identified from the GWA results were compared across all traits in the MC-pub, HDMC, and MC09 panels. The chromosomal segment comparison identified genetic regions associated with multiple traits (Fig. 4). Fifty-two significant regions for grain chalk overlapped the regions of other traits. Forty-one of these regions overlapped with AAC, 13 with ASV, and 10 with both AAC and ASV (Fig. 4; Supplemental Table S4). Eight of the detected grain chalk regions did not overlap with either AAC or ASV; however, six of these regions overlapped HD regions (Supplemental Table S4). Grain chalk regions overlapped with a total of four grain length regions and three grain width regions. Additionally, three grain chalk regions overlapped with regions shared with both AAC and grain protein, and two regions shared with ASV and grain protein (Supplemental Table S4).
Table 3. Summary of significant genome-wide association analysis
single nucleotide polymorphisms (SNPs) for grain chalk in the
minicore diversity panel. The analysis consisted of 3.2 million
SNPs. Markers detected at or above the threshold [-log10(p) = 6]
were considered significant.
Chromosome Start† Stop† Peak SNP‡ Peak_val‡
——————————bp ——————————
1¶ 1,654,788 1,754,788 1,704,788 6.12 × 10 –7
1 12,313,419 12,489,991 12,363,419 1.97 × 10 –8
1 25,378,734 25,478,736 25,428,734 8.45 × 10 –7
1§¶ 31,444,903 31,545,511 31,494,903 4.34 × 10–7
2§ 23,027,141 23,127,141 23,077,141 4.46 × 10–7
2 24,474,052 24,574,756 24,524,052 2.02 × 10 –8
3 3,707,484 3,822,358 3,757,484 5.94 × 10–7
3 9,447,631 9,547,631 9,497,631 4.47 × 10–7
3 22,388,199 22,502,385 22,438,199 8.00 × 10–7
4¶ 4,754,567 4,991,082 4,804,567 2.53 × 10–9
4¶ 5,218,584 5,358,637 5,291,778 8.65 × 10–10
4 11,551,491 11,879,693 11,601,491 1.13 × 10–9
4 20,550,548 20,650,548 20,600,548 3.47 × 10 –7
4§ 22,029,754 22,129,754 22,079,754 5.84 × 10–7
5¶ 3,041,626 3,141,626 3,091,626 5.07 × 10 –7
5§¶ 3,725,967 3,825,967 3,775,967 1.73 × 10 –8
5§ 4,111,225 4,211,225 4,161,225 9.65 × 10–7
5 5,335,281 5,510,712 5,396,755 2.40 × 10–7
6§ 1,729,126 1,876,635 1,779,126 1.07 × 10 –10
6 4,575,534 4,775,534 4,725,534 1.95 × 10 –7
6 8,969,806 9,071,605 9,019,806 9.67 × 10–7
6 15,657,572 15,757,572 15,707,572 8.33 × 10 –7
7 7,289,618 7,460,150 7,339,618 1.56 × 10–7
7 22,964,373 23,221,163 23,014,373 3.74 × 10 –7
8 15,757,626 16,025,232 15,961,296 1.28 × 10 –9
8§ 17,474,125 17,686,054 17,524,125 2.13 × 10 –7
8 18,447,974 19,316,518 19,010,220 4.12 × 10 –7
8 19,570,551 19,670,551 19,620,551 4.37 × 10–7
8 19,891,463 20,103,321 20,031,164 2.19 × 10 –10
8 24,524,827 24,624,827 24,574,827 5.92 × 10 –7
8 26,027,610 26,127,610 26,077,610 3.11 × 10–7
10 4,262,440 4,362,440 4,312,440 6.34 × 10 –8
11§ 8,116,105 8,333,235 8,166,105 4.75 × 10–10
11§ 24,255,726 24,355,726 24,305,726 5.42 × 10 –7
12 9,221,916 9,321,916 9,271,916 1.00 × 10–6
† Start and Stop indicate the regions 50,000 bp upstream and downstream of the peak SNP, respectively.
‡ Peak SNP, most significant SNP in the region; Peak_val, the p-value of the most significant SNP.
§ Regions identified in the Lemont × TeQing biparental populations for grain chalk.
¶ Regions identified in the KBNT lpa × Zhe733 biparental populations for grain chalk.

Fig. 4. Heat-map of the overlapping genome segments significantly associated with multiple traits in the rice core subset (RCS), the minicore (MC) with data from before 2009 (MC-pub), the MC with phenotype data from 2009 (MC09), or the combined high-density rice array–MC panels (HDMC). The segments (horizontal) are clustered according to the patterns of shared significant traits (vertical) with red indicating a significant association between that chromosome region and the trait, and blue indicating no detected association. Known genes contained within segments associated with grain chalk are annotated on the left.

Candidate Genes Identified for Grain Quality
Some of the significant segments that were identified in the GWA analysis were in proximity to or located within known and characterized genes. A few of these major genes include Grain Size 3 (Os03g0407400), Grain Weight 5 (DQ991205), the dwarf genes semi-dwarf 1 (Os01g0883800) and OsGH3.1 (Os01g0785400), and the starch biosynthesis genes Waxy (Os06g0133000) and SSIIa (Os06g0229800).
Perl scripts and the MSU7 gene annotation tracks in Ricebase (Edwards et al., 2016) were used to identify candidate genes within 200 kb of significant segments. Six potential candidate genes were identified for AAC: a Na–Ca exchanger gene (LOC_Os02g43110), a triose phosphate translocator gene (LOC_Os05g07870), a cellulose synthase gene (LOC_Os06g39970), a trehalose phosphatase gene

(LOC_Os07g30160), an aminocyclopropane-1-carboxylate oxidase gene (LOC_Os08g30210), and an inositol triphosphate phosphatase gene (LOC_Os10g28660) (Supplemental Table S5). An auxin-response gene (LOC_Os01g70050), a vacuolar protein sorting gene (LOC_Os04g31390), an α-glucanotransferase gene (LOC_Os07g46790), and a glucose phosphate transferase gene (LOC_Os08g10600) were identified as potential candidate genes for ASV (Supplemental Table S5). Six potential candidate genes for Chk were identified, which include SUCROSE TRANSPORTER 1 (Ishimaru et al., 2001) (LOC_Os03g07480), trehalose phosphatase genes (LOC_Os05g06160 and LOC_Os08g31630), a sugar transporter gene (LOC_Os05g07750), a stress-induced receptor kinase gene (LOC_Os04g09770), and a fructose-6-phosphate-2-kinase gene (LOC_Os05g07130) (Supplemental Table S5). Characterized genes in identified significant genomic segments overlapping between Chk and other traits include Waxy, BADH, INO2 (Perera et al., 2018), the flowering time gene Hd1 (Yano et al., 2000), SUCROSE TRANSPORTER 1, Shrunken2 (Lee et al., 2007), and Grain Weight 5 (Fig. 4).

Discussion
The availability of high-density genotype data provided by next-generation sequencing presents the opportunity to revisit and reanalyze previously collected phenotypes across diversity panels. High-density SNP datasets provide increased genome coverage and increased resolution and thus can be leveraged in GWA studies to uncover statistically significant regions associated with traits of interest (Huang et al., 2009; Han and Huang, 2013; Korte and Farlow, 2013). By accounting for population structure and history, GWA studies can further decipher the underlying genetic architecture of complex traits in rice (Zhao et al., 2011; Han and Huang, 2013).
An advantage of SNP-based genotyping is that data can be combined from different platforms such as next-generation sequencing and fixed arrays by using pseudomolecule coordinates. Some accessions from the RCS that were not included in the MC were included in the HDRA. Merging the MC resequencing data and HDRA data from the additional RCS accessions increased the total number of accessions in the RCS with high-density genotyping to 383 (194 from MC and 189 from HDRA). Accessions that were used in the HDMC diversity panel are available to the public through the Genetic Stocks Oryza database (www.ars.usda.gov/GSOR, accessed 11 June 2018). It was expected that expanding the MC with additional genotyped accessions from the RCS would increase the statistical power of the GWA analysis. The significance of the loci detected in the HDMC increased; however, the total number of loci was reduced compared with the MC. This occurred partly because of the decrease in the number of SNPs available for the merged HDMC compared with the resequenced MC. The number of significant segments identified in the MC using the full set of SNPs ranged from 3 to 43 across all traits. However, when the HDMC SNPs were used to analyze the MC, the highest number of significant segments identified across all traits was 18 (data not shown). The ability of the MC to reveal nearly all the statistically significant loci that could be found in the HDMC and the much larger RCS reveals the usefulness of thoughtfully constructing a diversity panel for gene discovery by maximizing genetics, age of accession (i.e., landraces vs. improved varieties), and ecotype while minimizing the number of accessions to be genotyped and phenotyped.

Genome-Wide Association Analysis of the RCS Diversity Panel
An examination of marker–trait associations in the RCS reveals that significant regions occur in proximity to characterized genes in most cases. The SNP markers WxIn1 and Waxy-hap were highly associated with AAC (Table 2). The Waxy-hap haplotype, made up of WxIn1 and Waxy Exon 6, was designed specifically to identify the Waxy gene region in rice, which affects grain amylose content. The WxIn1 SNP is located in the intron region of the Waxy gene (Os06g0133000) and appears to serve as one of the multiple functional nucleotide polymorphisms (FNP) (Cai et al., 1998; Hirano et al., 1998; Isshiki et al., 1998; Sato et al., 2002; Chen et al., 2010). The G to T mutation of WxIn1 resulted in inefficient splicing of Intron 1, lowering the amount of fully processed GBSS 1 mRNA, and is associated with low amylose genotypes (Larkin and Park, 1999), whereas genotypes with a G SNP contained intermediate and high AAC (Chen et al., 2008a). Within the genotypes carrying the WxIn1 G SNP, a mutation of Exon 6 from A to C altered an amino acid from tyrosine to serine in intermediate amylose genotypes (Larkin and Park, 2003) and resulted in lower amounts and activity of GBSS 1 relative to those with the A SNP, which have high amylose (Zhou et al., 2015). It was speculated that tyrosine, through hydrogen bonding, provides more stability to GBSS 1 than serine (Dobo et al., 2010). As reported previously, unlike the SNPs in WxIn1 and Waxy Exon 6, which have causative effects biochemically, RM190 does not and thus its association with AAC is relatively lower (Chen et al., 2008a; Dobo et al., 2010). Waxy-hap, ALK and Alk-hap, and WxIn1 were significantly associated with ASV. The ALK and Alk-hap markers target the amylopectin chain-length (Alk) gene, which is located at 6,748,398 bp and is associated with gelatinization temperature and gel consistency (Gao et al., 2011; Yan et al., 2014a). The Alk-hap haplotype is designed from a combination of the ALK and NPBR SNPs (Supplemental Table S1). The Alk gene encodes SSIIa, when it is functional, which elongates the exterior short chains to intermediate length chains and results in higher gelatinization temperature. Rice with the nonfunctional Alk protein did not have detectable starch-associated SSIIa enzyme activity, although the proteins were present in the soluble endosperm fraction (Umemoto and Aoki, 2005).
The markers RM1339 and RM431 on chromosome 1 produced significant signals for PHt in the RCS (Table
2). RM1339 is located ~184 kb upstream and RM431 is located ~512 kb downstream of the “Green Revolution” semi-dwarf 1 (Os01g0883800) gene, mutation of which reduces plant height by affecting the final stages of gibberellin biosynthesis (Cho et al., 1994; Monna et al., 2002; Spielmeyer et al., 2002). The semi-dwarf 1 gene is a major-effect gene and produces a ‘mountain range’ distribution of significant SNPs in this region, as described by Atwell et al. (2010). The marker RM489 is significantly associated with HD and grain length. Closer examination of the region showed that it sits between the dwarf gene OsBP-73 (Os03g0183100) and a plant height gene TIFY11b (Os03g0181100), which are located ~21 kb upstream and ~84 kb downstream, respectively. The dwarf gene OsBP-73 inhibits plant growth by reducing tiller number and panicle number and shortening culms (Chen et al., 2003), whereas TIFY11b increases plant height and increases seed size by pronounced accumulation of stem carbohydrates (Nakamura et al., 2007; Hakata et al., 2012). Previous studies have reported similar pleiotropic observations between HD and grain length in a chromosomal region; an example of this is the Ghd8 gene, which affects grain yield, HD, and PHt (Yan et al., 2011).

Possible Candidate Genes
The high resolution afforded by GWA analysis can allow for detection and identification of regions that are significantly associated with traits of interest. Possible candidate genes were identified on the basis of the biological function of the surrounding characterized genes and the presence of significant regions occurring within 200 kb of a known gene in rice (Supplemental Table S5). The AAC candidate gene, LOC_Os05g07870, is located ~860 kb downstream of a major grain chalk gene, chalk5, a vacuolar pyrophosphatase with H+ translocation activity. The LOC_Os05g07870 gene is characterized as a triose phosphate translocator-encoding gene. Triose phosphates play an important role in the source–sink relationship in plants and reside within the cell wall of chloroplasts, regulating the transport of sucrose in and out of the cytosol, connecting photosynthesis, starch synthesis, and glycolysis (Jin-Yue et al., 2004; Toyota et al., 2006). LOC_Os07g30160 is a trehalose-6-phosphatase gene that has been shown to regulate starch use in plants and is an indicator of plant sucrose status (Wingler et al., 2000; Schluepmann et al., 2004; Lunn et al., 2006; Ponnu et al., 2011). More importantly, trehalose-6-phosphatase is a known regulator of starch metabolism in plants and specifically induces starch accumulation and synthesis (Wingler et al., 2000; Lunn et al., 2006). The LOC_Os08g30210 gene produces 1-aminocyclopropane-1-carboxylate oxidase, which is involved in ethylene biosynthesis, thus regulating plant developmental stages and stress tolerance (Ruduś et al., 2013). Previous studies have reported that high nighttime temperatures can affect endosperm development, resulting in poor packing of starch granules (Ambardekar et al., 2011; Lanning et al., 2011). The presence of 1-aminocyclopropane-1-carboxylate oxidase has been reported to be involved in heat stress tolerance and may affect starch metabolism (Li et al., 2015).
The ASV candidate gene, LOC_Os07g46790, encodes Disproportionating Enzyme 1, a 4-α-glucanotransferase. This protein is involved in the synthesis of starch and also affects amylose content, amylopectin structure, and the size of starch granules (Colleoni et al., 1999; Dong et al., 2015). Suppression of Disproportionating Enzyme 1 resulted in increased amylose content, reduced proportions of amylopectin chains with a degree of polymerization of 6 to 8 glucose units and those with a degree of polymerization of 16 to 36 glucose units but it increased those with a degree of polymerization of 9 to 15 glucose units, and displayed loosely packed starch granules in the rice endosperm. When overexpressed, it reduced amylose content, increased the proportion of amylopectin chains with a degree of polymerization of 6 to 10 glucose units and those with a degree of polymerization of 23 to 38 glucose units, whereas it reduced those with a degree of polymerization of 11 to 22 glucose units, and the starch granules were tightly packed (Dong et al., 2015).
The candidate gene for grain chalk, LOC_Os03g07480, is a sucrose transporter located ~1.4 Mb downstream of a low phytic acid gene (XS-lpa2, Os03g0142800) and ~1.0 Mb upstream of rice myo-inositol 3-phosphate synthase 1 (RINO1, Os03g0192700) (Supplemental Table S5). Sucrose transporters are proton-coupled uptake transporters that transport sucrose, maltose, and α- and β-glucosides into sink tissues and the phloem (Kühn and Grof, 2010; Ayre, 2011; Reinders et al., 2012). Edwards et al. (2017) reported that the low phytic acid gene located on chromosome 2 (OsLpa1) was a likely candidate for causing grain chalkiness in the KBNT lpa × Zhe733 biparental mapping population. Phytic acid biosynthesis genes are regulators of seed P and have been reported to be influenced by abscisic acid during seed development (Yoshida et al., 2002; Matsuno and Fujimura, 2014). Phytic acid, abscisic acid, and sucrose accumulate in rice seeds during the same developmental period, and it has been reported that abscisic acid regulates grain filling along with sucrose (Akihiro et al., 2005; Tang et al., 2009).
Another grain chalk candidate gene, LOC_Os05g06160, is a trehalose phosphatase that is located ~244 kb upstream of the chalk5 gene identified by Li et al. (2014). This chromosomal region has previously been reported to contain a grain chalk QTL (qBCHK5) (Edwards et al., 2017). Two other possible candidate genes for grain chalk were identified on chromosome 5: LOC_Os05g07130 (a fructose-6-phosphate-2-kinase gene) and LOC_Os05g07750 (a sugar transporter gene), located ~436 kb and ~800 kb downstream of chalk5, respectively (Supplemental Table S5). Fructose-6-phosphate-2-kinase is a bifunctional enzyme that modulates fructose-2,6-bisphosphate in plants. It is primarily expressed in leaves and regulates leaf sucrose levels (Park et al., 2007; Udomchalothorn et al., 2009). Sugar transporters mediate the movement of starch from source to sink tissues, especially during grain filling (Kühn and Grof, 2010; Ayre,

2011; Reinders et al., 2012). These two candidate genes were located in the region where two grain chalk QTLs, DEC (qDEC5a) and PGWC (qPGWC5a), were detected in the Lemont × TeQing recombinant inbred line population (Zhao et al., 2016). The functions of the proposed candidate genes suggest that this region may affect some portion of the starch metabolism and thus grain quality.
LOC_Os08g31630, a trehalose phosphatase gene, was located ~760 kb upstream of betaine aldehyde dehydrogenase 2 (Os08g0424500), a gene associated with rice flavor. Two previously reported grain chalk QTLs, DEC (qDEC8a) and PGWC (qPGWC), also reside in this chromosomal region (Zhao et al., 2016). As previously mentioned, some of the significant chromosomal regions detected either overlapped or were in proximity to known and characterized genes. Some of the significant regions and candidate genes also overlapped with a number of previously mapped QTL related to grain quality in biparental populations. Recently, two studies investigating grain chalk reported significant chromosomal regions that coincided with regions detected for grain chalk, AAC, and ASV (Gong et al., 2017; Wang et al., 2017). These observations confirm that GWA analysis with high-density coverage not only improves the mapping resolution but also confirms the results of the previous low-density GWA analysis. Since most of these candidate loci were located in previously reported regions, these regions require further investigation.

Characterized Gene Regions
In rice, GWA usually does not have sufficient resolution to identify the FNP. Single nucleotide polymorphisms with significant p-values may be the FNP itself or in high linkage disequilibrium with the FNP. Linkage disequilibrium decays within ~75 to 150 kb in rice and is variable across the genome (Mather et al., 2007). The Waxy gene is known to have multiple FNPs which include A to G, G to A, and T to C base changes (Cai et al., 1998; Sato et al., 2002). The Waxy-associated SNP on chromosome 6 at 1,765,761 bp is located within the gene but had a T to G base change and is within an intron, so it is unlikely to be the FNP. However, the SSIIa gene has a known FNP with a GC to TT base change (Umemoto and Aoki, 2005; Bao et al., 2006) and was detected in this study by the associated SNP on chromosome 6 at 6,752,887 bp, which causes a missense variant within the gene. Likewise, our study detected the Grain Size 3-associated SNP on chromosome 3 at 16,733,441 bp that is located within the gene and introduces a stop codon in the second exon, which is a known FNP at this gene (Fan et al., 2009). Other FNPs for Grain Size 3 that have been reported include a nonsense mutation in Exon 2 and three deletions in Exon 5, which result in short seeds (Takano-Kai et al., 2011, 2013). The SNP on chromosome 5 at 5,375,167 bp associated with the Grain Weight 5 region in this study is located downstream from the gene and has a C to T base change, but a known FNP of Grain Weight 5 that is the result of a 1212-bp deletion was not detected (Shomura et al., 2008). Although the predicted gene effects of significantly associated SNPs may be suggestive of them being FNPs, conclusive determination will require further studies involving fine mapping, gene editing, or both.

Segment Overlap
Knowledge of the genetic loci associated with a trait and the number of loci controlling that trait is important for determining the genetic architecture and designing an effective breeding strategy. In this study, a number of overlapping genomic regions were significantly associated with multiple traits. Specifically, endosperm starch-related traits, AAC and ASV, overlapped with grain chalk in a number of genomic regions. The analysis of overlapping segments revealed that AAC, ASV, and Chk appeared together in 10 chromosomal regions (Fig. 4 and Supplemental Table S4). The chromosomal regions shared among these grain quality traits could be an indicator of mediated pleiotropy. Mediated pleiotropy can be described as the influence of one phenotype on a second phenotype (Solovieff et al., 2013). As previously detailed, the loose packing of starch granules, as well as their shape and size, can result in grain chalkiness (Tashiro and Ebata, 1975; Tashiro and Wardlaw, 1991; Lisle et al., 2000). Some of these regions are associated with starch biosynthesis-related genes or enzymes; thus, it is possible for amylose-related traits to indirectly affect grain chalk, the second phenotype.
The overlap of HD regions with AAC, ASV, and Chk regions may be influenced by source–sink dynamics at play in primary tillers during the preheading and grain filling periods. Two recent studies also reported similar interactions among grain chalk, starch-related traits, HD, and grain size traits (Gong et al., 2017; Wang et al., 2017). The heading period is vital in regulating carbohydrate or starch mobilization and starch biosynthesis in tiller leaves (Hirose et al., 2006; Hirano et al., 2016). Two studies investigating starch accumulation and mobilization observed increased starch accumulation in the second and third leaf sheath on productive tillers during the 15 d period prior to heading and flag leaf emergence (Hirose et al., 2006; Hirano et al., 2016). Starch was also observed to decrease 3 d after heading in the second leaf sheath of primary tillers (Hirose et al., 2006). The rapid accumulation of starch during this period and the subsequent decrease after heading coincided with the onset of starch accumulation in spikelets and grains (Hirose et al., 2006; Hirano et al., 2016). The study also noted the increased activity of four starch biosynthesis enzymes, adenosine 5′-diphosphate-glucose pyrophosphorylase, starch synthase, GBSS, and starch branching enzyme, which peaked ~10 d prior to heading and then gradually decreased (Hirose et al., 2006). High nighttime temperatures during flowering and grain filling can affect the source–sink dynamics, resulting in reduced quality and yields (Counce et al., 2005; Cooper et al., 2008; Shi et al., 2013). These observations suggest that starch mobilization to the grain during this crucial time is essential for producing high-quality translucent grains.
Future Research
The genomic regions and candidate genes identified in this study will be targeted for development of gene-specific markers. The genetic markers thus developed could be used to validate the GWA results in biparental populations and to assess the diversity panels fully. Validated markers will be tested in panels of breeding lines and subsequently be deployed for use in marker-assisted selection to accelerate breeding for grain quality. Knowledge of the different pleiotropic effects of the various loci is also critical for their deployment in breeding so that grain defects like chalk can be reduced without simultaneously having an undesired effect on other grain quality traits.

Supplemental Information
Supplemental Table S1. The Alk and Waxy SNP data were generated on an ABI 3730, which uses fluorophore-labeled markers to detect the product size of the amplified fragment. The specific fragment size corresponds to the SNP nucleotide for the markers. An artificial allele size for Waxy-hap was generated from the combination of the Waxy Intron 1 allele and Waxy Exon 6 allele as shown in the table.
Supplemental Table S2. Summary of significant GWA SNPs for AAC in the mini-core (MC) and the HDRA-mini-core (HDMC) diversity panels.
Supplemental Table S3. Summary of significant GWA SNPs for ASV in the mini-core (MC) and the HDMC diversity panels.
Supplemental Table S4. Summary of overlapping significant GWA hits identified in the mini-core (MC) and the HDRA-mini-core (HDMC) diversity panels. The significant regions were compared across all traits and subspecies groups (INDICA and JAPONICA) within each diversity panel.
Supplemental Table S5. Summary of putative candidate genes for AAC, ASV, and grain chalk.
Supplemental Table S6. Summary of significant GWA hits for protein of the mini-core (MC) diversity panel. The analysis consisted of 3.2 million SNPs across 176 accessions. Markers detected at or above the threshold [–log10(p) = 6] were considered to be significant.
Supplemental Figure S1. Genome-wide analysis for amylose content (AAC) using the mini-core (MC-pub) data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S2. Genome-wide analysis for alkali spreading value (ASV) using the mini-core (MC-pub) data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S3. Genome-wide analysis for brown grain length (BrL) using the mini-core (MC-pub) data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S4. Genome-wide analysis for brown grain width (BrW) using the mini-core (MC-pub) data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S5. Genome-wide analysis for days to heading (HD) using the mini-core (MC-pub) data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along
chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S6. Genome-wide analysis for plant height (PHt) using the mini-core (MC-pub) data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S7. Genome-wide analysis for rough grain length (RgL) using the mini-core (MC-pub) data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S8. Genome-wide analysis for rough grain width (RgW) using the mini-core (MC-pub) data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S9. Genome-wide analysis for amylose content (AAC) using the HDMC data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S10. Genome-wide analysis for alkali spreading value (ASV) using the HDMC data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S11. Genome-wide analysis for brown grain length (BrL) using the HDMC data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less
than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S12. Genome-wide analysis for brown grain width (BrW) using the HDMC data. Genome-wide association Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S13. Genome-wide analysis for days to heading (HD) using the HDMC data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S14. Genome-wide analysis for plant height (PHt) using the HDMC data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S15. Genome-wide analysis for rough grain length (RgL) using the HDMC data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S16. Genome-wide analysis for rough grain width (RgW) using the HDMC data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S17. Genome-wide analysis for amylose content (AAC) using the MC09 data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal
group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S18. Genome-wide analysis for alkali spreading value (ASV) using the MC09 data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S19. Genome-wide analysis for brown grain length (BrL) using the MC09 data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S20. Genome-wide analysis for brown grain width (BrW) using the MC09 data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of the INDICA (indica, aus, indica–aus) varietal group (top) and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S21. Genome-wide analysis for days to heading (HD) using the MC09 data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S22. Genome-wide analysis for plant height (PHt) using the MC09 data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental Figure S23. Genome-wide analysis for grain protein using the MC09 data. Genome-wide analysis Manhattan and quantile–quantile plots with NGS SNPs for the trait evaluated. For each Manhattan plot, the x-axis displays SNPs along chromosomes and the y-axis displays the –log10(p) values for each SNP. The significance threshold is represented by the black horizontal line on each Manhattan plot. Single nucleotide polymorphisms with p-values less than 10^-6 were classified as significant. For each quantile–quantile plot, the x-axis displays the expected distribution of association across the SNPs and the y-axis displays the observed SNP distribution in –log10(p). The plots are of all subpopulations (top), INDICA (indica, aus, indica–aus) varietal group (middle), and the JAPONICA (temperate japonica, tropical japonica, temperate japonica–tropical japonica) varietal group (bottom). All aromatics and other admixtures were removed from analysis.
Supplemental File S1. The list of Oryza sativa accessions for each diversity panel included in this study.

Conflict of Interest Disclosure
The authors declare that there is no conflict of interest.
Acknowledgments
The authors greatly appreciated Dr. Fjellstrom’s (deceased) dedication to developing the molecular markers that are used in this study as well as in US and international breeding programs for improving rice grain quality. The authors acknowledge Naomi Gipson, Janis Delgado, and Heather Box for the analysis of amylose, ASV, and protein; Tiffany Sookaserm for grain chalk analysis; and Melissa Jia for her work in genotyping the RCS and MC with SSR markers. Mention of a trademark or proprietary product does not constitute a guarantee or warranty of the product by the U.S. Department of Agriculture, and does not imply its approval to the exclusion of other products that also can be suitable. USDA is an equal opportunity provider and employer. All experiments complied with the current laws of the United States, the country in which they were performed.

References
Agrama, H.A., W. Yan, M. Jia, R. Fjellstrom, and A.M. McClung. 2010. Genetic structure associated with diversity and geographic distribution in the USDA rice world collection. Nat. Sci. 2:247–291. doi:10.4236/ns.2010.24036
Agrama, H.A., W. Yan, F. Lee, F. Robert, M.H. Chen, M. Jia, et al. 2009. Genetic assessment of a mini-core subset developed from the USDA rice genebank. Crop Sci. 49:1336–1346. doi:10.2135/cropsci2008.06.0551
Akihiro, T., K. Mizuno, and T. Fujimura. 2005. Gene expression of ADP-glucose pyrophosphorylase and starch contents in rice cultured cells are cooperatively regulated by sucrose and ABA. Plant Cell Physiol. 46:937–946. doi:10.1093/pcp/pci101
Ambardekar, A.A., T.J. Siebenmorgen, P.A. Counce, S.B. Lanning, and A. Mauromoustakos. 2011. Impact of field-scale nighttime air temperatures during kernel development on rice milling quality. Field Crop. Res. 122:179–185. doi:10.1016/j.fcr.2011.03.012
Atwell, S., Y.S. Huang, B.J. Vilhjálmsson, G. Willems, M. Horton, Y. Li, et al. 2010. Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature 465:627–631. doi:10.1038/nature08800
Ayre, B.G. 2011. Membrane-transport systems for sucrose in relation to whole-plant carbon partitioning. Mol. Plant 4:377–394. doi:10.1093/mp/ssr014
Bao, J.S., H. Corke, and M. Sun. 2006. Nucleotide diversity in starch synthase IIa and validation of single nucleotide polymorphisms in relation to starch gelatinization temperature and other physicochemical properties in rice (Oryza sativa L.). Theor. Appl. Genet. 113:1171–1183. doi:10.1007/s00122-006-0355-6
Bian, J.M., H.H. He, C.J. Li, H. Shi, C.L. Zhu, X.S. Peng, et al. 2013. Identification and validation of a new grain weight QTL in rice. Genet. Mol. Res. 12:5623–5633. doi:10.4238/2013.November.18.11
Bockelman, H.E., R.H. Dilday, W. Yan, and D.M. Wesenberg. 2003. Germplasm collection, preservation, and utilization. John Wiley & Sons, New York.
Bradbury, P.J., Z. Zhang, D.E. Kroon, T.M. Casstevens, Y. Ramdoss, and E.S. Buckler. 2007. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 23:2633–2635. doi:10.1093/bioinformatics/btm308
Bryant, R., A. Proctor, M. Hawkridge, A. Jackson, K. Yeater, P. Counce, et al. 2011. Genetic variation and association mapping of silica concentration in rice hulls using a germplasm collection. Genetica (The Hague) 139:1383–1398. doi:10.1007/s10709-012-9637-x
Bryant, R.J., A.K. Jackson, K.M. Yeater, W.G. Yan, A.M. McClung, and R.G. Fjellstrom. 2013. Genetic variation and association mapping of protein concentration in brown rice using a diverse rice germplasm collection. Cereal Chem. J. 90:445–452. doi:10.1094/CCHEM-09-12-0122-R
Cai, X.-L., Z.-Y. Wang, Y.-Y. Xing, J.-L. Zhang, and M.-M. Hong. 1998. Aberrant splicing of intron 1 leads to the heterogenous 5´ UTR and decreased expression of the waxy gene in rice cultivars of intermediate amylose content. Plant J. 14:459–465. doi:10.1046/j.1365-313X.1998.00126.x
Calingacion, M., A. Laborte, A. Nelson, A. Resurreccion, J.C. Concepcion, V.D. Daygon, et al. 2014. Diversity of global rice markets and the science required for consumer-targeted rice breeding. PLoS One 9:e85106. doi:10.1371/journal.pone.0085106
Champagne, E.T., K.L. Bett-Garber, J.L. Thomson, and M.A. Fitzgerald. 2009. Unraveling the impact of nitrogen nutrition on cooked rice flavor and texture. Cereal Chem. 86:274–280. doi:10.1094/CCHEM-86-3-0274
Champagne, E.T., D.F. Wood, B.O. Juliano, and D.B. Bechtel. 2004. The rice grain and its gross composition. Rice Chem. Technol. 3:77–107. doi:10.1094/1891127349.004
Chen, J., W.H. Tang, M.M. Hong, and Z.Y. Wang. 2003. OsBP-73, a rice gene, encodes a novel DNA-binding protein with a SAP-like domain and its genetic interference by double-stranded RNA inhibits rice growth. Plant Mol. Biol. 52:579–590. doi:10.1023/A:1024854101965
Chen, M.H., C.J. Bergman, S.R.M. Pinson, and R.G. Fjellstrom. 2008a. Waxy gene haplotypes: Associations with apparent amylose content and the effect by the environment in an international rice germplasm collection. J. Cereal Sci. 47:536–545. doi:10.1016/j.jcs.2007.06.013
Chen, M.H., C.J. Bergman, S.R.M. Pinson, and R.G. Fjellstrom. 2008b. Waxy gene haplotypes: Associations with pasting properties in an international rice germplasm collection. J. Cereal Sci. 48:781–788. doi:10.1016/j.jcs.2008.05.004
Chen, M.H., R.G. Fjellstrom, E.F. Christensen, and C.J. Bergman. 2010. Development of three allele-specific codominant rice Waxy gene PCR markers suitable for marker-assisted selection of amylose content and paste viscosity. Mol. Breed. 26:513–523. doi:10.1007/s11032-010-9419-z
Cho, Y.G., M.Y. Eun, S.R. McCouch, and Y.A. Chae. 1994. The semidwarf gene, sd-1, of rice (Oryza sativa L.). II. Molecular mapping and marker-assisted selection. Theor. Appl. Genet. 89:54–59. doi:10.1007/BF00226982
Chun, A., J. Song, K.-J. Kim, and H.-J. Lee. 2009. Quality of head and chalky rice and deterioration of eating quality by chalky rice. J. Crop Sci. Biotechnol. 12:239–244. doi:10.1007/s12892-009-0142-4
Colleoni, C., D. Dauville, G. Mouille, A. Bulon, D. Gallant, B. Bouchet, et al. 1999. Genetic and biochemical evidence for the involvement of alpha-1,4 glucanotransferases in amylopectin synthesis. Plant Physiol. 120:993–1004. doi:10.1104/pp.120.4.993
Cooper, N.T.W., T.J. Siebenmorgen, and P.A. Counce. 2008. Effects of nighttime temperature during kernel development on rice physicochemical properties. Cereal Chem. J. 85:276–282. doi:10.1094/CCHEM-85-3-0276
Costanzo, S., A.K. Jackson, and S.A. Brooks. 2011. High-resolution mapping of Rsn1, a locus controlling sensitivity of rice to a necrosis-inducing phytotoxin from Rhizoctonia solani AG1-IA. Theor. Appl. Genet. 123:33–41. doi:10.1007/s00122-011-1564-1
Counce, P.A., R.J. Bryant, C.J. Bergman, R.C. Bautista, Y.J. Wang, T.J. Siebenmorgen, et al. 2005. Rice milling quality, grain dimensions, and starch branching as affected by high night temperatures. Cereal Chem. 82:645–648. doi:10.1094/CC-82-0645
Danecek, P., A. Auton, G. Abecasis, C.A. Albers, E. Banks, M.A. DePristo, et al. 2011. The variant call format and VCFtools. Bioinformatics 27:2156–2158. doi:10.1093/bioinformatics/btr330
Dobo, M., N. Ayres, G. Walker, and W.D. Park. 2010. Polymorphism in the GBSS gene affects amylose content in US and European rice germplasm. J. Cereal Sci. 52:450–456. doi:10.1016/j.jcs.2010.07.010
Dong, X., D. Zhang, J. Liu, Q.Q. Liu, H. Liu, L. Tian, et al. 2015. Plastidial disproportionating enzyme participates in starch synthesis in rice endosperm by transferring maltooligosyl groups from amylose and amylopectin to amylopectin. Plant Physiol. 169:2496–2512. doi:10.1104/pp.15.01411
Edwards, J.D., A.M. Baldo, and L.A. Mueller. 2016. Ricebase: A breeding and genetics platform for rice, integrating individual molecular markers, pedigrees and whole-genome-based data. Database 2016:baw107. doi:10.1093/database/baw107
Edwards, J.D., A.K. Jackson, and A.M. McClung. 2017. Genetic architecture of grain chalk in rice and interactions with a low phytic acid locus. Field Crop. Res. 205:116–123. doi:10.1016/j.fcr.2017.01.015
Eizenga, G.C., M.L. Ali, R.J. Bryant, K.M. Yeater, A.M. McClung, and S.R. McCouch. 2014. Registration of the rice diversity panel 1 for genomewide association studies. J. Plant Reg. 8:109–116. doi:10.3198/jpr2013.03.0013crmp
Fan, C., S. Yu, C. Wang, and Y. Xing. 2009. A causal C–A mutation in the second exon of GS3 highly associated with rice grain length and validated
as a functional marker. Theor. Appl. Genet. 118:465–472. doi:10.1007/ Kühn, C., and C.P.L. Grof. 2010. Sucrose transporters of higher plants. Curr.
s00122-008-0913-1 Opin. Plant Biol. 13:288–298. doi:10.1016/j.pbi.2010.02.001
Fitzgerald, M.A., and A.P. Resurreccion. 2009. Maintaining the yield of edible Lanning, S.B., T.J. Siebenmorgen, P.A. Counce, A.A. Ambardekar, and A.
rice in a warming world. Funct. Plant Biol. 36:1037–1045. doi:10.1071/ Mauromoustakos. 2011. Extreme nighttime air temperatures in 2010
FP09055 impact rice chalkiness and milling quality. Field Crop. Res. 124:132–
Gao, Y., C. Liu, Y. Li, A. Zhang, G. Dong, L. Xie, et al. 2016. QTL analysis for 136. doi:10.1016/j.fcr.2011.06.012
chalkiness of rice and fine mapping of a candidate gene for qACE9. Rice Larkin, P.D., and W.D. Park. 1999. Transcript accumulation and utiliza-
(N. Y.) 9:41. doi:10.1186/s12284-016-0114-5 tion of alternate and non-consensus splice sites in rice granule-
Gao, Z., D. Zeng, F. Cheng, Z. Tian, L. Guo, Y. Su, et al. 2011. ALK, the key bound starch synthase are temperature-sensitive and controlled
gene for gelatinization temperature, is a modifier gene for gel con- by a single-nucleotide polymorphism. Plant Mol. Biol. 40:719–727.
sistency in rice. J. Integr. Plant Biol. 53:756–765. doi:10.1111/j.1744- doi:10.1023/A:1006298608408
7909.2011.01065.x Larkin, P.D., and W.D. Park. 2003. Association of waxy gene single nucleotide
Gao, Z.Y., S.-C. Zhao, W.-M. He, L.-B. Guo, Y.-L. Peng, J.-J. Wang, et al. polymorphisms with starch characteristics in rice (Oryza sativa L.).
2013. Dissecting yield-associated loci in super hybrid rice by rese- Mol. Breed. 12:335–339. doi:10.1023/B:MOLB.0000006797.51786.92
quencing recombinant inbred lines and improving parental genome Lee, S.K., S.K. Hwang, M. Han, J.S. Eom, H.G. Kang, Y. Han, et al. 2007. Iden-
sequences. Proc. Natl. Acad. Sci. USA 110:14492–14497. doi:10.1073/ tification of the ADP-glucose pyrophosphorylase isoforms essential for
pnas.1306579110 starch synthesis in the leaf and seed endosperm of rice (Oryza sativa L.).
Gong, J., J. Miao, Y. Zhao, Q. Zhao, Q. Feng, Q. Zhan, et al. 2017. Dissecting Plant Mol. Biol. 65(4):531–546. doi:10.1007/s11103-007-9153-z
the genetic basis of grain shape and chalkiness traits in hybrid rice Li, K., J. Bao, H. Corke, and M. Sun. 2017. Association analysis of markers
using multiple collaborative populations. Mol. Plant 10:1353–1356. derived from starch biosynthesis related genes with starch physico-
doi:10.1016/j.molp.2017.07.014 chemical properties in the USDA rice mini-core collection. Front. Plant
Hakata, M., M. Kuroda, A. Ohsumi, T. Hirose, H. Nakamura, M. Mura- Sci. 8:242. doi:10.3389/fpls.2017.00424
matsu, et al. 2012. Overexpression of a rice TIFY gene increases grain Li, Y., C. Fan, Y. Xing, P. Yun, L. Luo, B. Yan, et al. 2014. Chalk5 encodes a
size through enhanced accumulation of carbohydrates in the stem. vacuolar H+-translocating pyrophosphatase influencing grain chalki-
Biosci. Biotechnol. Biochem. 76:2129–2134. doi:10.1271/bbb.120545 ness in rice. Nat. Genet. 46(4):398–404. doi:10.1038/ng.2923
Han, B., and X. Huang. 2013. Sequencing-based genome-wide associa- Li, J., R.Y. Qin, H. Li, R.F. Xu, C.H. Qiu, Y.C. Sun, et al. 2015. Identification
tion study in rice. Curr. Opin. Plant Biol. 16:133–138. doi:10.1016/j. and analysis of the mechanism underlying heat-inducible expression of
pbi.2013.03.006 rice aconitase 1. Plant Sci. 233:22–31. doi:10.1016/j.plantsci.2015.01.003
Hirano, H.Y., M. Eiguchi, and Y. Sano. 1998. A single base change altered the Li, X., W. Yan, H. Agrama, B. Hu, L. Jia, M. Jia, et al. 2010. Genotypic and
regulation of the Waxy gene at the posttranscriptional level during the phenotypic characterization of genetic differentiation and diversity in
domestication of rice. Mol. Biol. Evol. 15:978–987. doi:10.1093/oxford- the USDA rice mini-core collection. Genetica (The Hague) 138:1221–
journals.molbev.a026013 1230. doi:10.1007/s10709-010-9521-5
Hirano, T., T. Higuchi, M. Hirano, Y. Sugimura, and H. Michiyama. 2016. Li, X., W. Yan, H. Agrama, L. Jia, A. Jackson, et al. 2012. Unraveling the
Two β-amylase genes, OsBAM2 and OsBAM3, are involved in starch complex trait of harvest index with association mapping in rice (Oryza
remobilization in rice leaf sheaths. Plant Prod. Sci. 19:291–299. doi:10.10 sativa L.). PLoS One 7:e29350. doi:10.1371/journal.pone.0029350
80/1343943X.2016.1140008 Li, X., W. Yan, H. Agrama, L. Jia, X. Shen, A. Jackson, et al. 2011. Mapping
Hirose, T., T. Ohdan, Y. Nakamura, and T. Terao. 2006. Expression profil- QTLs for improving grain yield using the USDA rice mini-core collec-
ing of genes related to starch synthesis in rice leaf sheaths during tion. Planta 234:347–361. doi:10.1007/s00425-011-1405-0
the heading period. Physiol. Plant. 128:425–435. doi:10.1111/j.1399- Liakat Ali, M., A.M. McClung, M.H. Jia, J.A. Kimball, S.R. McCouch,
3054.2006.00758.x and C.E. Georgia. 2011. A rice diversity panel evaluated for genetic
Huang, X., Q. Feng, Q. Qian, Q. Zhao, L. Wang, A. Wang, et al. 2009. High- and agro-morphological diversity between subpopulations and its
throughput genotyping by whole-genome resequencing. Genome Res. geographic distribution. Crop Sci. 51:2021–2035. doi:10.2135/crop-
19:1068–1076. doi:10.1101/gr.089516.108 sci2010.11.0641
Ishimaru, K., T. Hirose, N. Aoki, S. Takahashi, K. Ono, S. Yamamoto, et al. Lisle, A.J., M. Martin, and M.A. Fitzgerald. 2000. Chalky and translucent rice
2001. Antisense expression of a rice sucrose transporter OsSUT1 in rice grains differ in starch composition and structure and cooking proper-
(Oryza sativa L.). Plant Cell Physiol. 42(10):1181–1185. doi:10.1093/pcp/ ties. Cereal Chem. J. 77:627–632. doi:10.1094/CCHEM.2000.77.5.627
pce148 Lunn, J.E., R. Feil, J.H.M. Hendriks, Y. Gibon, R. Morcuende, D. Osuna, et al.
Isshiki, M., K. Morino, M. Nakajima, R.J. Okagaki, S.R. Wessler, T. Izawa, et 2006. Sugar-induced increases in trehalose 6-phosphate are correlated
al. 1998. A naturally occurring functional allele of the rice waxy locus with redox activation of ADPglucose pyrophosphorylase and higher
has a GT to TT mutation at the 5´ splice site of the first intron. Plant J. rates of starch synthesis in Arabidopsis thaliana. Biochem. J. 397:139–
15:133–138. doi:10.1046/j.1365-313X.1998.00189.x 148. doi:10.1042/BJ20060083
Jia, L., W. Yan, H.A. Agrama, K. Yeater, X. Li, B. Hu, et al. 2011. Searching for Maclean, J.L., D.C. Dawe, and B. Hardy. and G.P. Hettel. 2002. Rice almanac:
germplasm resistant to sheath blight from the USDA rice core collec- Source book for the most important economic activity on earth. CABI
tion. Crop Sci. 51:1507–1517. doi:10.2135/cropsci2010.10.0581 Publishing, Wallingford, UK.
Jia, L., W. Yan, C. Zhu, H.A. Agrama, A. Jackson, K. Yeater, et al. 2012. Allelic Martin, M., and M.A. Fitzgerald. 2002. Proteins in rice grains influence cook-
analysis of sheath blight resistance with association mapping in rice. ing properties! J. Cereal Sci. 36:285–294. doi:10.1006/jcrs.2001.0465
PLoS One 7:e32703. doi:10.1371/journal.pone.0032703 Mather, K.A., A.L. Caicedo, N.R. Polato, K.M. Olsen, S. McCouch, and M.D.
Jin-Yue, S., W. Qing-Mei, C. Jia, and W. Xue-Chen. 2004. Characteristics of Purugganan. 2007. The extent of linkage disequilibrium in rice (Oryza
triose phosphate/phosphate translocator from wheat and its role in the sativa L.). Genetics 177:2223–2232. doi:10.1534/genetics.107.079616
distribution of assimilates. Acta Bot. Sin. 46:294–301. Matsuno, K., and T. Fujimura. 2014. Induction of phytic acid synthesis by
Kadan, R.S., R.J. Bryant, and J.A. Miller. 2008. Effects of milling on func- abscisic acid in suspension-cultured cells of rice. Plant Sci. 217–218:152–
tional properties of rice flour. J. Food Sci. 73: E151–E154. doi:10.1111/ 157. doi:10.1016/j.plantsci.2013.12.015
j.1750-3841.2008.00720.x McCouch, S., G.J. Baute, J. Bradeen, P. Bramel, P.K. Bretting, E. Buck-
Khush, G.S., C.M. Paule, and N.M. De La Cruz. 1978. Rice grain quality ler, et al. 2013. Agriculture: Feeding the future. Nature 499:23–24.
evaluation and improvement at IRRI. In: N.C. Brady, editor, Proceed- doi:10.1038/499023a
ings of the Workshop on Chemical Aspects of Rice Grain Quality. IRRI, McCouch, S.R., M.H. Wright, C.-W. Tung, L.G. Maron, K.L. McNally,
Los Banos, Philippines. p. 21–31. M. Fitzgerald, et al. 2016. Open access resources for genome-wide
Korte, A., and A. Farlow. 2013. The advantages and limitations of trait analy- association mapping in rice. Nat. Commun. 7:10532. doi:10.1038/
sis with GWAS: A review. Plant Methods 9:29. doi:10.1186/1746-4811- ncomms10532
9-29
McKenna, A., M. Hanna, E. Banks, A. Sivachenko, K. Cibulskis, A. Kernytsky, et al. 2010. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20:1297–1303. doi:10.1101/gr.107524.110
Monna, L., N. Kitazawa, R. Yoshino, J. Suzuki, H. Masuda, Y. Maehara, et al. 2002. Positional cloning of rice semidwarfing gene, sd-1: Rice “green revolution gene” encodes a mutant enzyme involved in gibberellin synthesis. DNA Res. 9:11–17. doi:10.1093/dnares/9.1.11
Muthayya, S., J.D. Sugimoto, S. Montgomery, and G.F. Maberly. 2014. An overview of global rice production, supply, trade, and consumption. Ann. N. Y. Acad. Sci. 1324:7–14. doi:10.1111/nyas.12540
Nakamura, H., M. Hakata, K. Amano, A. Miyao, N. Toki, M. Kajikawa, et al. 2007. A genome-wide gain-of-function analysis of rice genes using the FOX-hunting system. Plant Mol. Biol. 65:357–371. doi:10.1007/s11103-007-9243-y
Ouyang, S., W. Zhu, J. Hamilton, H. Lin, M. Campbell, K. Childs, et al. 2007. The TIGR rice genome annotation resource: Improvements and new features. Nucleic Acids Res. 35:D883–D887. doi:10.1093/nar/gkl976
Park, S., M.-H. Cho, S.H. Bhoo, J.-S. Jeon, Y.-K. Kwon, and T.-R. Hahn. 2007. Altered sucrose synthesis in rice plants with reduced activity of fructose-6-phosphate 2-kinase/fructose-2,6-bisphosphatase. J. Plant Biol. 50:38. doi:10.1007/BF03030598
Perera, I., S. Seneweera, and N. Hirotsu. 2018. Manipulating the phytic acid content of rice grain toward improving micronutrient bioavailability. Rice (N. Y.) 11(1):4. doi:10.1186/s12284-018-0200-y
Ponnu, J., V. Wahl, and M. Schmid. 2011. Trehalose-6-phosphate: Connecting plant metabolism and development. Front. Plant Sci. 2:70. doi:10.3389/fpls.2011.00070
Raj, A., M. Stephens, and J.K. Pritchard. 2014. fastSTRUCTURE: Variational inference of population structure in large SNP data sets. Genetics 197:573–589. doi:10.1534/genetics.114.164350
Reinders, A., Y. Sun, K.L. Karvonen, and J.M. Ward. 2012. Identification of amino acids important for substrate specificity in sucrose transporters using gene shuffling. J. Biol. Chem. 287:30296–30304. doi:10.1074/jbc.M112.372888
Ruduś, I., M. Sasiak, and J. Kepczyński. 2013. Regulation of ethylene biosynthesis at the level of 1-aminocyclopropane-1-carboxylate oxidase (ACO) gene. Acta Physiol. Plant. 35:295–307. doi:10.1007/s11738-012-1096-6
Sakai, H., S.S. Lee, T. Tanaka, H. Numa, J. Kim, Y. Kawahara, et al. 2013. Rice annotation project database (RAP-DB): An integrative and interactive database for rice genomics. Plant Cell Physiol. 54:e6–e6. doi:10.1093/pcp/pcs183
Sato, H., Y. Suzuki, M. Sakai, and T. Imbe. 2002. Molecular characterization of Wx-mq, a novel mutant gene for low-amylose content in endosperm of rice (Oryza sativa L.). Breed. Sci. 52:131–135. doi:10.1270/jsbbs.52.131
Schläppi, M.R., A.K. Jackson, G.C. Eizenga, A. Wang, C. Chu, Y. Shi, et al. 2017. Assessment of five chilling tolerance traits and GWAS mapping in rice using the USDA mini-core collection. Front. Plant Sci. 8:957. doi:10.3389/fpls.2017.00957
Schluepmann, H., A. van Dijken, M. Aghdasi, B. Wobbes, M. Paul, and S. Smeekens. 2004. Trehalose mediated growth inhibition of Arabidopsis seedlings is due to trehalose-6-phosphate accumulation. Plant Physiol. 135:879–890. doi:10.1104/pp.104.039503
Shi, W., R. Muthurajan, H. Rahman, J. Selvam, S. Peng, Y. Zou, et al. 2013. Source–sink dynamics and proteomic reprogramming under elevated night temperature and their impact on rice yield and grain quality. New Phytol. 197:825–837. doi:10.1111/nph.12088
Shomura, A., T. Izawa, K. Ebana, T. Ebitani, H. Kanegae, S. Konishi, et al. 2008. Deletion in a gene associated with grain size increased yields during rice domestication. Nat. Genet. 40:1023–1028. doi:10.1038/ng.169
Solovieff, N., C. Cotsapas, P.H. Lee, S.M. Purcell, and J.W. Smoller. 2013. Pleiotropy in complex traits: challenges and strategies. Nat. Rev. Genet. 14(7):483–495. doi:10.1038/nrg3461
Song, X.-J., W. Huang, M. Shi, M.-Z. Zhu, and H.-X. Lin. 2007. A QTL for rice grain width and weight encodes a previously unknown RING-type E3 ubiquitin ligase. Nat. Genet. 39:623–630. doi:10.1038/ng2014
Spielmeyer, W., M.H. Ellis, and P.M. Chandler. 2002. Semidwarf (sd-1), “green revolution” rice, contains a defective gibberellin 20-oxidase gene. Proc. Natl. Acad. Sci. USA 99:9043–9048. doi:10.1073/pnas.132266399
Storey, J.D., and R. Tibshirani. 2003. Statistical significance for genomewide studies. Proc. Natl. Acad. Sci. USA 100:9440–9445. doi:10.1073/pnas.1530509100
Sun, W., Q. Zhou, Y. Yao, X. Qiu, K. Xie, and S. Yu. 2015. Identification of genomic regions and the isoamylase gene for reduced grain chalkiness in rice. PLoS One 10:e0122013. doi:10.1371/journal.pone.0122013
Sweeney, M., and S. McCouch. 2007. The complex history of the domestication of rice. Ann. Bot. (Lond.) 100:951–957. doi:10.1093/aob/mcm128
Takano-Kai, N., K. Doi, and A. Yoshimura. 2011. GS3 participates in stigma exsertion as well as seed length in rice. Breed. Sci. 61:244–250. doi:10.1270/jsbbs.61.244
Takano-Kai, N., H. Jiang, A. Powell, S. McCouch, I. Takamure, N. Furuya, et al. 2013. Multiple and independent origins of short seeded alleles of GS3 in rice. Breed. Sci. 63:77–85. doi:10.1270/jsbbs.63.77
Tang, T., H. Xie, Y. Wang, B. Lü, and J. Liang. 2009. The effect of sucrose and abscisic acid interaction on sucrose synthase and its relationship to grain filling of rice (Oryza sativa L.). J. Exp. Bot. 60:2641–2652. doi:10.1093/jxb/erp114
Tashiro, T., and M. Ebata. 1975. Studies on white-belly rice kernels. 4. Opaque rice endosperm viewed with a scanning electron microscope. Proc. Crop Sci. Soc. Japan 44:205–214. doi:10.1626/jcs.44.205
Tashiro, T., and I. Wardlaw. 1991. The effect of high temperature on kernel dimensions and the type and occurrence of kernel damage in rice. Aust. J. Agric. Res. 42:485–496. doi:10.1071/AR9910485
Toyota, K., M. Tamura, T. Ohdan, and Y. Nakamura. 2006. Expression profiling of starch metabolism-related plastidic translocator genes in rice. Planta 223:248–257. doi:10.1007/s00425-005-0128-5
Turner, S.D. 2014. qqman: An R package for visualizing GWAS results using Q-Q and Manhattan plots. bioRxiv 5165. doi:10.1101/005165
Udomchalothorn, T., S. Maneeprasobsuk, E. Bangyeekhun, P. Boon-Long, and S. Chadchawan. 2009. The role of the bifunctional enzyme, fructose-6-phosphate-2-kinase/fructose-2,6-bisphosphatase, in carbon partitioning during salt stress and salt tolerance in rice (Oryza sativa L.). Plant Sci. 176:334–341. doi:10.1016/j.plantsci.2008.11.009
Umemoto, T., and N. Aoki. 2005. Single-nucleotide polymorphisms in rice starch synthase IIa that alter starch gelatinisation and starch association of the enzyme. Funct. Plant Biol. 32:763–768. doi:10.1071/FP04214
Wan, X.Y., J.M. Wan, J.F. Weng, L. Jiang, J.C. Bi, C.M. Wang, et al. 2005. Stability of QTLs for rice grain dimension and endosperm chalkiness characteristics across eight environments. Theor. Appl. Genet. 110:1334–1346. doi:10.1007/s00122-005-1976-x
Wang, H., X. Xu, F.G. Vieira, Y. Xiao, Z. Li, J. Wang, et al. 2016. The power of inbreeding: NGS-based GWAS of rice reveals convergent evolution during rice domestication. Mol. Plant 9:975–985. doi:10.1016/j.molp.2016.04.018
Wang, X., Y. Pang, C. Wang, K. Chen, Y. Zhu, C. Shen, et al. 2017. New candidate genes affecting rice grain appearance and milling quality detected by genome-wide and gene-based association analyses. Front. Plant Sci. 7:1998. doi:10.3389/fpls.2016.01998
Wingler, A., T. Fritzius, A. Wiemken, T. Boller, and R.A. Aeschbacher. 2000. Trehalose induces the ADP-glucose pyrophosphorylase gene, ApL3, and starch synthesis in Arabidopsis. Plant Physiol. 124:105–114. doi:10.1104/pp.124.1.105
Yan, B., N. Tondi Yacouba, J. Chen, Y. Wang, G. Gao, Q. Zhang, et al. 2014a. Analysis of minor quantitative trait loci for eating and cooking quality traits in rice using a recombinant inbred line population derived from two indica cultivars with similar amylose content. Mol. Breed. 34:2151–2163. doi:10.1007/s11032-014-0170-8
Yan, W., A. Jackson, M. Jia, W. Zhou, H. Xiong, and R. Bryant. 2014b. Association mapping of four important traits using the USDA rice mini-core collection. In: J. Bao, editor, Rice - Germplasm, Genetics and Improvement. IntechOpen, London, UK. p. 105–142.
Yan, W., J.N. Rutger, R.J. Bryant, H.E. Bockelman, R.G. Fjellstrom, M.-H. Chen, et al. 2007. Development and evaluation of a core subset of the USDA rice germplasm collection. Crop Sci. 47:869–878. doi:10.2135/cropsci2006.07.0444
Yan, W.G., Y. Li, H.A. Agrama, D. Luo, F. Gao, X. Lu, et al. 2009. Association mapping of stigma and spikelet characteristics in rice (Oryza sativa L.). Mol. Breed. 24:277–292. doi:10.1007/s11032-009-9290-y

Yan, W.G., J.N. Rutger, H.E. Bockelman, and T.H. Tai. 2003a. Development of a core collection from the USDA rice germplasm collection. In: R.J. Norman, J.-F. Meullenet, and K.A.K. Moldenhauer, editors, B.R. Wells Rice Research Studies 2003. Arkansas Agric. Exp. Sta. Res. Ser. 517. Univ. of Arkansas System, Division of Agriculture, Cooperative Extension Service, Little Rock, AR. p. 88–96.
Yan, W.G., J.N. Rutger, H.E. Bockelman, and T.H. Tai. 2003b. Germplasm accessions resistant to straighthead in the USDA rice core collection. In: R.J. Norman, J.-F. Meullenet, and K.A.K. Moldenhauer, editors, B.R. Wells Rice Research Studies 2003. Arkansas Agric. Exp. Sta. Res. Ser. 517. Univ. of Arkansas System, Division of Agriculture, Cooperative Extension Service, Little Rock, AR. p. 97–102.
Yan, W.G., J.N. Rutger, H.E. Bockelman, and T.H. Tai. 2005a. Agronomic evaluation and seed stock establishment of the USDA rice core collection. In: B.R. Wells, R.J. Norman, and J.-F. Meullenet, editors, B.R. Wells Rice Research Studies 2005. Arkansas Agric. Exp. Sta. Res. Ser. 529. Univ. of Arkansas System, Division of Agriculture, Cooperative Extension Service, Little Rock, AR. p. 63–68.
Yan, W.G., J.N. Rutger, H.E. Bockelman, and T.H. Tai. 2005b. Evaluation of kernel characteristics of the USDA rice core collection. In: B.R. Wells, R.J. Norman, and J.-F. Meullenet, editors, B.R. Wells Rice Research Studies 2005. Arkansas Agric. Exp. Sta. Res. Ser. 529. Univ. of Arkansas System, Division of Agriculture, Cooperative Extension Service, Little Rock, AR. p. 69–74.
Yan, W.H., P. Wang, H.X. Chen, H.J. Zhou, Q.P. Li, C.R. Wang, et al. 2011. A major QTL, Ghd8, plays pleiotropic roles in regulating grain productivity, plant height, and heading date in rice. Mol. Plant 4:319–330. doi:10.1093/mp/ssq070
Yano, M., Y. Katayose, M. Ashikari, U. Yamanouchi, L. Monna, T. Fuse, et al. 2000. Hd1, a major photoperiod sensitivity quantitative trait locus in rice, is closely related to the Arabidopsis flowering time gene CONSTANS. Plant Cell 12(12):2473–2484. doi:10.1105/tpc.12.12.2473
Yoshida, K., T. Fujiwara, and S. Naito. 2002. The synergistic effects of sugar and abscisic acid on myo-inositol-1-phosphate synthase expression. Physiol. Plant. 114:581–587. doi:10.1034/j.1399-3054.2002.1140411.x
Zhang, Z., E. Ersoz, C.-Q. Lai, R.J. Todhunter, H.K. Tiwari, M.A. Gore, et al. 2010. Mixed linear model approach adapted for genome-wide association studies. Nat. Genet. 42:355–360. doi:10.1038/ng.546
Zhao, K., C.-W. Tung, G.C. Eizenga, M.H. Wright, M.L. Ali, A.H. Price, et al. 2011. Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nat. Commun. 2:467. doi:10.1038/ncomms1467
Zhao, X., V.D. Daygon, K.L. McNally, R.S. Hamilton, F. Xie, R.F. Reinke, et al. 2016. Identification of stable QTLs causing chalk in rice grains in nine environments. Theor. Appl. Genet. 129:141–153. doi:10.1007/s00122-015-2616-8
Zhou, L.J., W.T. Sheng, J. Wu, C.Q. Zhang, Q.Q. Liu, and Q.Y. Deng. 2015. Differential expressions among five Waxy alleles and their effects on the eating and cooking qualities in specialty rice cultivars. J. Integr. Agric. 14:1153–1162. doi:10.1016/S2095-3119(14)60850-9
Zhu, C., M. Gore, E.S. Buckler, and J. Yu. 2008. Status and prospects of association mapping in plants. Plant Genome 1:5–20. doi:10.3835/plantgenome2008.02.0089

The Plant Genome, vol. 12, no. 1, March 2019
