TY - JOUR
T1 - Identification, structure, and differential expression of members of a BURP domain containing protein family in soybean
AU - Granger, Cheryl
AU - Coryell, Virginia
AU - Khanna, Anupama
AU - Keim, Paul
AU - Vodkin, Lila
AU - Shoemaker, Randy C.
PY - 2002
Y1 - 2002
N2 - Expressed sequence tags (ESTs) exhibiting homology to a BURP domain containing gene family were identified from the Glycine max (L.) Merr. EST database. These ESTs were assembled into 16 contigs of variable sizes and lengths. Consistent with the structure of known BURP domain containing proteins, the translation products exhibit a modular structure consisting of a C-terminal BURP domain, an N-terminal signal sequence, and a variable internal region. The soybean family members exhibit 35-98% similarity in a ∼100-amino-acid C-terminal region, and a phylogenetic tree constructed using this region shows that some soybean family members group together in closely related pairs, triplets, and quartets, whereas others remain as singletons. The structure of these groups suggests that multiple gene duplication events occurred during the evolutionary history of this family. The depth and diversity of G. max EST libraries allowed tissue-specific expression patterns of the putative soybean BURPs to be examined. Consistent with known BURP proteins, the newly identified soybean BURPs have diverse expression patterns. Furthermore, putative paralogs can have both spatially and quantitatively distinct expression patterns. We discuss the functional and evolutionary implications of these findings, as well as the utility of EST-based analyses for identifying and characterizing gene families.
KW - BURP domain
KW - Expressed sequence tag
KW - Gene duplication
KW - Glycine max
UR - http://www.scopus.com/inward/record.url?scp=0036435948&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0036435948&partnerID=8YFLogxK
U2 - 10.1139/g02-032
DO - 10.1139/g02-032
M3 - Article
C2 - 12175072
AN - SCOPUS:0036435948
SN - 0831-2796
VL - 45
SP - 693
EP - 701
JO - Genome / National Research Council Canada = Genome / Conseil national de recherches Canada
JF - Genome / National Research Council Canada = Genome / Conseil national de recherches Canada
IS - 4
ER -