Skip Navigation

Integrative and Comparative Biology 2004 44(5):358-365; doi:10.1093/icb/44.5.358
© 2004 by The Society for Integrative and Comparative Biology
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (5)
Right arrow Request Permissions
Google Scholar
Right arrow Articles by Stellwag, E. J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?


Are Genome Evolution, Organism Complexity and Species Diversity Linked?1

Edmund J. Stellwag2,1
1 Department of Biology, East Carolina University, Greenville, North Carolina 27858-4353


    SYNOPSIS
 TOP
 SYNOPSIS
 INTRODUCTION
 References
 
Fishes represent an extremely diverse group of vertebrates with a deeply rooted evolutionary history. An understanding of their biology is being enriched by advancements in phylogenetic analysis and genomics, which are providing the framework for deciphering their evolutionary relationships and the molecular details that govern their evolution. Recent discoveries about the structure and function of fish genomes suggest the occurrence of large-scale genome level duplications within the stem lineage of the Actinopterygii (ray-finned fishes). However, little is understood about the effects, if any, of this event in relation to organismal complexity or species diversity. In this manuscript, I propose a hypothesis to test whether there is a likely relationship linking vertebrate genomes, organisms and species diversity. In so doing, I discuss the problems inherent in defining the complexity of genomes and organisms and provide simplifying assumptions that enable a preliminary test of the hypothesis. Results of this test suggest the likelihood of linkage between large-scale genome changes and organismal complexity early in vertebrate evolution but not in the evolution of the ray-finned fishes. A particularly interesting implication of the results is that there may be a limit to the effects of genome level duplications on organismal complexity and species diversity.


    INTRODUCTION
 TOP
 SYNOPSIS
 INTRODUCTION
 References
 
An examination of the patterns and processes involved in the evolution of the actinopterygians (ray-finned fishes) requires an analysis of the genomic evolution accompanying biological change in this vastly divergent and highly successful vertebrate group. Of particular interest in this regard, is the influence of genome evolution on organism complexity and speciation. It has become commonplace to read that speciation in ray-finned fishes is unmatched among descendants of the other major vertebrate lineage, the sarcopterygians (Amores et al., 1998Go; Meyer and Schartl, 1999Go; Taylor et al., 2001Go). Provided that the actinopterygians did indeed speciate more extensively than the sarcopterygians, it seems reasonable to ask if there are discernable features related to the evolution of fish genomes that intimate the underlying nature of the molecular processes driving this divergence?

In 1998, Amores and colleagues (1998)Go published a manuscript describing the Hox cluster organization of the zebrafish (Danio rerio) in which they demonstrated unequivocally, that this cypriniform fish possesses an outsized number of Hox clusters relative to mammals. In their manuscript, they argued "that teleosts, the most species-rich group of vertebrates, appear to have more copies of these developmental regulatory genes than do mammals, despite less complexity in the anterior-posterior axis." Their observation that Hox cluster complexity in teleosts appears to be independent of organismal complexity contradicts the consensus that chordate Hox cluster, genome, and organismal complexity increase in parallel (Holland, 1999Go). As an explanation for this apparent disjunction, they hypothesized that the increased Hox gene complement of ray-finned fishes likely contributed in some fashion to species diversification but presumably not organismal complexity. The implication being that the duplication propagated species diversification rather than organismal innovation. Based on the presumption that Hox cluster duplications were indicative of a total genome duplication event unique to early actinopterygian evolution, Meyer and Schartl (1999)Go broadened Amores's hypothesis by proposing that genome duplication not just duplication of Hox clusters drove species diversification in actinopterygians (Fig. 1). While these hypotheses are appealing to those who advocate genome duplications as a driving force in the evolution of organismal novelty, it leaves open the question of why a duplication within the stem lineage of the Actinopteryii, unlike similar deeply rooted vertebrate genome duplications, may have fueled species diversification in the apparent absence of a concordant increase in organism level complexity? A related but more fundamental question is whether large-scale genetic duplications consistently presage or parallel advancements in organism complexity, whether they fuel diversification, or if they generally do both or neither?



View larger version (19K):
[in this window]
[in a new window]
 
FIG. 1. Vertebrate phylogeny showing putative genome duplication events. (Modified from Prince, 2002Go)

 
Although genome duplications are less common among vertebrates than within other major taxonomic groups, most notably the plants, there is a growing body of evidence to support that genome level or at least large-scale duplications have occurred repeatedly in vertebrate evolutionary history (Gibson and Spring, 2000Go; Taylor et al., 2001Go; Furlong and Holland, 2002Go; Gu et al., 2002Go; McLysaght et al., 2002Go). Interestingly, these large-scale vertebrate duplications, roughly defined as those occurring prior to the divergence of the actinopterygian and sarcopterygian lineages (Fig. 1), seem to correlate well with accepted notions of increasing organism complexity, and arguably occur concurrently with the extensive species diversification characteristic of vertebrates (Durrand, 2003Go). This correlation of large-scale genetic duplication, organism complexity and species diversification is indicative of a coordinated relationship linking vertebrate genomes, organisms and their capacity for diversification. If such a correlation exists, it might be expected that large-scale genetic shifts, particularly those that trace back to ancient polyploidizations would have similar effects. However, this seems not to be the case. Genome duplications that have occurred more recently than those that marked the inception of the vertebrates, e.g., within the stem lineage of the ray-finned fishes as well as within more restricted teleost lineages, seem relatively unremarkable in relation to their apparent influence on organismal innovation.

If there is a discrepancy in the relationship between large scale genome shifts, organism complexity and species diversification at different times in vertebrate evolution what might be the reason for the differences? A possible explanation is a differential effect in which the duplication events that occurred earlier at a time when vertebrates were less complex promoted innovation and diversification whereas later events that occurred when overall organismal complexity was greater were ineffectual or extremely limited in effect. This explanation is valid under the premise that evolutionary innovation and species diversification may have inherent limits, which would generate an apparent time-dependent plateau in innovation and diversification.

The earliest genome duplication events are hypothesized to have occurred at the inception of the vertebrates as they evolved from their chordate ancestors (Holland et al., 1994Go; Sidow, 1996Go). This was a time when vertebrates were arguably less complex than their descendants. More recent duplications, particularly the duplications hypothesized to have occurred early in the actinopterygian lineage and again in the salmonid and catostomid lineages may have failed to usher in a similar period of developmental and morphological innovation (Taylor et al., 2003Go; Bailey et al., 1978Go).

An alternative explanation is that these temporally distinct genome shifts are equally effective but that insufficient time has elapsed since the occurrence of the more recent duplications, which date between 320– 450 mya for actinopterygians (Kumar and Hedges, 1998Go; Chiu et al., 2004; Vandepoele et al., 2004Go) or 40–70 mya for salmonids (Hartley, 1987Go), and the expression of their "evolutionary potential." Initial estimates indicate that the linkage between the earliest vertebrate duplications and advancements in organismal innovation is relatively tight, i.e., within tens of millions of years (McLysaght et al., 2002Go). This estimate correlates well with the measured 4 million year half-life of duplicated genes (Lynch and Conery, 2000Go), an estimate that suggests functional differentiation leading selection to maintenance of duplicated genes resulting from genome level duplications must occur within tens of millions of years rather than hundreds of millions of years. If correct these results suggest that the 320–450 million years since the proposed actinopterygian duplication is sufficient time to observe an effect on organismal innovation and species diversification or for the duplicated genes to vanish into oblivion. Yet another scenario is that large-scale vertebrate duplications are unlinked to organismal innovation or species diversification and that the observed differences are stochastic and ineffectual. This explanation requires that some genetic or epigenetic system other than gene duplication is responsible for fueling the observed innovations and species diversification. Unlike invertebrates, the general gene expression systems of vertebrates appear to be shared widely, including a vertebrate-specific global genome methylation system, indicative that in vertebrates, at least, epigenetic systems suitable to account for vertebrate innovations and diversification has not yet been discovered (Bird, 1995Go).

In the following discussion, I present a hypothesis that seeks to provide an explanation for the seemingly incongruent relationships linking genome duplication events, advancement in organismal complexity and the degree of species diversification in vertebrate evolution. In so doing, I briefly examine current perceptions and problems related to understanding genome and organismal complexity, and provide a simplifying framework for addressing the effects of genome duplication on genetic and organismal complexity that enable a test of the hypothesis.

The genetic value optimum: a possible link between genome and organismal evolution
Despite considerable recent interest in the effects of large-scale genetic duplication on organismal evolution, the current debate is focused almost exclusively on resolution of the 2R hypothesis, which proposes that two rounds of genome duplication occurred early in vertebrate history (Sidow, 1996Go; Hughes, 1999Go; Furlong and Holland, 2002Go). This debate, while useful in attempting to shed light on the evolutionary history of the vertebrate genome, belies a growing consensus, even among critics of the 2R hypothesis, that large-scale genome expansion occurred in vertebrate evolutionary history and that it likely played a central role in vertebrate evolution (Wolfe, 2001Go). Provided the agreement that large scale genome expansion is a hallmark of vertebrate evolutionary history and that it likely influenced organismal evolution, it seems useful to create a theoretical framework for examining the interrelationships among genome expansion, organismal evolution and species diversification. In an attempt to formulate a theoretical basis for exploring these linkages, I propose the genetic-organism value optimum (GOVO) hypothesis. The primary purpose of this hypothesis is to provide a formal construct for testing the relationship between genome changes, organismal innovation and species diversification.

The GOVO hypothesis posits that there is an intrinsic relationship between genetic value (herein defined as the total functional genome information content) and organismal complexity (herein defined as the sum of the developmental, morphological, and functional complexity of organisms) wherein increases or decreases in either occur in parallel. While this hypothesis is simply a formalization of statements that have been articulated previously in an informal way, it provides a testable framework for evaluation of the linkage between genes, genomes and organisms. There are two major predictions of the GOVO hypothesis: the first is that abrupt changes in genetic value will be matched by similar saltational changes in organismal complexity, and the second is that genetic value should be relatively constant within lineages that demonstrate stable levels of organismal complexity. These predictions of the GOVO hypothesis are most likely to be manifested within the context of large-scale genetic duplications or during extended periods in which organismal complexity within a lineage exhibits stability. A corollary prediction is that abrupt increases in genetic value resulting from genome level duplications that are not paralleled by matching changes in organismal complexity should be followed by a period of declining genetic value culminating in a return of the genetic value to the preduplication level. This prediction infers that large-scale duplications may be inconsequential in the absence of concurrent organismal innovation or species diversification, a likelihood that is only rarely mentioned and not explicitly predicted by existing genome duplication theories (Wolfe, 2001Go). This line of reasoning also runs contrary to the common presumption that large-scale genetic duplications characteristically parallel a stage of major organismal innovation or accelerated species diversification (Ohno, 1970Go; Bird, 1995Go; Holland, 1999Go).

Based on these predictions, explicit tests of the GOVO hypothesis include determining what happens to genetic value after large-scale genetic duplications both in the presence and absence of concordant increases in organismal complexity. Since testing the GOVO hypothesis requires understanding as well as measuring genetic value and organismal complexity, it is important to discuss contemporary understanding of these issues to define an appropriate system for testing the hypothesis.

Genetic value and organismal complexity
Identification of a system for testing the GOVO hypothesis, necessitates an appropriate approach for measuring, or at least addressing, genetic value and organismal complexity. The scientific literature abounds with an ever more protracted debate about the most acceptable means to measure genetic (genomic) and organismal complexity (Hahn and Wray, 2002Go; Adami, 2002Go). However, even the most ardent critics of a relationship between genome duplication and organismal complexity acknowledge that the historical record of evolution provides evidence of a positive correlation between genetic value and organismal complexity (Hughes et al., 2001Go). The difficulties in refining this relationship then seem rooted largely in reaching a consensus about defining and accurately quantifying genetic value and organismal complexity.

Efforts to define and quantify genetic value have explored related concepts including the historically most prevalent but clearly inadequate metric, the C-value (a measure of genome length). Use of the C-value as a means to estimate functional genetic complexity, i.e., the genetic component responsible for organismal determination, has been cast aside because it has proven impossible to link the vast tracts of the eukaryotic genome composed of simple repetitive elements to organismal function (Zuckerkandl, 2002Go). More recently, the G-value (the total number of genes in the genome) and the I-value (the number of expressed gene products or even the information content of the genome) have been proposed as more adequate measures of genetic value (Betran and Long, 2002Go; Weinberger, 2002Go). The G-value has been criticized as a measure of genetic value in that it fails to take into account the plethora of post-transcriptional and post-translational processes through which genes generate products needed in constructing organisms (Hahn and Wray, 2002Go). The problem of the g-value might therefore be equated with the inadequacy of existing definitions for the gene. Similarly, the I-value, while conceptually encompassing the expressivity of the genome, is so inclusive and mechanistically ill-defined as to render the concept of little practical utility as a standard of measurement. At issue are problems tied to the vagaries stemming from difficulties in constructing a genotype-phenotype network. A reasonable short-term solution, as is argued in the next section, may be to estimate genetic value within a morphologically and genomically diverse clade in which the systems responsible for gene expression are similar such that the correspondence between genotype and phenotype among the taxa included in the analysis is stable.

A similar dire collection of problems plague estimates of organismal complexity, including how to account for the complexities of intercellular relationships during development, all aspects of organismal function, and hierarchical interactions bridging the entirety of organismal morphospace (Foote, 1993Go; Kauffman, 1993Go; McShea, 1996Go; McShea, 2000Go; McShea, 2001Go)

Rather than try to address the myriad substantive but widely debated issues related to measurements of genetic and organismal complexity, I will define a highly simplified system within which it seems feasible to conduct a test of the GOVO hypothesis.

Defining a system to test the GOVO hypothesis
As mentioned, if approached from its broadest dimensions, the measurement of genetic value across living taxa is extremely daunting as it requires comparison of organisms that have evolved vastly different mechanisms for expressing the information content of their genomes. As an example, the nematode, Caenorhabditis elegans, with about a thousand total cells and a very simple tube-like body plan has an estimated 19,000 genes, whereas the fruit fly, Drosophila melanogaster, with an arguably more complex body structure, including a distinctive head, appendages, and a relatively complex brain compared to C. elegans has only 14,000 genes (The C. elegans Sequencing Consortium, 1998Go; Adams et al., 2000Go) Despite the apparent discrepancy in gene content, some of which can be accounted for as a result of tandem duplications in C. elegans, D. melanogaster apparently expands the functional utility of its "limited" gene complement by extensive alternative splicing in a fashion unavailable to C. elegans. A more detailed exposition of the problems associated with calculating and comparing the genetic complexity of divergent taxa is provided in several provocative reviews (Szarthmáry et al., 2001Go; Hahn and Wray, 2002Go)

A necessary step toward simplification of the genetic complexity problem would be to include only those groups in which gene expression mechanisms are the same or very similar, i.e., in which counting genes or transcripts yields generally equivalent information about genetic complexity in relation to gene number. This approach requires the group to be sufficiently well characterized genetically to justify conclusions regarding the overall equivalence of its genetic expression systems in a background of extensive organismal and genomic level variation. Based on the evidence available from comparative genomics, it is clear some comparisons may be more appropriate than others. In the case of C. elegans and D. melanogaster the differences in structure of "genes" and the resultant generation of splicing products from a single "gene" between the coelomate arthropods and pseudocoelomate nematodes preclude estimates of genetic value based on counting genes. However, comparative genomics suggests that chordates, show a considerable similarity in general mechanisms for gene expression and its control (Coulson and Ouzounis, 2003Go). An added advantage of this group is the availability of complete genomic sequences for three very divergent members of this group, the human (Homo sapiens), Japanese pufferfish (Takifugu rubripes) and sea squirt (Ciona intestinalis), and a growing body of sequence information for other key taxa including amphioxus (Branchiostoma), sea lamprey (Petromyzon marinus), horned shark (Heterodontus francisi), African clawed frog (Xenopus laevis), zebrafish (Danio rerio) and other mammals and fish species. Given the similarity of gene expression systems among the chordates and a rapidly increasing body of genomic information, there is reason to believe that comparisons of gene numbers among individual taxa could provide meaningful information about their genetic complexity relative to one another.

Although it is tempting to consider deriving estimates of genetic complexity from large-scale genome comparisons, this approach is not yet practical because gene annotation is incomplete even for the most advanced systems (Heilig et al., 2003Go) However, large-scale genome comparisons have revealed the existence of considerable synteny and the occurrence of extensive paralogons (paralogous chromosome regions) among evolutionarily divergent taxa within the chordates (McLysaght et al., 2002Go). The occurrence of paralogons in individual members of evolutionarily divergent taxa provides a potential resource for measuring changes in genetic value in relation to large-scale genome duplication events without the complications of annotation that presently plague genome level comparisons. Since paralogons are composed of collections of related genes at different chromosomal locations that arise through the process of gene duplication, it follows that repeated genome duplications create paralogons with an exponentially increasing number of genes. In the absence of gene loss, simple counting of genes within a paralogon can provide a measure of the extent of duplication. Under the assumption that gene numbers relate to genetic value, at least in the cases where gene expression systems are equivalent, counting the number of genes within a paralogon can provide an estimate of genetic value that is intrinsically linked to the evolutionary ancestry of the organisms in which duplications have occurred. While quantifying genetic value through comparison of the number of genes in paralogons does not provide absolute genetic values, it does provide relative genetic values. By counting and comparing the number of expressed genes in paralogons from organisms that represent lineages with divergent genome duplication histories and organismal complexities, it is feasible to assess the relationship between genetic value and organismal complexity and to conduct a test of the GOVO hypothesis.

Measurements of organismal complexity in relation to the genome and evolution are confounded by a host of factors, including definitions of complexity, agreement regarding the organismal constituents that contribute to complexity, as well as the appropriate hierarchical organizational levels to consider (Gould, 1996Go).

In light of the unresolved nature of these issues, it seems useful to consider certain simplifying assumptions about organismal complexity as a means to identify appropriate qualities to enable a test of the GOVO hypothesis. The most extreme simplification is to consider only the most drastic changes in organismal complexity as relevant landmarks for comparisons. Among multicellular eukaryotes such changes might include the origin of new embryological tissue layers, i.e., the emergence of the diploblastic, triploblastic and neural crest tissue layers and the structures derived from them, whereas within vertebrates it might include the evolution of a complex brain from a simplified cranial ganglion, an endoskeleton, paired appendages, and neurogenic placodes (Shimeld and Holland, 2000Go). In short, simplification of the complexity issue may best be addressed by adjusting the scale of the complexity measure to encompass only the most deeply rooted or profound innovations, provided there is information on their directionality; which is to say, whether they represent additions to existing structures, rearrangements, or reductive simplifications. By limiting measures of complexity to the most extreme examples, it is relatively straightforward to map the complexity changes in terms of directionality. Also, since the GOVO hypothesis predicts that profound changes in organismal complexity should under most circumstances occur in parallel with similar changes in genetic value, limiting observations to complexity changes characterized by extreme variation provides a favorable background for assessment of the relationship between genetic value and organism complexity.

An initial test of the GOVO hypothesis
The most extensively characterized paralogon within the chordates includes the genes that comprise the Hox gene family (McLysaght et al., 2002Go). An evolutionarily ancient gene family that encodes transcription factors required for the specification of tissue and cellular identity during embryogenesis, the clustered Hox genes represent a particularly compelling assemblage of genes for use in testing the GOVO hypothesis. The fundamental properties of the Hox genes that recommend them for testing the GOVO hypothesis include their stereotypical clustered organization throughout the vertebrates (Shubert et al., 1993Go), their extensive characterization within members of the chordate lineage (Amores, 2004), including species that represent divergent lineages with different genome duplication histories (Prince, 2002Go), including one (Branchiostoma) assumed to be representative of the basal clade within the group (Garcia-Fernandez and Holland, 1994Go), and their central role in morphological specification of major body axes and structures including appendages. It is also worthwhile noting that the clustered Hox genes do not appear to have undergone intra-cluster tandem duplications, recombination or gene conversion, each of which can confound the analysis of gene duplication history.

The selection of the chordates as an appropriate taxonomic group in which to investigate the relationship between genetic value and organismal complexity has been detailed earlier. To summarize, chordates utilize similar gene expression systems, which increases the likelihood that counting genes in divergent lineages provides similar information about genetic value; they exhibit considerable evolutionary divergence and marked organismal innovation, indicative of large-scale shifts in organismal complexity and useful for the simplified analytical paradigm developed for an initial test of the GOVO hypothesis; their Hox cluster paralogons, which serve as a genome proxy, are extensively characterized and therefore provides suitable genomic information for comparative purposes.

An especially useful property of the Hox cluster paralogon, is the preservation of the order of genes within the paralogon even among the most widely divergent chordates. Although individual clusters differ from one another because of gene loss following cluster duplication, a comparison of clusters within and among species provides strong evidence to infer that the ancestral condition in chordates is a string of 14 tightly linked genes (Powers and Amemiya, 2004Go). The remarkable feature of this organization is that the inferred composition of the ancestral cluster and the actual composition of the cluster in the extant representative of the presumed prototypical vertebrate ancestor are for all intents and purposes identical. In the context of the GOVO hypothesis, this result is taken to indicate that Branchiostoma can serve as the GOVO genetic value index against which descendent lineages can be compared. Further support for the use of Branchiostoma as an index relates to its relative organismal simplicity in comparison to vertebrates. Despite considerable effort, it has been impossible to identify a tissue in Branchiostoma that corresponds to the neural crest of vertebrates (Escriva et al., 2002Go). Moreover, Branchiostoma is missing a host of structures considered to be vertebrate innovations, including an elaborate segmented brain, an endoskeleton, neurogenic placodes and all the structures derived from neural crest tissue (Shimeld and Holland, 2000Go). In this sense, Branchiostoma serves as an ideal index for chordate organismal complexity as it clearly lacks a collection of deeply rooted vertebrate innovations, the most profound of which is the absence of neural crest tissue.

Genetic value and organism complexity comparisons: a further test of the GOVO hypothesis
Under the assumption that the single Branchiostoma Hox cluster is representative of the chordate ancestor of vertebrates and that cluster duplication events occurred by repeated trans duplications (a proxy for genome duplications), it is possible to predict the expected gene value for the various descendent vertebrate lineages under the assumption that the full complement of genes are retained after each duplication event. Table 1 shows the results of a comparison wherein the predicted value for the basal gnathostome is derived from the number of genes resulting from two complete duplications of the 14 gene ancestral cluster, while the observed value is inferred from a composite of the Hox cluster genes documented in extant actinopterygians and sarcopterygians (Amores et al., 1998Go, 2004; Stellwag, 1999Go; Meyer and Málaga-Trillo, 1999Go). Since the sarcopterygian lineage appears to have a stable four clusters with no indication of any additional clusters, their predicted gene value is the same as that for basal gnathostomes, whereas the observed value of 39 is derived from extant mammals and based on exhaustive sequence searches. The predicted value of 112 for actinopterygians is derived from the number of genes that would have resulted from three complete duplications of the 14 gene ancestral cluster based on the observation of an additional complete set of duplicated clusters in extant members of this lineage relative to the stem lineage of gnathostomes (Prince, 2002Go). The observed value of 47–52 for actinopterygians is taken from the actual number of genes documented after an extensive characterization of the Hox gene complement of zebrafish, Japanese pufferfish and the Southern pufferfish, respectively (Amores et al., 1998Go, 2004).


View this table:
[in this window]
[in a new window]
 
TABLE 1. Estimates of gene value for selected vertebrate lineages.

 
Comparisons of the gene values among these taxa reveal some intriguing relationships. Firstly, the predicted and observed values for basal gnathostomes and mammals are very similar, differing by only three genes out of a predicted maximum of 52 genes. This result suggests that the mammals, and presumably other tetrapods, exhibit a relatively stable Hox gene value with respect to their gnathostome ancestor, with a nearly negligible net gene value loss within the vertebrate lineage leading from ancestral gnathostomes to mammals. While it would be presumptuous to assume that the net gene loss leading from ancestral gnathostomes to mammals is significant, this result underscores the evident stability of gene value extending from ancestral gnathostomes to mammals. An additional observation is that the sarcopterygian lineage has retained a substantial 70% of the predicted gene value relative to its chordate ancestor, which can be interpreted to mean that the gene value optima from chordates through gnathostomes to mammals is quite conserved and indicative of a relatively "efficient" utilization of the expressed genome. This interpretation is dependent on the assumption that organismal complexity remained relative constant between basal gnathostomes and the most complex tetrapods and that the major complexity changes occurred prior to the inception of gnathostomes. In support of this interpretation is the fact that the major vertebrate innovations including the evolution of neural crest, the beginnings of an elaborate segmented brain, an endoskeleton, and neurogenic placodes actually occurred between the inception of vertebrates and the basal gnathostomes (Shimeld and Holland, 2000Go). Accordingly, it is consistent to conclude that the level of organismal complexity exhibited by basal gnathostomes could be similar to that of the most advanced members of the sarcopterygian lineage. Alternatively, if the organismal complexity of the most advanced tetrapods is significantly greater than that of basal gnathostomes, it would invalidate the GOVO hypothesis, indicative of the absence of an identifiable linkage between gene value and organismal complexity.

What then of the comparisons between the gene values of the sarcopterygian and actinopterygian lineages? Assuming that a genome duplication event occurred in the actinopterygian stem lineage after the divergence of bichir (Chiu et al., 2004Go), the predicted number of Hox genes under maximal conditions should be 112 (Table 1). However, the observed number in zebrafish, Japanese pufferfish, the smooth pufferfish and likely in related teleosts, ranges between 48 and 52, which is less than one-half the predicted number and very nearly the same as the inferred value for basal gnathostomes. This result shows that the gene values for basal gnathostomes are similar to those for the sarcopterygian and actinopterygian lineages.

Calculation of the ratio of observed to predicted gene values for the various lineages (Table 1), provides a number that hovers around 0.7 for all the lineages with the exception of the actinopterygians, which dips below 0.5. This dip in the ratio of observed to predicted gene values is indicative of extensive gene loss in the actinopterygians relative to the sarcopterygians, but is based on the assumption of a sharp rise in gene value corresponding to large-scale genome expansion due to a genome level duplication in early actinopterygian evolution. Again, provided that organismal complexity of these two divergent lineages is similar, the GOVO hypothesis predicts an excessive loss of gene value post-duplication in actinopterygians relative to the sarcopterygian lineage in which a parallel genome duplication did not occur. While it is difficult to make definitive statements about differences in organismal complexity in comparisons between the actinopterygians and sarcopterygian lineages, it is certain that the degree of complexity difference is far less than that between ancestral chordates and basal gnathostomes and may even be negligible. If this interpretation is correct, there is reason to suspect that the GOVO hypothesis provides a useful perspective for understanding the relationship between genome and organismal complexity.

Gene value and species diversity
A central question posed at the outset was whether large-scale genome changes, such as occur during polyploidization, likely precede or parallel advancements in organism complexity, promote species diversification, or do both or neither? Based on the forgoing discussion, a case can be made that large-scale genome changes occur concurrently with advancements in organismal complexity but also in their absence. When the genome changes occur in the absence of parallel changes in organismal complexity it appears that gene value is lost concordantly. Is it possible that some of the genome "potential" resulting from genome duplications is devoted to species diversification, thereby leading to the equivalent of a distributed form of organismal complexity (Eble, 2003Go)? The answer to this question requires information about the rate and extent of speciation in divergent lineages with similar level of organismal complexity but different histories of genome expansion, a condition which it has been argued here exists between the actinopterygian and sarcopterygian lineages.

Based on estimates, the number of extant species derived from the sarcopterygian lineage is approximately 21,000–22,000 compared to about 23–25,000 for actinopterygians (Carroll, 1988Go; Nelson, 1994Go). When viewed from the perspective of total numbers, the differences appear rather small, suggesting that the genomic duplication within the stem lineage of actinopterygians would not be required to support such minor differences in species number. However, evidence presented in a recent review by Taylor et al. (2001)Go indicates that polyploidized members of the families Salmonidae and the Catostomidae, exhibit higher degrees of speciation than members of the same family that remain diploid (Nelson, 1994Go). As proposed by Lynch and colleagues (Lynch and Conery, 2000Go; Lynch and Force, 2000Go) this as an example of "divergent resolution," wherein the differential post-duplication loss of genes creates an effective mechanism for population isolation that promotes speciation. Their interpretation provides a mechanistic explanation consistent with the GOVO hypothesis, which predicts that gene value will decline following large-scale duplication events in the absence of concurrent advancement in organism complexity. While it is difficult to reconcile the different interpretations of the effects of genome level duplications in the example of the actinopteryian stem lineage duplication compared to the salmonid or catostomid duplications, it remains a distinct possibility that genome level duplications are linked to species diversification.

If we accept that large-scale genetic duplications are a hallmark of vertebrate evolution (Shimeld and Holland, 2000Go) and that vertebrates have a rich history of evolutionary innovation and speciation, it would be consistent to conclude that genome duplications provided a springboard to loft chordates into a trajectory leading to the evolution of novelty, increased organismal complexity, and extensive species diversification. However, it is worth reiterating that the GOVO hypothesis predicts that the evolutionary trajectory of novelty, complexity and diversity linked to genome duplication may be diminished or thwarted entirely by organismal constraints.


    FOOTNOTES
 
1 From the Symposium Patterns and Processes in the Evolution of Fishes presented at the Annual Meeting of the Society for Integrative and Comparative Biology, 4–8 January 2003, at Toronto, Canada. Back

2 E-mail: stellwage{at}mail.ecu.edu Back


    References
 TOP
 SYNOPSIS
 INTRODUCTION
 References
 
Adami, C. 2002. What is complexity? Bioessays, 24:1085-1094.[CrossRef][Web of Science][Medline]

Adams, M. D., S. E. Celnicker, R. A. Holt, C. A. Evans, and J. D. Gocayne.et al. 2000. The genome sequence of Drosophila melanogaster. Science, 287:2185-2195.[Abstract/Free Full Text]

Amores, A., A. Force, Y. Yan, L. Joly, C. T. Amemiya, A. Fritz, R. K. Ho, J. Langeland, V. Prince, Y-L Wang, M. Westerfield, M. Ekker, and J. H. Postlethwait. 1998. Zebrafish Hox clusters and vertebrate genome evolution. Science, 282:1711-1714.[Abstract/Free Full Text]

Amores, A., T. Suzuki, Y. L. Yan, I. Pomery, A. Singer, C. Amemiya, and J. H. Postlethwait. 2004. Developmental roles of pufferfish Hex clusters and genome evolution in ray-fin fish. Genome Res, 14:1-10.[Abstract/Free Full Text]

Bailey, G. S., R. T. Poulter, and P.A. Stockwell. 1978. Gene duplication in tetraploid fish: Model for gene silencing at unlinked duplicated loci. Proc. Natl. Acad. Sci. USA, 75:5575-5579.[Abstract/Free Full Text]

Betran, E., and M. Long. 2002. Expansion of genome coding regions by acquisition of new genes. Genetica, 115:65-80.[CrossRef][Web of Science][Medline]

Bird, A. P. 1995. Gene number, noise reduction, and biological complexity. Trends Genet, 11:94-100.[CrossRef][Web of Science][Medline]

Carroll, R. L. 1988. Vertebrate paleontology and evolution. W. H. Freeman and Company, New York.

Chiu, C. H., K. Dewar, G. P. Wagner, K. Takahashi, F. Ruddle, C. Ledje, P. Bartsch, J. L. Scemama, E. J. Stellwag, C. Fried, S. J. Prohaska, P. F. Stadler, and C. T. Amemiya. 2004. Bichir HoxA cluster sequence reveals surprising trends in ray-finned fish genomic evolution. Genome Res, 14:11-17.[Abstract/Free Full Text]

Coulson, R. M., and C. A. Ouzounis. 2003. The phylogenetic diversity of eukaryotic transcription. Nucl. Acids Res, 31:653-660.[Abstract/Free Full Text]

Durrand, D. 2003. Vertebrate evolution: Doubling and shuffling with a full deck. Trends Genet, 19:2-5.[CrossRef][Web of Science][Medline]

Eble, G. J. 2003. The macroevolution of phenotypic integration. In M. Pigliucci and K. Preston (eds.), The evolutionary biology of complex phenotypes, pp. 253–273. Oxford University Press, Oxford.

Escriva, H., N. D. Holland, H. Gronemeyer, V. Laudet, and L. Z. Holland. 2002. The retinoic acid signaling pathway regulates anterior/posterior patterning in the nerve cord and pharynx of amphioxus, a chordate lacking neural crest. Development, 129:2905-2916.

Ferrier, D. E., C. Minguillon, P.W. H. Holland, and J. Garcia-Fernanadez. 2000. The amphioxus Hox cluster: Deuterostome posterior flexibility and Hox 14. Evol. Dev, 2:284-293.[CrossRef][Web of Science][Medline]

Foote, M. 1993. Discordance and concordance between morphological and taxonomic diversity. Paleobiology, 19:185-204.[Abstract]

Furlong, R. F., and P. W. H. Holland. 2002. Were vertebrates octoploid? Philos. Trans. R. Soc. London B Biol. Sci, 357:531-544.[Abstract/Free Full Text]

Garcia-Fernandez, J., and P. W. H. Holland. 1994. Archetypal organization of the amphioxus Hox gene cluster. Nature, 370:563-566.[CrossRef][Medline]

Gibson, T., and J. Spring. 2000. Evidence in favor of ancient octapolidy in the vertebrate genome. Biochem. Soc. Trans, 28:259-264.[Medline]

Gould, S. J. 1996. Full house. Harmony Books, New York.

Gu, X., Y. Wang, and J. Gu. 2002. Age distribution of human gene families shows significant roles of both large- and small-scale duplication in vertebrate evolution. Nat. Genet, 31:205-209.[CrossRef][Web of Science][Medline]

Hahn, M. H., and G. Wray. 2002. The g-value paradox. Evol. Dev, 4:73-75.[CrossRef][Web of Science][Medline]

Hartley, S. E. 1987. The chromosomes of salmonid fishes. Biol. Rev, 62:197-214.

Heilig, R., R. Eckenberg, J. L. Petit, N. Fonknechten, C. Da Silva, and R. Cattollico.et al. 2003. The DNA sequence and analysis of human chromosome 14. Nature, 421:601-607.[CrossRef][Medline]

Holland, P. W. 1999. Gene duplication: Past, present, future. Semin. Cell and Dev. Biol, 10:541-547.

Holland, P. W. H., and J. Garcia-Fernandez. 1996. Hox genes and chordate evolution. Dev. Biol, 173:382-395.[CrossRef][Web of Science][Medline]

Holland, P. W. H., J. Garcia-Fernandez, J. Williams, and A. Sidow. 1994. Gene duplications and the origins of vertebrate development. Dev. Suppl. 125–133.

Hughes, A. L. 1999. Phylogenies of developmentally important proteins do not support the hypothesis of two rounds of genome duplication early in vertebrate history. J. Mol. Evol, 48:565-576.[CrossRef][Web of Science][Medline]

Hughes, A. L., J. daSilva, and R. Friedman. 2001. Ancient genome duplications did not structure the human Hox-bearing chromosomes. Genome Res, 11:771-780.[Abstract/Free Full Text]

Kauffman, S. A. 1993. The origins of order: Self-organization and selection in Evolution. Oxford University Press, Oxford.

Kumar, S., and B. Hedges. 1998. A molecular timescale for vertebrate evolution. Nature, 392:917-920.[CrossRef][Medline]

Lynch, M., and J. S. Conery. 2000. The evolutionary fate and consequences of duplicate genes. Science, 290:1151-1155.[Abstract/Free Full Text]

Lynch, M., and A. Force. 2000. The probability of duplicate gene preservation by subfunctionalization. Genetics, 154:459-473.[Abstract/Free Full Text]

McLysaght, A., K. Hokamp, and K. H. Wolfe. 2002. Extensive genomic duplication during early chordate evolution. Nat. Genet, 31:128-129.[Web of Science][Medline]

McShea, D. W. 1996. Metazoan complexity and evolution: Is there a trend? Evolution, 50:477-492.[CrossRef][Web of Science]

McShea, D. W. 2000. Functional complexity in organisms: Parts as proxies. Biol. Philos, 15:641-668.[CrossRef]

McShea, D. W. 2001. The hierarchical structure of organisms: A scale and documentation of a trend in the maximum. Paleobiology, 27:405-423.[Abstract/Free Full Text]

Meyer, A., and M. Schartl. 1999. Gene and genome duplication in vertebrates: The one-to-four (to-eight in fish) rule and the evolution of novel gene functions. Curr. Opin. Cell Biol, 11:699-704.[CrossRef][Web of Science][Medline]

Meyer, A., and E. Málaga-Trillo. 1999. Vertebrate genomics: More fishy tales about Hox genes. Current Biol, 9:R210-R213.[CrossRef][Web of Science][Medline]

Nelson, J. S. 1994. Fishes of the world. 3rd ed. John Wiley and Sons, New York.

Ohno, S. 1970. Evolution by gene duplication. Berlin-Heidelberg-New York. Springer-Verlag.

Powers, T. P., and C. T. Amemiya. 2004. Evidence for a Hox14 paralog group in vertebrates. Current Biol, 14:R183-184.[CrossRef][Web of Science][Medline]

Prince, V. 2002. The Hox paradox: More complex(es) than imagined. Dev. Biol, 249:1-15.[CrossRef][Web of Science][Medline]

Shimeld, S. M., and P. W. H. Holland. 2000. Vertebrate innovations. Proc. Natl. Acad. Sci. USA, 97:4449-4452.[Abstract/Free Full Text]

Shubert, F., K. Nieselt-Struwe, and P. Gruss. 1993. The Antenneapedia-type homeobox genes have evolved from three precursors early in metazoan evolution. Proc. Natl. Acad. Sci. U.S.A, 90:143-147.[Abstract/Free Full Text]

Sidow, A. 1996. Gen(om)e duplications in the evolution of early vertebrates. Curr. Opin. Genet. Dev, 6:715-722.[CrossRef][Web of Science][Medline]

Stellwag, E. J. 1999. Hox gene duplication in fish. Semin. Cell Dev. Biol, 10:531-540.[CrossRef][Web of Science][Medline]

Szathm'ry, E., F. Jordan, and C. Pal. 2001. Molecular biology and evolution. Can genes explain biological complexity? Science, 292:1315-1316.[Free Full Text]

Taylor, J. S., I. Braasch, T. Frickey, A. Meyer, and Y. Van de Peer. 2003. Genome duplication, a trait shared by 22000 species of ray-finned fish. Genome Res, 13:382-390.[Abstract/Free Full Text]

Taylor, J. S., Y. Ven de Peer, I. Braasch, and A. Meyer. 2001. Comparative genomics provides evidence for an ancient genome duplication event in fish. Philos. Trans. R. Soc. London B Biol. Sci, 356:1661-1679.[Abstract/Free Full Text]

The C. elegans Sequencing Consortium., 1998. Sequence and analysis of the genome of C. elegans. Science, 282:2012-2018.[Abstract/Free Full Text]

Vandepoele, K., W. de Vos, J. S. Taylor, A. Meyer, and Y. van de Peer. 2004. Major events in the genome evolution of vertebrates: Paranome age and size differ considerably between ray-finned fishes and land vertebrates. Proc. of the Natl. Acad. of Sci. U.S.A, 101:1638-1643.

Weinberger, E. D. 2002. A theory of pragmatic information and its application to the quasi-species model of biological evolution. Biosystems, 66:105-119.[Medline]

Wolfe, K. H. 2001. Yesterday's polyploids and the mystery of diploidization. Nat. Rev. Genet, 2:333-341.[CrossRef][Web of Science][Medline]

Zuckerkandl, E. 2002. Why so many noncoding nucleotides? The eukaryote genome as an epigenetic machine. Genetica, 115:105-129.[CrossRef][Web of Science][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (5)
Right arrow Request Permissions
Google Scholar
Right arrow Articles by Stellwag, E. J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?