The phylogenetic relationships and divergence times of 39 drosophilid species were studied by using the coding region of the Adh gene. Four genera--Scaptodrosophila, Zaprionus, Drosophila, and ...Scaptomyza (from Hawaii)--and three Drosophila subgenera--Drosophila, Engiscaptomyza, and Sophophora--were included. After conducting statistical analyses of the nucleotide sequences of the Adh, Adhr (Adh-related gene), and nuclear rRNA genes and a 905-bp segment of mitochondrial DNA, we used Scaptodrosophila as the outgroup. The phylogenetic tree obtained showed that the first major division of drosophilid species occurs between subgenus Sophophora (genus Drosophila) and the group including subgenera Drosophila and Engiscaptomyza plus the genera Zaprionus and Scaptomyza. Subgenus Sophophora is then divided into D. willistoni and the clade of D. obscura and D. melanogaster species groups. In the other major drosophilid group, Zaprionus first separates from the other species, and then D. immigrans leaves the remaining group of species. This remaining group then splits into the D. repleta group and the Hawaiian drosophilid cluster (Hawaiian Drosophila, Engiscaptomyza, and Scaptomyza). Engiscaptomyza and Scaptomyza are tightly clustered. Each of the D. repleta, D. obscura, and D. melanogaster groups is monophyletic. The splitting of subgenera Drosophila and Sophophora apparently occurred about 40 Mya, whereas the D. repleta group and the Hawaiian drosophilid cluster separated about 32 Mya. By contrast, the splitting of Engiscaptomyza and Scaptomyza occurred only about 11 Mya, suggesting that Scaptomyza experienced a rapid morphological evolution. The D. obscura and D. melanogaster groups apparently diverged about 25 Mya. Many of the D. repleta group species studied here have two functional Adh genes (Adh-1 and Adh-2), and these duplicated genes can be explained by two duplication events.
In recent years, copy number variation (CNV) of DNA segments has become a hot topic in the study of genetic variation, and a large amount of CNVs has been uncovered in human populations. The CNVs ...involving the smallest units of DNA segments are microsatellite DNAs, and the evolutionary change of microsatellite DNAs is believed to occur mostly by the increase or decrease of one repeat unit at a time in a more or less neutral fashion. If we note that eukaryotic genomes contain millions of microsatellite loci, this pattern of nucleotide change is expected to generate random changes of genome size, that is, genomic drift, and will provide a neutral model of CNV evolution. We therefore investigated the amount of variation of the total number of repeats (TNR) per individual concerned with 145 microsatellite loci in three human populations, Africans, Europeans, and Asians. It was shown that the TNR follows the normal distribution in all three populations and that the extent of variation of TNR is more than 50% greater in Africans than in Europeans and Asians as expected from the hypothesis of African origin of modern humans. If we consider all microsatellite loci in the human genome and compute the variation of the total number of nucleotides involved (TNN), it is possible to study the contribution of microsatellite loci to the genome size variation. This study has shown that the genome sizes of human individuals are affected considerably by genomic drift of microsatellite DNA alone. This pattern of evolution is similar to that of olfactory receptor (OR) genes previously studied in human populations and support the idea that the number of OR genes has evolved in a more or less neutral fashion. However, this conclusion does not necessarily apply to the genomewide CNVs of various DNA segments, and it appears that long variant DNA fragments are deleterious and under purifying selection.
Recently, a large amount of copy number variation (CNV) of DNA segments has been uncovered in human populations. The CNVs involving the smallest units of DNA segments are microsatellite DNAs, and the ...evolutionary change of microsatellite DNAs is believed to occur mostly by the increase or decrease of one repeat unit at a time in a more or less neutral fashion. If we note that eukaryotic genomes contain millions of microsatellite loci, this pattern of nucleotide change is expected to generate random changes of genome size, i.e., genomic drift, and will provide a neutral model of CNV evolution. We therefore investigated the amount of variation of the total number of repeats (TNR) per individual concerned with 145 microsatellite loci in three human populations, Africans, Europeans, and Asians. It was shown that the TNR follows the normal distribution in all three populations and that the extent of variation of TNR is more than 50% greater in Africans than in Europeans and Asians as expected from the hypothesis of African origin of modern humans. This study has also shown that the genome sizes of human individuals are affected considerably by genomic drift of microsatellite DNA alone.
The relative efficiencies of different protein-coding genes of the mitochondrial genome and different tree-building methods in recovering a known vertebrate phylogeny (two whale species, cow, rat, ...mouse, opossum, chicken, frog, and three bony fish species) was evaluated. The tree-building methods examined were the neighbor joining (NJ), minimum evolution (ME), maximum parsimony (MP), and maximum likelihood (ML), and both nucleotide sequences and deduced amino acid sequences were analyzed. Generally speaking, amino acid sequences were better than nucleotide sequences in obtaining the true tree (topology) or trees close to the true tree. However, when only first and second codon positions data were used, nucleotide sequences produced reasonably good trees. Among the 13 genes examined, Nd5 produced the true tree in all tree-building methods or algorithms for both amino acid and nucleotide sequence data. Genes Cytb and Nd4 also produced the correct tree in most tree-building algorithms when amino acid sequence data were used. By contrast, Co2, Nd1, and Nd41 showed a poor performance. In general, large genes produced better results, and when the entire set of genes was used, all tree-building methods generated the true tree. In each tree-building method, several distance measures or algorithms were used, but all these distance measures or algorithms produced essentially the same results. The ME method, in which many different topologies are examined, was no better than the NJ method, which generates a single final tree. Similarly, an ML method, in which many topologies are examined, was no better than the ML star decomposition algorithm that generates a single final tree. In ML the best substitution model chosen by using the Akaike information criterion produced no better results than simpler substitution models. These results question the utility of the currently used optimization principles in phylogenetic construction. Relatively simple methods such as the NJ and ML star decomposition algorithms seem to produce as good results as those obtained by more sophisticated methods. The efficiencies of the NJ, ME, MP, and ML methods in obtaining the correct tree were nearly the same when amino acid sequence data were used. The most important factor in constructing reliable phylogenetic trees seems to be the number of amino acids or nucleotides used.
Microsatellite DNA loci or short tandem repeats (STRs) are abundant in eukaryotes and are often used for constructing the phylogenetic trees of closely related populations or species. These ...phylogenetic trees are usually constructed by using some genetic distance measures based on allele frequency data, and there are many distance measures that have been proposed for this purpose. In the past the efficiencies of these distance measures in constructing phylogenetic trees have been studied mathematically or by computer simulations. Recently, however, 783 STR loci have been compiled from various human populations. We have therefore investigated the relative efficiencies of different distance measures such as DA, standard genetic distance Ds, FST*, FST/(1-FST) and (δμ)2, in constructing phylogenetic trees, as well as the effect on P1s of loci of different nucleotide repeat sizes, using these data.
According to a widely held view, the more than 300 species of haplochromine cichlid fishes in Lake Victoria (LV), East Africa, originated from a single founder species in less than 12 000 years. This ...view, however, does not follow from the published geological and molecular evidence. The former does indeed suggest that the LV basin dried out less than 15 000 years ago, but it does not provide any information about the species that re-colonized the new lake or that remained in the rivers draining the area. The molecular evidence is inconclusive with respect to the origin of the LV haplochromines because cichlids from critical regions around LV were not adequately sampled; and as far as the age of the LV haplochromines is concerned, it in fact led to an estimate of 250 000-750 000 years old. In the present study, mitochondrial DNA (control region) variation was determined by heteroduplex and sequencing analyses of more than 670 specimens collected at widely distributed East African riverine and lacustrine localities. The analyses revealed the existence of seven haplogroups (I-VII) distinguishable by characteristic substitutions. All endemic LV samples tested fell into one of these haplogroups (V) which, however, was also found to be present at various other localities, both riverine and lacustrine, outside LV . Within this haplogroup, four subgroups (VA through VD) could be distinguished, two of which (VB and VC) were represented in LV and at other localities. The great majority of the LV haplochromine species could be classified as belonging to the VC subgroup, which was found only in LVand in the rivers draining into it. Hence, while the endemic haplochromine species of LVcould not have originated from a single founding population, the lake does harbour a large species flock which probably arose in situ.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
Entomological surveys were done as a part of a malaria control project at nine sites in East Sepik Province. Adult mosquitoes were collected by the humanbait method and the number of anopheline ...mosquitoes by 177 night×people collections was 3,631. Species identification was done by the PCR-RFLP method at the ITS 2 region of the rDNA. Five kinds of anopheline mosquitoes, i.e., Anopheles farauti 1, An. farauti 2, An. farauti 4, An. koliensis and An. punctulatus were identified. Anopheles koliensis was the most common, then An. punctulatus, An. farauti 1, and followed by An. farauti 4 and An. farauti 2. Anopheles farauti 1 was distributed mainly in coastal areas and An. farauti 2 in inland areas. Anopheles farauti 4 was found only in Kairiru Island and this is the first report of the distribution of this species in an island. Anopheles koliensis and An. punctulatus were widely distributed throughout the districts; however, the distribution of An. koliensis was sporadic. Negative correlations were found between the collected numbers of An. farauti 1 and An. farauti 2 and those of An. koliensis and An. punctulatus.
The relative efficiencies of the maximum-likelihood (ML), neighbor-joining (NJ), and maximum-parsimony (MP) methods in obtaining the correct topology and in estimating the branch lengths for the case ...of four DNA sequences were studied by computer simulation, under the assumption either that there is variation in substitution rate among different nucleotide sites or that there is no variation. For the NJ method, several different distance measures (Jukes-Cantor, Kimura two-parameter, and gamma distances) were used, whereas for the ML method three different transition/transversion ratios (R) were used. For the MP method, both the standard unweighted parsimony and the dynamically weighted parsimony methods were used. The results obtained are as follows: (1) When the R value is high, dynamically weighted parsimony is more efficient than unweighted parsimony in obtaining the correct topology. (2) However, both weighted and unweighted parsimony methods are generally less efficient than the NJ and ML methods even in the case where the MP method gives a consistent tree. (3) When all the assumptions of the ML method are satisfied, this method is slightly more efficient than the NJ method. However, when the assumptions are not satisfied, the NJ method with gamma distances is slightly better in obtaining the correct topology than is the ML method. In general, the two methods show more or less the same performance. The NJ method may give a correct topology even when the distance measures used are not unbiased estimators of nucleotide substitutions. (4) Branch length estimates of a tree with the correct topology are affected more easily than topology by violation of the assumptions of the mathematical model used, for both the ML and the NJ methods. Under certain conditions, branch lengths are seriously overestimated or underestimated. The MP method often gives serious underestimates for certain branches. (5) Distance measures that generate the correct topology, with high probability, do not necessarily give good estimates of branch lengths. (6) The likelihood-ratio test and the confidence-limit test, in Felsenstein's DNAML, for examining the statistical of branch length estimates are quite sensitive to violation of the assumptions and are generally too liberal to be used for actual data. Rzhetsky and Nei's branch length test is less sensitive to violation of the assumptions than is Felsenstein's test. (7) When the extent of sequence divergence is < or = 5% and when > or = 1,000 nucleotides are used, all three methods show essentially the same efficiency in obtaining the correct topology and in estimating branch lengths.
Concatenated sequences of all protein-coding genes in mitochondria recovered a known phylogeny of 11 vertebrate species correctly with statistical significance. However, when it was rooted by ...lampreys or sea urchins, the root of the vertebrate tree was placed between the mammal cluster and the chicken-frog-fish cluster or between the mammal-chicken cluster and the frog-fish cluster, depending on the tree-making method used. Although the frog-fish or chicken-frog-fish cluster was biologically incorrect, it was again supported with a significantly high bootstrap value. In this study, we investigated the reasons why this happened. It has been suggested that an incorrect phylogeny may be constructed due to a change of amino acid composition in different lineages or due to homoplasies at sites with hydrophobic amino acids. However, our results indicated that these were not the causes of the incorrect rooting of the vertebrate tree. Rather, it was important to take into account an extensive rate variation across sites and different probabilities of substitution among different amino acids. The substitution rates for mitochondrial sequences vary considerably for different vertebrate lineages. In such a case, it is known to be important to use the model that reflects the actual substitution probability to obtain a correct tree topology. The correct rooting of the vertebrate tree was recovered when rate variation across sites was properly accounted for.
Although African populations have been shown to be most divergent from any other human populations, it has been difficult to establish the root of the phylogenetic tree of human populations since the ...rate of evolutionary change may vary from population to population owing to the fluctuation of population size and other factors. However, the root can be determined by using the chimpanzee as an outgroup and by employing proper statistical methods. Using this strategy, we constructed phylogenetic trees of human populations for five different sets of gene frequency data. The data sets used were two sets of microsatellite loci data (25 and 8 loci, respectively), restriction fragment length polymorphism (RFLP) data (79 loci), protein polymorphism data (15 loci), and Alu insertion frequency data (4 loci). All these data sets showed that the root is located in the branch connecting African and non-African populations, and in the four data sets the root was established at a significant level. These results indicate that Africans are the first group of people that split from the rest of the human populations.