Deoxyribonucleic Acid (DNA)





Replication of DNA

The genetic code

Expression of genetic information

Genetic engineering and recombinant DNA


Deoxyribonucleic acid (DNA) is a natural polymer that encodes the genetic information required for the growth, development, and reproduction of an organism. Found in all cells, it consists of chains of units called nucleotides. Each nucleotide unit contains three components: the sugar deoxyribose, a phosphate group, and a nitrogen-containing ring structure called a base. There are four different bases in DNA: adenine, cytosine, guanine, and thymine.

DNA molecules are very long and threadlike. They consist of two polymeric strands twisted about each other into a spiral shape known as a double helix, which resembles a twisted ladder. In eukaryotic cells, DNA is found within the cell nucleus in the chromosomes, which are extremely condensed structures in which DNA is associated with proteins. Each species contains a characteristic number of chromosomes in its cells. In humans, every cell contains 46 chromosomes (except for egg and sperm cells, which contain only 23). The total genetic information in a cell is called its genome. In prokaryotic cells such as bacteria, DNA is not contained within a specialized nuclear membrane, but rather is dispersed in the interior substance of the cell (cytoplasm).

The fundamental units of heredity are genes. A gene is a segment of a DNA molecule that encodes the information necessary to make a specific protein. The many proteins encoded by DNA contribute to a cell's structure and chemical activities.

DNA not only encodes the blueprints for cellular proteins but also the instructions for when and where they will be made. For example, the oxygen carrier hemoglobin is made in red blood cells but not in nerve cells, though both contain the same total genetic content. Thus, DNA also contains the information necessary for regulating how its genetic messages are used.

The sequencing of the human genome has determined that a human cell contains approximately 30,000 genes, far fewer than the previously estimated 50,000 to 100,000. Except in the case of identical twins, a comparison of the genes from different individuals always reveals a number of differences. Therefore, each person is genetically unique. This is the basis of DNA fingerprinting, a forensic procedure used to match DNA collected from a crime scene with that of a suspect, and of the use of DNA to establish who is the biological parent of a child.

Genes direct the function of all organs and systems in the body. In some cases, defects in the DNA of just one gene can cause a genetic disorder that results in disease because the protein encoded by the defective gene is abnormal. The abnormal hemoglobin produced by people afflicted with sickle cell anemia is an example. Defects in certain genes called oncogenes, which regulate growth and development, give rise to cancer. Therefore, defects in DNA can affect the two kinds of genetic information it carries: messages directing the manufacture of proteins, and information regulating the expression, or carrying out, of these messages.


Prior to the discovery of the nucleic acids, the Austrian monk Gregor Mendel (1822-1884) worked out the laws of inheritance by the selective breeding of pea plants. As early as 1865 he proposed that some then-undefined factors from each parent were responsible for the inheritance of certain characteristics in plants. The Swiss biochemist Friedrich Miescher (1844-1895) discovered the nucleic acids in 1868 in nuclei isolated from pus cells scraped from surgical bandages. However, research on the chemical structure of nucleic acids lagged until new analytical techniques became available in the mid-twentieth century.

Despite knowledge of the chemical structure of nucleotides and how they were linked together to form DNA, the possibility that DNA was the genetic material was regarded as unlikely. As late as the mid-twentieth century, proteins were thought to be the molecules of heredity because they appeared to be the only cellular components diverse enough to account for the large variety of genes. In 1944, Oswald Avery (1877-1955) and his colleagues showed that non-pathogenic strains of pneumococcus, the bacterium that causes pneumonia, could become pathogenic (disease-causing) if treated with a DNA-containing extract from heat-killed pathogenic strains. Based on this evidence, Avery concluded that DNA was the genetic material. However, widespread acceptance of DNA as the bearer of genetic information did not come until a report by other workers in 1952 that DNA, not protein, enters a bacterial cell infected by a virus. This showed that the genetic material of the virus was contained in its DNA, confirming Avery's hypothesis.

In 1953, James Watson (1928-) and Francis Crick (1916-2004) proposed their double helix model for the three-dimensional structure of DNA. They correctly deduced that the genetic information was encoded in the form of the sequence of nucleotides in the molecule. With their landmark discovery began an era of molecular genetics in biology. Eight years later investigators cracked the genetic code. They found that specific trinucleotide sequences (sequences of three nucleotides) are codes for each of 20 amino acids, the building blocks of proteins.

In 1970 scientists found that bacteria contained enzymes that recognize a particular sequence of 4-8 nucleotides and will always cut DNA at or near that sequence to yield specific (rather than random), consistently reproducible DNA fragments. These enzymes were dubbed restriction enzymes. Two years later it was found that the bacterial enzyme DNA ligase could be used to rejoin these fragments. This permitted scientists to construct what was termed recombinant DNA: DNA composed of segments from two different sources, even from different organisms. With the availability of these tools, genetic engineering became possible and biotechnology began.
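The cutting behavior of a restriction enzyme can be sketched in a few lines of Python. This is a simplified, single-strand illustration only: it assumes the enzyme EcoRI, which recognizes GAATTC and cuts after the first G. The enzyme choice, the function name `digest`, and the test sequence are assumptions made for the example, not details from the text above.

```python
def digest(dna: str, site: str = "GAATTC", cut_offset: int = 1) -> list[str]:
    """Cut one strand of `dna` at every occurrence of `site`.

    cut_offset is the number of bases of the site that stay on the
    upstream fragment (1 for EcoRI, which cuts G^AATTC).
    Simplified model: only one strand is represented.
    """
    fragments = []
    start = 0
    pos = dna.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(dna[start:cut])
        start = cut
        pos = dna.find(site, pos + 1)
    fragments.append(dna[start:])  # whatever remains after the last cut
    return fragments


sequence = "TTGAATTCAAGGGAATTCTT"  # hypothetical sequence with two sites
print(digest(sequence))  # ['TTG', 'AATTCAAGGG', 'AATTCTT']
```

Because the cut position depends only on the recognition sequence, the same input always yields the same fragments, which is what makes the fragments "consistently reproducible."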

By 1984 the development of DNA fingerprinting allowed forensic chemists to compare DNA samples from a crime scene with those of suspects. The first conviction using this technique came in 1987. Three years later doctors first attempted to treat a patient unable to produce a vital immune protein using gene therapy. This technique involves inserting a portion of DNA into a patient's cells to correct a deficiency in a particular function. The Human Genome Project also began in 1990. The aim of this project is to determine the nucleotide sequence in DNA of the entire human genome, which consists of about three billion nucleotide pairs. In 2001, researchers announced the completion of the sequencing of a human genome.


Deoxyribose, the sugar component in each nucleotide, is so named because it has one fewer oxygen atom than ribose, which is present in ribonucleic acid (RNA). Deoxyribose contains five carbon atoms, four of which lie in a ring along with one oxygen atom. The fifth carbon atom is linked to a specific carbon atom in the ring. A phosphate group is always linked to deoxyribose via a chemical bond between an oxygen atom in the phosphate group and a carbon atom in deoxyribose. A base is linked to deoxyribose by a chemical bond between a nitrogen atom in the base and a specific carbon atom in the deoxyribose ring.

The nucleotide components of DNA are connected to form a linear polymer in a very specific way. A phosphate group always connects the sugar component of a nucleotide with the sugar component of the next nucleotide in the chain. Consequently, the first nucleotide bears an unattached phosphate group, and the last nucleotide has a free hydroxyl group. Therefore, DNA is not the same at both ends. This directionality plays an important role in the replication of DNA.

DNA molecules contain two polymer chains or strands of nucleotides and so are said to be double-stranded. (In contrast, RNA is typically single-stranded.) Their shape resembles two intertwined spiral staircases in which the alternating sugar and phosphate groups of the nucleotides compose the sidepieces. The steps consist of pairs of bases, each attached to the sugars on their respective strands. The bases are held together by weak attractive forces called hydrogen bonds. The two strands in DNA are antiparallel, which means that one strand goes in one direction (first to last nucleotide from top to bottom) and the other strand goes in the opposite direction (first to last nucleotide from bottom to top).

Because the sugar and phosphate components which make up the sidepieces are always attached in the same way, the same alternating phosphate-sugar sequence repeats over and over again. The bases attached to each sugar may be one of four possible types. Because of the geometry of the DNA molecule, the only possible base pairs that will fit are adenine (A) paired with thymine (T), and cytosine (C) paired with guanine (G).

The DNA in our cells is a masterpiece of packing. The double helix coils itself around protein cores to form nucleosomes. These DNA-protein structures resemble beads on a string. Flexible regions between nucleosomes allow these structures to be wound around themselves to produce an even more compact fiber. The fibers can then be coiled for even further compactness. Ultimately, DNA is packed into the highly condensed chromosomes. If the DNA in a human cell is stretched, it is approximately 6 ft (1.82 m) long. If all 46 chromosomes are laid end-to-end, their total length is still only about eight-thousandths of an inch. This means that DNA in chromosomes is condensed about 10,000 times more than that in the double helix. Why all this packing? The likely answer is that the fragile DNA molecule would get broken in its extended form. Also, if not for this painstaking compression, the cell might be mired in its own DNA.
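The packing ratio quoted above can be checked with a little arithmetic. The sketch below uses only the two lengths given in the text (6 ft and eight-thousandths of an inch); the variable names are illustrative. The result is roughly 9,000, which is consistent with the rounded figure of "about 10,000."

```python
INCHES_PER_FOOT = 12

extended_in = 6 * INCHES_PER_FOOT   # 6 ft of stretched DNA, in inches
condensed_in = 0.008                # eight-thousandths of an inch

ratio = extended_in / condensed_in  # how many times shorter the packed form is
print(f"packing ratio: about {ratio:,.0f}-fold")
```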


DNA directs a cell's activities by specifying the structures of its proteins and by regulating which proteins are produced, in what amounts, and where. In so doing, it never leaves the nucleus. Each human cell contains about 6 ft (2 m) of highly condensed DNA, which encodes some 30,000 genes. If a particular protein is to be made, the DNA segment corresponding to the gene for that protein acts as a template (pattern) for the synthesis of an RNA molecule in a process known as transcription. This messenger RNA molecule travels from the nucleus to the cytoplasm where it in turn acts as the template for the construction of the protein by the protein assembly apparatus of the cell. This latter process is known as translation and requires an adaptor molecule, transfer RNA, which translates the genetic code of DNA into the language of proteins.

Eventually, when a cell divides, its DNA must be copied so that each daughter cell will have a complete set of genetic instructions. The structure of DNA is perfectly suited to this process. The two intertwined strands unwind, exposing their bases, which then pair with bases on free nucleotides present in the cell. Because of the base-pairing rules, the sequence of bases along one strand of DNA determines the sequence of bases in the newly forming complementary strand. An enzyme then joins the free nucleotides to complete the new strand. Since the two new DNA strands that result are identical to the two originals, the cell can pass along an exact copy of its DNA to each daughter cell.

Sex cells, the eggs and sperm, contain half as many chromosomes as other cells. When the egg and sperm fuse during fertilization, they form the first cell of a new individual with the complete complement of DNA: 46 chromosomes. Each cell (except the sex cells) in the new person carries DNA identical to that in the fertilized egg cell. In this way the DNA of both parents is passed from one generation to the next. Thus, DNA plays a crucial role in the propagation of life.

Replication of DNA

DNA replication, the process by which the double-stranded DNA molecule reproduces itself, is a complicated process, even in the simplest organisms. DNA synthesis (making new DNA from old) is complex because it requires the interaction of a number of cellular components and is rigidly controlled to ensure the accuracy of the copy, upon which the very life of the organism depends. This adds several verification steps to the procedure. Though the details vary from organism to organism, DNA replication follows certain rules that are universal to all.

DNA replication (duplication, or copying) is always semi-conservative. This means that during DNA replication the two strands of the parent molecule unwind and each becomes a template for the synthesis of the complementary strand of the daughter molecule. As a result both daughter molecules contain one new strand and one old strand (from the parent molecule). The replication of DNA always requires a template, an intact strand from the parent molecule. This strand determines the sequence of nucleotides on the new strand, because of the A-with-T and C-with-G base pairing requirement.
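Semi-conservative replication can be illustrated with a small model. In this sketch each strand is a `(sequence, label)` tuple, and strand direction is ignored for simplicity. The names `complement` and `replicate`, the labels, and the sample sequence are all assumptions made for the illustration.

```python
PAIR = {"A": "T", "T": "A", "C": "G", "G": "C"}  # A-with-T, C-with-G


def complement(sequence: str) -> str:
    """Return the base-by-base pairing partner of a strand."""
    return "".join(PAIR[base] for base in sequence)


def replicate(duplex):
    """Return the two daughter duplexes of a (strand_a, strand_b) parent.

    Each parental strand serves as a template and is kept; its partner
    in the daughter molecule is freshly built and labelled "new".
    """
    daughters = []
    for sequence, label in duplex:
        new_strand = (complement(sequence), "new")
        daughters.append(((sequence, label), new_strand))
    return daughters


parent = (("GGTACAT", "old"), ("CCATGTA", "old"))
for template, copy in replicate(parent):
    print(template, copy)
# ('GGTACAT', 'old') ('CCATGTA', 'new')
# ('CCATGTA', 'old') ('GGTACAT', 'new')
```

Each daughter holds exactly one "old" and one "new" strand, and both daughters carry the same pair of sequences as the parent.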

Replication begins at a specific site called the replication origin when the enzyme DNA helicase binds to a portion of the double-stranded helix and melts the bonds between base pairs. This unwinds the helix to form a replication fork consisting of two separated strands, each serving as a template. Specific proteins then bind to these single strands to prevent them from re-pairing. Another enzyme, called DNA polymerase, proceeds to assemble the daughter strands using a pool of free nucleotide units, which are present in the cell in an activated form.

High fidelity in the copying of DNA is vital to the organism and, incredibly, only about one error per one trillion replications ever occurs. This high fidelity results largely because DNA polymerase is a self-editing enzyme. If a nucleotide added to the end of the chain mismatches the complementary nucleotide on the template, pairing does not occur. DNA polymerase then clips off the unpaired nucleotide and replaces it with the correct one.

Occasionally errors are made during DNA replication and passed along to daughter cells. Such errors are called mutations. They can have serious consequences because they can cause the insertion of the wrong amino acid into a protein. For example, the substitution of a T for an A in the gene encoding hemoglobin causes an amino acid substitution that results in sickle cell anemia. To understand the significance of such mutations requires knowledge of the genetic code.
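The effect of a single-base substitution can be sketched as follows. The example assumes the standard textbook description of the sickle cell mutation, in which the three-base unit GAG (glutamic acid) becomes GTG (valine). That detail, the two-entry lookup table, and the helper name `substitute` are additions for illustration and do not appear in the text above.

```python
# Two entries from the standard genetic code (DNA coding-strand spelling).
CODON_TO_AMINO_ACID = {"GAG": "glutamic acid", "GTG": "valine"}


def substitute(codon: str, position: int, new_base: str) -> str:
    """Return `codon` with the base at `position` (0-based) replaced."""
    return codon[:position] + new_base + codon[position + 1:]


normal = "GAG"
mutant = substitute(normal, 1, "T")  # a T takes the place of the middle A

print(normal, "->", CODON_TO_AMINO_ACID[normal])  # GAG -> glutamic acid
print(mutant, "->", CODON_TO_AMINO_ACID[mutant])  # GTG -> valine
```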

The genetic code

Genetic information is stored as nucleotide sequences in DNA (or RNA) molecules. This sequence specifies the identity and position of the amino acids in a particular protein. Amino acids are the building blocks of proteins in the same way that nucleotides are the building blocks of DNA. However, though there are only four possible bases in DNA (or RNA), there are 20 possible amino acids in proteins. The genetic code is a sort of bilingual dictionary which translates the language of DNA into the language of proteins. In the genetic code the letters are the four bases A, C, G, and T (or U instead of T in RNA). Obviously, the four bases of DNA are not enough to code for 20 amino acids. A sequence of two bases is also insufficient, because this permits coding for only 16 of the 20 amino acids in proteins. Therefore, a sequence of three bases is required to ensure enough combinations to code for all 20 amino acids. Since all the combinations in this DNA language, called codons, consist of three letters, the genetic code is often referred to as the triplet code.
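The counting argument above can be verified directly by enumerating every possible "word" of one, two, or three bases. The function name `code_words` is an illustrative choice.

```python
from itertools import product

BASES = "ACGT"
AMINO_ACIDS_NEEDED = 20


def code_words(length: int) -> list[str]:
    """All sequences of `length` bases drawn from A, C, G, T."""
    return ["".join(p) for p in product(BASES, repeat=length)]


for length in (1, 2, 3):
    count = len(code_words(length))
    enough = count >= AMINO_ACIDS_NEEDED
    print(f"{length}-base words: {count:2d}  enough for 20 amino acids: {enough}")
# 1-base words:  4  enough for 20 amino acids: False
# 2-base words: 16  enough for 20 amino acids: False
# 3-base words: 64  enough for 20 amino acids: True
```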

Each codon specifies a particular amino acid. Because there are 64 possible codons and only 20 amino acids, several different codons specify the same amino acid, so the genetic code is said to be degenerate. However, the code is unambiguous because each codon specifies only one amino acid.
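Degeneracy and unambiguity can both be seen by building the standard codon table. The sketch below uses the conventional compact listing of the standard genetic code (one-letter amino acid symbols in T, C, A, G order); note that in this table three of the 64 codons are stop signals, marked `*`, rather than amino acids. The variable names are illustrative.

```python
from collections import defaultdict
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order for the first, second, and third base.
# One-letter amino acid symbols; '*' marks a stop signal.
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")

# A dict maps each codon to exactly one meaning: the code is unambiguous.
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Inverting the table shows that several codons share one amino acid:
# the code is degenerate.
codons_for = defaultdict(list)
for codon, aa in CODON_TABLE.items():
    codons_for[aa].append(codon)

print(len(CODON_TABLE))   # 64
print(codons_for["L"])    # leucine: ['TTA', 'TTG', 'CTT', 'CTC', 'CTA', 'CTG']
print(codons_for["W"])    # tryptophan: ['TGG']
```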

Since in eukaryotes DNA never leaves the nucleus, the information it stores is not transferred to the cell directly. Instead, a DNA sequence must first be copied into a messenger RNA molecule, which carries the genetic information from the nucleus to protein assembly sites in the cytoplasm. There it serves as the template for protein construction. The sequences of nucleotide triplets in messenger RNA are also referred to as codons.

Expression of genetic information

Genetic information flows from DNA to RNA to protein. Ultimately, the linear sequence of nucleotides in DNA directs the production of a protein molecule with a characteristic three-dimensional structure essential to its proper function. Initially, information is transcribed from DNA to RNA. The information in the resulting messenger RNA is then translated from RNA into protein by small transfer RNA molecules.
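The flow from DNA to RNA to protein can be sketched in code. This is a deliberately simplified model: `transcribe` produces the messenger RNA by rewriting the DNA coding strand with U in place of T (in the cell the RNA is built on the complementary template strand, which gives the same result), and `translate` reads the message three bases at a time using the standard genetic code. The function names and the short sample gene are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, TCAG order; '*' marks a stop signal.
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
DNA_CODONS = {"".join(codon): aa
              for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def transcribe(coding_strand: str) -> str:
    """Return messenger RNA matching the DNA coding strand (T becomes U)."""
    return coding_strand.replace("T", "U")


def translate(mrna: str) -> str:
    """Read the mRNA codon by codon; stop at the first stop signal."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        codon = mrna[i:i + 3].replace("U", "T")  # look up in DNA spelling
        amino_acid = DNA_CODONS[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


gene = "ATGGCCTTTTGA"             # hypothetical four-codon gene
mrna = transcribe(gene)
print(mrna)                       # AUGGCCUUUUGA
print(translate(mrna))            # MAF
```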

In some exceptional cases the flow of genetic information from DNA to RNA is reversed. In retroviruses, such as the AIDS virus, RNA is the hereditary material. An enzyme known as reverse transcriptase makes a copy of DNA using the viral RNA as a template. In still other viruses which use RNA as the hereditary material, DNA is not involved in the flow of information at all.


Codon The base sequence of three consecutive nucleotides on DNA (or RNA) that codes for a particular amino acid or signals the beginning or end of a messenger RNA molecule.

Cytoplasm All the protoplasm in a living cell that is located outside of the nucleus, as distinguished from nucleoplasm, which is the protoplasm in the nucleus.

Gene A discrete unit of inheritance, represented by a portion of DNA located on a chromosome. The gene is a code for the production of a specific kind of protein or RNA molecule, and therefore for a specific inherited characteristic.

Genetic code The blueprint for all structures and functions in a cell as encoded in DNA.

Genetic engineering The manipulation of the genetic content of an organism for the sake of genetic analysis or to produce or improve a product.

Genome The complete set of genes an organism carries.

Nucleotide The basic unit of DNA. It consists of deoxyribose, phosphate, and a ring-like, nitrogen-containing base.

Nucleus A compartment in the cell which is enclosed by a membrane and which contains its genetic information.

Replication The synthesis of a new DNA molecule from a pre-existing one.

Transcription The process of synthesizing RNA from DNA.

Translation The process of protein synthesis.

Most cells in the body contain the same DNA as that in the fertilized egg. (Some exceptions to this are the sex cells, which contain only half of the normal complement of DNA, as well as red blood cells, which lose their nucleus when fully developed.) Some so-called housekeeping genes are expressed in all cells because they are involved in the fundamental processes required for normal function. (A gene is said to be expressed when its product, the protein it codes for, is actively produced in a cell.) For example, since all cells require ribosomes, structures that function as protein assembly lines, the genes for ribosomal proteins and ribosomal RNA are expressed in all cells. Other genes are only expressed in certain cell types, such as genes for antibodies in certain cells of the immune system. Some are expressed only during certain times in development. How is it that some cells express certain genes while others do not, even though all contain the same DNA? A complete answer to this question is still in the works. However, the main way is by controlling the start of transcription. This is accomplished by the interaction of proteins called transcription factors with DNA sequences near the gene. By binding to these sequences transcription factors may turn a gene on or off.

Another way is to change the rate of messenger RNA synthesis. Sometimes the stability of the messenger RNA is altered. The protein product itself may be altered, as well as its transport or stability. Finally, gene expression can be altered by DNA rearrangements. Such programmed reshuffling of DNA is the means of generating the huge assortment of antibody proteins found in immune cells.

Genetic engineering and recombinant DNA

Cells that contain the same recombinant DNA fragment are clones. A clone harboring a recombinant DNA molecule that contains a specific gene can be isolated and identified by a number of techniques, depending upon the particular experiment. Thus, recombinant DNA molecules can be introduced into rapidly growing microorganisms, such as bacteria or yeast, to produce large quantities of medically or commercially important proteins normally present only in scant amounts in the cell. For example, human insulin and interferon have been produced in this manner.

In recent years a technique has been developed which permits analysis of very small samples of DNA without repeated cloning, which is laborious. Known as the polymerase chain reaction, this technique involves amplifying a particular fragment of DNA by repeated synthesis using the enzyme DNA polymerase. This method can increase the amount of the desired DNA fragment by a million-fold or more.
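The "million-fold" figure follows from repeated doubling. The sketch below assumes an idealized reaction in which every cycle exactly doubles the number of copies; real reactions are less efficient, so the numbers are upper bounds. The function name `pcr_copies` is illustrative.

```python
def pcr_copies(start_copies: int, cycles: int) -> int:
    """Copies present after `cycles` rounds, assuming perfect doubling."""
    return start_copies * 2 ** cycles


for cycles in (10, 20, 30):
    print(cycles, pcr_copies(1, cycles))
# 10 1024
# 20 1048576
# 30 1073741824
```

Under this assumption, about 20 cycles are enough to exceed a million-fold amplification of a single starting molecule.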



DNA (deoxyribonucleic acid) is the molecule that stores genetic information in living systems. Like other organic molecules, DNA mostly consists of carbon, along with hydrogen, oxygen, nitrogen, and phosphorus. The fundamental structural unit of DNA is the nucleotide, which has two parts: an unvarying portion composed of sugar and phosphate, attached to one of four nitrogen-containing bases named adenine, cytosine, guanine, or thymine (abbreviated A, C, G, T).

The Double Helix

The structure of DNA, deduced in 1953 by James Watson, Francis Crick, and Rosalind Franklin, resembles that of a twisted ladder or spiral staircase composed of two long chains of nucleotides that are coiled around each other to form a double helix. The DNA ladder's two sidepieces (its double-stranded backbone) are made of alternating units of sugar and phosphate. The sugar is deoxyribose, which contains a ring of four carbons and one oxygen. A phosphate is an atom of phosphorus bonded to four oxygens. Bases attached to opposing sugars project inward toward each other to form rungs or steps, called base pairs. In contrast to the strong covalent (electron-sharing) bonds between nucleotides in a strand, the two bases in a base pair are held together only by much weaker hydrogen bonds. However, the cumulative attractive force of the hydrogen bonds in a chain of base pairs maintains DNA as a double-stranded molecule under physiological conditions. In the cell nucleus, DNA is bound to proteins to form chromosomes, and is coated with a layer of water molecules.

To make a sturdy rung, the two bases in a base pair have to interlock like pieces of a jigsaw puzzle, which only happens if their shapes and hydrogen-bonding characteristics are compatible. Only two combinations fulfill these requirements in DNA: GC and AT. This rule makes the two strands of a DNA molecule complementary, so that if the bases of one strand are ordered GGTACAT, the bases of the opposite strand must be ordered CCATGTA. The order of the bases on a strand (mirrored in the complementary strand) is called the sequence of the DNA, and embodies coded instructions for making new biomolecules: proteins, ribonucleic acid (RNA), and DNA itself.
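The pairing rule is simple enough to express as a lookup table. The following minimal sketch reproduces the GGTACAT example from the paragraph above; the function name `complementary_strand` is an illustrative choice.

```python
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}  # the only allowed pairs


def complementary_strand(strand: str) -> str:
    """Return the pairing partner of each base, in the same left-to-right order."""
    return "".join(PAIR[base] for base in strand)


print(complementary_strand("GGTACAT"))  # CCATGTA
```

Applying the function twice returns the original strand, which is why either strand fully determines the other.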

Complementarity and Replication

Each strand of DNA has a direction in which it can be read by the cellular machinery, arising from the arrangement of phosphates and sugars in the backbone. The two strands of DNA are oriented antiparallel to each other, that is, they lie parallel to each other but are decoded in opposite directions. Because of the numbering convention for the carbons in the sugar, the directions along the backbone are called 5' to 3' ("five-prime to three-prime") or 3' to 5'. The complementary nature of the two strands means that instructions for making new DNA can be read from both strands.
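Because the strands are antiparallel, writing the partner strand in its own 5' to 3' direction means both complementing and reversing the sequence. The sketch below shows this "reverse complement" operation; the function name and the sample sequence are illustrative.

```python
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(strand_5_to_3: str) -> str:
    """Return the partner strand, also written 5' to 3'."""
    return "".join(PAIR[base] for base in reversed(strand_5_to_3))


top = "GGTACAT"
bottom = reverse_complement(top)

print("5'-" + top + "-3'")           # 5'-GGTACAT-3'
print("3'-" + bottom[::-1] + "-5'")  # 3'-CCATGTA-5'  (partner, drawn underneath)
print("5'-" + bottom + "-3'")        # 5'-ATGTACC-3'  (same partner, read its own way)
```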

When DNA replicates, the weak hydrogen bonds of base pairs are broken and the two strands separate. Each strand acts as a template for the synthesis of a new complementary strand. Since the resulting new double-stranded molecule always contains one "old" (template) strand and one newly made strand, DNA replication is said to be semiconservative; it would be termed conservative if the two original template strands rejoined. By a similar mechanism (transcription), a DNA strand can be a template for the synthesis of RNA, which is a single-stranded nucleic acid that carries coded information from the DNA to the protein synthesizing machinery of the cell. During protein synthesis, the genetic code is used to translate the order of bases originally found in the DNA sequence into the order of amino acid building blocks in a protein.

Genes, Noncoding Sequences, and Methylation

DNA exists in nature as a macromolecule millions of base pairs long. In multicelled organisms, the complete set of genetic information (the genome) is divided among several DNA macromolecules (called chromosomes) in the cell nucleus. In contrast, the genomes of many one-celled organisms consist of a single, often circular, chromosome. The human genome contains 3.2 billion base pairs distributed among twenty-three chromosomes. Laid end to end, these would make a macromolecule 1.7 meters (5.5 feet) long; printed out, they would fill one thousand one-thousand-page telephone books. Furthermore, two copies of the genome are in almost every cell of humans and other diploid organisms. This vast amount of DNA packs into a cell nucleus, whose volume is only a few millionths of a cubic meter, by first spooling around globular proteins called histones. The DNA/histone complex then coils and curls up into even denser configurations, like a rubber band does when one holds one end and rolls the other end between one's fingers. Yet the human genome isn't nearly nature's biggest: the genome of a lily is just over ten times larger than a human's, although its nuclei are not significantly larger.


New Zealand-born British biologist who helped James Watson and Francis Crick deduce the structure of deoxyribonucleic acid (DNA), for which the three men received a 1962 Nobel Prize. Wilkins secretly showed Watson an x-ray diffraction photo of DNA taken by researcher Rosalind Franklin. Watson and Crick later used Franklin's extensive unpublished data to build a model of DNA.

The information storage capacity of DNA is vast; a microgram (one-millionth of a gram) of DNA theoretically could store as much information as 1 million compact discs. The "useful" information contained in genomes consists of the coded instructions for making proteins and RNA. These information-containing regions of a genome are called genes. However, genes comprise less than 5 percent of the human genome. Most genomes consist largely of repetitive, noncoding DNA (sometimes called junk DNA) that is interspersed with genes and whose only apparent function is to replicate itself. Perhaps it helps to hold the chromosome together. The tenfold greater size of the lily genome compared to humans' is due to the presence of enormous amounts of repetitive DNA of unknown function.

While most cells of higher organisms contain all the genes in the genome, specialized cells such as neurons or muscle require expression from only some of the genes. One strategy for silencing unneeded genes is methylation. A methyl group (CH3) is added to cytosine nucleotides, but only if they are followed by a guanine in the sequence, that is, CG. Adding methyl groups to a region of DNA attracts repressive DNA-binding proteins to it and may also cause the region to compact even further, making it inaccessible to proteins that make RNA from DNA (the first step of protein synthesis). During DNA replication the pattern of methylation is preserved by specific proteins that add methyl groups to the new strand based on the location of CG methyl groups in the template strand. The most extreme case of repression by methylation is X-inactivation, in which one of the two X chromosomes in cells of a female mammal is entirely shut down, presumably because expression from one X provides enough protein in females, as it does in males (who have only one X chromosome).
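The rule that only a cytosine followed by a guanine is a candidate for methylation can be expressed as a short scan over a sequence. This sketch only locates the candidate sites; it does not model the enzymes involved. The function name `cpg_positions` and the sample sequence are hypothetical.

```python
def cpg_positions(strand: str) -> list[int]:
    """Return the 0-based position of every C that is immediately followed by G."""
    return [i for i in range(len(strand) - 1) if strand[i:i + 2] == "CG"]


sequence = "ATCGGCGTACCGA"  # hypothetical sequence
print(cpg_positions(sequence))  # [2, 5, 10]
```

Note that the C at position 9 is not reported, because it is followed by another C rather than a G.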

see also Chromosome, Eukaryotic; Control of Gene Expression; Crick, Francis; Gene; Mutation; Nucleotides; Replication; RNA; Watson, James

Steven A. Sullivan


A defect in the gene for a methylating enzyme causes Rett syndrome, a disorder responsible for mental retardation and movement disorders in young girls.


DNA (deoxyribonucleic acid) carries design information between generations, and thus accounts for inherited biological traits (phenotypes). At conception, a father's sperm injects a set of DNA molecules into a mother's egg, which already contains a nearly matching set. Those molecules contain the designs for all the material components their child needs for growth, development, and daily living.

Structure of DNA

The designs are called genes. Some genes play a role in regulating other genes, and some design ribonucleic acid, a close relative of DNA. But mostly, the designs in DNA are for the class of chemicals called proteins. The human body contains tens of thousands of kinds of proteins, which do all the body's work. Interactions among those proteins, and interactions between them and environmental factors, account for the processes and structures of the body. Those processes and structures are manifested as inherited traits. DNA is composed of chains of chemical subunits called nucleotides, each of which contains one nitrogenous base: adenine (A), thymine (T), cytosine (C), or guanine (G). The design instructions in DNA are spelled out as particular sequences of these four bases. This is analogous to conveying instructions in printed books by particular arrangements of the twenty-six letters of the alphabet. In the case of genes, however, there are only four letters in the alphabet. Hundreds of nucleotides are linked in a DNA chain in a sequence that spells out instructions for a single gene.

There are two complementary chains in the structure of DNA. Each nucleotide in DNA has a sugar component joined to a phosphate group at one point on the sugar, and to a nitrogen-containing base attached at another point. The chains in DNA have the phosphate of one nucleotide linked to the sugar of the next nucleotide to form a strand of alternating sugars and phosphates with dangling nitrogenous bases. DNA contains two such chains, twisted around each other to form a double-stranded helix with the bases on the inside. Every A on one chain forms weak bonds with a T on the other strand, and every C on a strand bonds weakly to a G on the opposite chain. The two strands, held together weakly by the pairing of A with T, and G with C, are thus complementary, and the sequence in one can be deduced from the other's sequence.

Design information is transmitted as new DNA to new cells during development and growth. The complementarity of the two DNA strands allows their information to be copied. Each old strand is used as a template in synthesizing a new complementary one. Intricate cellular machinery makes new copies of the DNA when a fertilized egg divides into two progeny cells. When each of the progeny divides again, the new progeny all receive complete copies of the parental DNA. As the fertilized egg grows to become successively an embryo, a fetus, a child, and finally an adult, cells go through many rounds of division with replication of the DNA in each round. Finally, adult humans have trillions of cells, each one (except sperm and ovum) containing complete copies of the DNA initially contributed by the parents.

On rare occasions mutations (changes) are made in nucleotides by chemicals, radiation, or errors in copying DNA. In a nucleotide chain, one nucleotide may be substituted for another, or one or more nucleotides might be inserted or deleted. Sometimes the change in DNA structure has little or no effect on the function of the gene's product, but it frequently harms the function to some degree, or very rarely enhances it. Harmful mutations cause gene-based diseases, but enhancing mutations allow organisms to evolve new or more effective functions. Like normal phenotypes, disease phenotypes usually require the products of multiple genes, so most defective genes predispose an organism to disease rather than directly causing it. The accumulation of mutations within the human species accounts for such phenotypic differences as eye color, stature, or skin pigmentation. The number of mutations among human genes is so large that no two persons, except for identical twins, have exactly the same nucleotide sequence in the three billion bases of their DNA.

Control of gene expression

DNA information is expressed as proteins and their feedback networks. The information resident in nucleotide sequences is used not only for replicating DNA, but also for synthesizing proteins. Proteins are chains, typically a few hundred units long, of subunits called amino acids, of which there are twenty kinds. The amino acids in a protein are arranged in a specific sequence by cellular machinery that translates the genetic information coded in DNA. The sequence of nucleotides, read three at a time, corresponds to the sequence of amino acids in a protein. The amino acids differ among themselves in chemical character, so that every kind of protein differs in chemical character from others. The work of the human body requires many thousands of proteins, each having a highly specific function such as catalyzing a chemical reaction or transporting oxygen. Observable phenotypes are the result of protein action, usually the coordinated action of many proteins. The functions of many proteins are integrated into large networks, and these webs of chemical processes act as feedback control systems, allowing organisms to shift the balance of their activities to adapt to changes in the demand for the system's output. Often the networks possess alternate pathways for achieving a desired output.
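Reading nucleotides three at a time can be sketched in Python as follows. The table holds only six entries of the standard genetic code, enough for the hypothetical example sequence. The sketch also reads the DNA sequence directly, which is a simplification of what the cell's machinery does.

```python
# A small excerpt of the standard genetic code (DNA triplets -> amino acids).
CODON_TABLE = {
    "ATG": "Met",
    "GGC": "Gly",
    "AAA": "Lys",
    "TTT": "Phe",
    "TGG": "Trp",
    "TAA": "STOP",
}


def translate(dna):
    """Read the sequence three nucleotides at a time until a stop triplet."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "STOP":
            break
        protein.append(amino_acid)
    return protein


gene = "ATGGGCAAATTTTGGTAA"  # hypothetical example sequence
print(translate(gene))  # ['Met', 'Gly', 'Lys', 'Phe', 'Trp']
```

A full table would have 64 entries, one for each possible triplet of the four bases.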

Differentiation into specialized cells requires the control of gene expression. The development of a human being starts with a single-celled, fertilized egg. As the egg divides into two cells, and as successive rounds of cell division occur, every progeny cell receives a complete copy of parental DNA. In the first few divisions, the cells produced are identical in all observable characteristics, but as cell division continues, cells are produced that differ in phenotype even though all the cells continue to have identical DNA. In this differentiation, particular genes are controlled by blocking their expression, not by changing nucleotide sequence. Regulatory molecules block particular sites in DNA, preventing the expression of the corresponding genes as their products. Specific blocking thus generates different patterns of gene expression. Changing patterns of gene expression produce distinct populations of cells, diverging in phenotype as differentiation progresses. Eventually, differentiation in humans produces more than two hundred cell types, organized into different tissues and organs. In any one cell type the majority of its approximately 35,000 genes are repressed, leaving a small subset of expressed genes that differs from the subsets expressed in other cell types. Phenotypic differences between progeny in a given cell generation depend on the location of the cells in different microenvironments. During differentiation cells adapt to a succession of environmental changes produced by changes in their neighboring cells and extracellular fluids. Each successive adaptation is superimposed on its predecessor so that each terminally differentiated cell manifests the entire history of its lineage and not merely its immediate state. Since differentiation is irreversible in animals (except in special cases), history as well as DNA designs a person, even in the material sense.

Feedback networks and regulation of genes allow individual organisms to adapt to changing conditions throughout life. When the environment increases the need for the product of a network of chemical reactions, the overall process is accelerated, and when the need decreases, the process is inhibited. Obviously, adaptation to environment is induced by contact with physical and chemical forces, but adaptation can be evoked even without physical contact, as in the adaptation of the brain through learning and emotional reaction. Many of these adaptive responses affect patterns of gene expression, and therefore environment, as well as history, joins with DNA in designing persons.

At the level of populations, long-term adaptation to environment occurs more by changes in gene structure than by changes in the expression of genes. The mechanism for this adaptation is the natural selection that underlies evolution. For example, skin pigmentation may be an adaptation that protects against exposure to the sun, and the genes that design the pigment systems would be naturally selected in successive generations that are exposed to much sunlight. Similarly, sickle-cell hemoglobin seems to have evolved in Africa because it offers resistance to malaria that is prevalent there.

Long-term adaptation through natural selection is most obvious in the case of physical and chemical aspects of human beings. Less obvious is the adaptation of behaviors through natural selection of genes, a possibility actively studied under the title "sociobiology." Although the mechanisms producing material phenotypes may seem more obvious than those producing social behaviors, a mechanism giving rise to a certain behavior may be thoroughly materialistic, although far more complex. Behavior modification by psychoactive drugs reveals a material mechanism for behavior. A mechanism can be pictured, for example, in the courting and mating behaviors that are correlated with the release of hormones from the brain when an animal or human senses that a potential mate is near. Those released hormones induce particular chemical reactions at many sites throughout the body, giving rise to an appropriate pattern of bodily actions. Moreover, feedback responses between the mates guide further behavioral interactions between them. The hormonal system that links brain functions to bodily functions is, of course, designed by genes, and the mechanism just sketched is clearly materialistic. The frequent association of natural selection with notions of "survival of the fittest" makes altruism an especially challenging kind of behavior to study in testing the validity of sociobiological theory, and much of the research of sociobiologists is focused on the evolution of a gene for altruism.

Genes affect behavior, but as is the case with most human phenotypes, genes act in combinations and their expression is modulated by the histories and environments of individuals, as already described. Amid the variability of individual histories and environments, natural selection must be able to recognize the difference between organisms that possess a particular behavioral gene and those that do not. For a behavioral gene to evolve through natural selection, it must be powerful enough in determining the behavior to avoid substantial compromise by variable non-genetic factors. Sociobiology, then, tends to favor a strongly deterministic and materialistic view of behavior.

Human nature and genetic determinism

Choosing is part of human nature, but its degree of autonomy is debated. All agree that choice is constrained by genes, history, and environment, but does any degree of freedom remain? Science describes material brain mechanisms as chains of causes and effects, but every cause is an effect having a prior cause. Since the initial cause is not recognized by science, some say thought initiation is due to chance. Others look for initiation outside the material realm of science by distinguishing between mind and brain, or even spirit and brain.

Some degree of genetic determinism is necessary in describing human nature. All the possible scenarios of a person's life must conform to the designs in DNA, and thus genes set rigid, though spacious, boundaries on what a person can be and do. But genes are insufficient for explaining what actually happens. What actually happens within the boundaries set by genes depends on factors that control genes, including environment, history, and mental state. The question arises whether spiritual forces can be added to the list of controlling factors. Material determinism argues that a complete physicochemical description of the history and state of a person would explain everything without including a spiritual component. Some, however, argue that human spirituality is a capacity that emerged as gene-based human biology evolved, and that its activity cannot be fully comprehended at the molecular level. Still others add spirit as a control factor in human nature, accepting a dualism in which body and spirit are distinct, though coexistent, in a person. The disparity in these views of human nature has theological consequences.

A view of human nature according to material determinism fits atheism and deism. It provides no locus for personal interaction with God, although deists might suppose that God influences humans through environment. Belief in human spirituality, either as an emerged capacity or as a distinct part of human nature, does provide such a locus. Scientific understanding of gene-based human biology does not perceive a spiritual component in human nature, but a physicochemico-molecular description of humans might not be expected to be capable of such discernment in the first place.

See also Gene Patenting; Genetic Defect; Genetic Determinism; Genetics; Human Genome Project; Mutation; Nature versus Nurture


