13.2: Eukaryotic Epigenetic Gene Regulation - Biology

13.2: Eukaryotic Epigenetic Gene Regulation - Biology

We are searching data for your request:

Forums and discussions:
Manuals and reference books:
Data from registers:
Wait the end of the search in all databases.
Upon completion, a link will appear to access the found materials.

The human genome encodes over 20,000 genes; each of the 23 pairs of human chromosomes encodes thousands of genes. It is also organized so that specific segments can be accessed as needed by a specific cell type.

The first level of organization, or packing, is the winding of DNA strands around histone proteins. Histones package and order DNA into structural units called nucleosome complexes, which can control the access of proteins to the DNA regions (Figure 1a). Under the electron microscope, this winding of DNA around histone proteins to form nucleosomes looks like small beads on a string (Figure 1b). These beads (histone proteins) can move along the string (DNA) and change the structure of the molecule.

If DNA encoding a specific gene is to be transcribed into RNA, the nucleosomes surrounding that region of DNA can slide down the DNA to open that specific chromosomal region and allow for the transcriptional machinery (RNA polymerase) to initiate transcription (Figure 2). Nucleosomes can move to open the chromosome structure to expose a segment of DNA, but do so in a very controlled manner.

Practice Question

In females, one of the two X chromosomes is inactivated during embryonic development because of epigenetic changes to the chromatin. What impact do you think these changes would have on nucleosome packing?

[practice-area rows=”2″][/practice-area]
[reveal-answer q=”670204″]Show Answer[/reveal-answer]
[hidden-answer a=”670204″]The nucleosomes would pack more tightly together.[/hidden-answer]

How the histone proteins move is dependent on signals found on both the histone proteins and on the DNA. These signals are tags added to histone proteins and DNA that tell the histones if a chromosomal region should be open or closed (Figure 3 depicts modifications to histone proteins and DNA). These tags are not permanent, but may be added or removed as needed. They are chemical modifications (phosphate, methyl, or acetyl groups) that are attached to specific amino acids in the protein or to the nucleotides of the DNA. The tags do not alter the DNA base sequence, but they do alter how tightly wound the DNA is around the histone proteins. DNA is a negatively charged molecule; therefore, changes in the charge of the histone will change how tightly wound the DNA molecule will be. When unmodified, the histone proteins have a large positive charge; by adding chemical modifications like acetyl groups, the charge becomes less positive.

The DNA molecule itself can also be modified. This occurs within very specific regions called CpG islands. These are stretches with a high frequency of cytosine and guanine dinucleotide DNA pairs (CG) found in the promoter regions of genes. When this configuration exists, the cytosine member of the pair can be methylated (a methyl group is added). This modification changes how the DNA interacts with proteins, including the histone proteins that control access to the region. Highly methylated (hypermethylated) DNA regions with deacetylated histones are tightly coiled and transcriptionally inactive.

This type of gene regulation is called epigenetic regulation. Epigenetic means “around genetics.” The changes that occur to the histone proteins and DNA do not alter the nucleotide sequence and are not permanent. Instead, these changes are temporary (although they often persist through multiple rounds of cell division) and alter the chromosomal structure (open or closed) as needed. A gene can be turned on or off depending upon the location and modifications to the histone proteins and DNA. If a gene is to be transcribed, the histone proteins and DNA are modified surrounding the chromosomal region encoding that gene. This opens the chromosomal region to allow access for RNA polymerase and other proteins, called transcription factors, to bind to the promoter region, located just upstream of the gene, and initiate transcription. If a gene is to remain turned off, or silenced, the histone proteins and DNA have different modifications that signal a closed chromosomal configuration. In this closed configuration, the RNA polymerase and transcription factors do not have access to the DNA and transcription cannot occur (Figure 2).

View this video that describes how epigenetic regulation controls gene expression.

A link to an interactive elements can be found at the bottom of this page.

In Summary: Eukaryotic Epigenetic Gene Regulation

In eukaryotic cells, the first stage of gene expression control occurs at the epigenetic level. Epigenetic mechanisms control access to the chromosomal region to allow genes to be turned on or off. These mechanisms control how DNA is packed into the nucleus by regulating how tightly the DNA is wound around histone proteins. The addition or removal of chemical modifications (or flags) to histone proteins or DNA signals to the cell to open or close a chromosomal region. Therefore, eukaryotic cells can control whether a gene is expressed by controlling accessibility to transcription factors and the binding of RNA polymerase to initiate transcription.

13.2: Eukaryotic Epigenetic Gene Regulation - Biology

The human genome encodes over 20,000 genes each of the 23 pairs of human chromosomes encodes thousands of genes. The DNA in the nucleus is precisely wound, folded, and compacted into chromosomes so that it will fit into the nucleus. It is also organized so that specific segments can be accessed as needed by a specific cell type.

The first level of organization, or packing, is the winding of DNA strands around histone proteins. Histones package and order DNA into structural units called nucleosome complexes, which can control the access of proteins to the DNA regions (Figure 1a). Under the electron microscope, this winding of DNA around histone proteins to form nucleosomes looks like small beads on a string (Figure 1b). These beads (histone proteins) can move along the string (DNA) and change the structure of the molecule.

Figure 1. DNA is folded around histone proteins to create (a) nucleosome complexes. These nucleosomes control the access of proteins to the underlying DNA. When viewed through an electron microscope (b), the nucleosomes look like beads on a string. (credit “micrograph”: modification of work by Chris Woodcock)

If DNA encoding a specific gene is to be transcribed into RNA, the nucleosomes surrounding that region of DNA can slide down the DNA to open that specific chromosomal region and allow for the transcriptional machinery (RNA polymerase) to initiate transcription (Figure 2). Nucleosomes can move to open the chromosome structure to expose a segment of DNA, but do so in a very controlled manner.

Practice Question

Figure 2. Nucleosomes can slide along DNA. When nucleosomes are spaced closely together (top), transcription factors cannot bind and gene expression is turned off. When the nucleosomes are spaced far apart (bottom), the DNA is exposed. Transcription factors can bind, allowing gene expression to occur. Modifications to the histones and DNA affect nucleosome spacing.

In females, one of the two X chromosomes is inactivated during embryonic development because of epigenetic changes to the chromatin. What impact do you think these changes would have on nucleosome packing?

This type of gene regulation is called epigenetic regulation. Epigenetic means “around genetics.” The changes that occur to the histone proteins and DNA do not alter the nucleotide sequence and are not permanent. Instead, these changes are temporary (although they often persist through multiple rounds of cell division) and alter the chromosomal structure (open or closed) as needed. A gene can be turned on or off depending upon the location and modifications to the histone proteins and DNA.

View this video that describes how epigenetic regulation controls gene expression.

In Summary: Eukaryotic Epigenetic Gene Regulation

In eukaryotic cells, the first stage of gene expression control occurs at the epigenetic level. Epigenetic mechanisms control access to the chromosomal region to allow genes to be turned on or off. These mechanisms control how DNA is packed into the nucleus by regulating how tightly the DNA is wound around histone proteins. The addition or removal of chemical modifications (or flags) to histone proteins or DNA signals to the cell to open or close a chromosomal region. Therefore, eukaryotic cells can control whether a gene is expressed by controlling accessibility to transcription factors and the binding of RNA polymerase to initiate transcription.

Free Response

In cancer cells, alteration to epigenetic modifications turns off genes that are normally expressed. Hypothetically, how could you reverse this process to turn these genes back on?

You can create medications that reverse the epigenetic processes (to add histone acetylation marks or to remove DNA methylation) and create an open chromosomal configuration.

A scientific study demonstrated that rat mothering behavior impacts the stress response in their pups. Rats that were born and grew up with attentive mothers showed low activation of stress-response genes later in life, while rats with inattentive mothers had high activation of stress-response genes in the same situation. An additional study that swapped the pups at birth (i.e., rats born to inattentive mothers grew up with attentive mothers and vice versa) showed the same positive effect of attentive mothering. How do genetics and/or epigenetics explain the results of this study?

Swapping the pups at birth indicates that the genes inherited from the attentive or inattentive mothers do not explain the rats’ stress-responses later in life. Instead, researchers found that the attentive mothering caused the methylation of genes that control the expression of stress receptors in the brain. Thus, rats that received attentive maternal care exhibited epigenetic changes that limited the expression of stress-response genes, and that the effect was durable over their lifespans.

Some autoimmune diseases show a positive correlation with dramatically decreased expression of histone deacetylase 9 (HDAC9, an enzyme that removes acetyl groups from histones). Why would the decreased expression of HDAC9 cause immune cells to produce inflammatory genes at inappropriate times?

Histone acetylation reduces the positive charge of histone proteins, loosening the DNA wrapped around the histones. This looser DNA can then interact with transcription factors to express genes found in that region. Normally, once the gene is no longer needed, histone deacetylase enzymes remove the acetyl groups from histones so that the DNA becomes tightly wound and inaccessible again. However, when there is a defect in HDAC9, the deacetylation may not occur. In an immune cell, this would mean that inflammatory genes that were made accessible during an infection are not tightly rewound around the histones.

CH450 and CH451: Biochemistry - Defining Life at the Molecular Level

13.1 Prokaryotic Gene Regulation

13.2 Eukaryotic Gene Regulation

13.3 Protein-DNA Interactions

13.4 Epigenetics and Transgenerational Inheritence

13.5 References

Each nucleated cell in a multicellular organism contains copies of the same DNA. Similarly, all cells in two pure bacterial cultures inoculated from the same starting colony contain the same DNA, with the exception of changes that arise from spontaneous mutations. If each cell in a multicellular organism has the same DNA, then how is it that cells in different parts of the organism’s body exhibit different characteristics? Similarly, how is it that the same bacterial cells within two pure cultures exposed to different environmental conditions can exhibit different phenotypes? In both cases, each genetically identical cell does not turn on, or express, the same set of genes. Only a subset of proteins in a cell at a given time is expressed.

Genomic DNA contains both structural genes, which encode products that serve as cellular structures or enzymes, and regulatory genes, which encode products that regulate gene expression. The expression of a gene is a highly regulated process. Whereas regulating gene expression in multicellular organisms allows for cellular differentiation, in single-celled organisms like prokaryotes, it primarily ensures that a cell’s resources are not wasted making proteins that the cell does not need at that time.

Elucidating the mechanisms controlling gene expression is important to the understanding of human health. Malfunctions in this process in humans lead to the development of cancer and other diseases. Understanding the interaction between the gene expression of a pathogen and that of its human host is important for the understanding of a particular infectious disease. Gene regulation involves a complex web of interactions within a given cell among signals from the cell’s environment, signaling molecules within the cell, and the cell’s DNA. These interactions lead to the expression of some genes and the suppression of others, depending on circumstances.

Prokaryotes and eukaryotes share some similarities in their mechanisms to regulate gene expression however, gene expression in eukaryotes is more complicated because of the temporal and spatial separation between the processes of transcription and translation. Thus, although most regulation of gene expression occurs through transcriptional control in prokaryotes, regulation of gene expression in eukaryotes occurs at the transcriptional level and post-transcriptionally (after the primary transcript has been made).

13.1 Prokaryotic Gene Regulation

In bacteria and archaea, structural proteins with related functions are usually encoded together within the genome in a block called an operon and are transcribed together under the control of a single promoter, resulting in the formation of a polycistronic transcript(Figure 13.1). In this way, regulation of the transcription of all of the structural genes encoding the enzymes that catalyze the many steps in a single biochemical pathway can be controlled simultaneously, because they will either all be needed at the same time, or none will be needed. For example, in E. coli, all of the structural genes that encode enzymes needed to use lactose as an energy source are encoded next to each other in the lactose (or lac) operon under the control of a single promoter, the lac promoter. French scientists François Jacob (1920–2013) and Jacques Monod at the Pasteur Institute were the first to show the organization of bacterial genes into operons, through their studies on the lac operon of E. coli. For this work, they won the Nobel Prize in Physiology or Medicine in 1965.

Figure 13.1 Schematic Representation of an Operon. In prokaryotes, structural genes of related function are often organized together on the genome and transcribed together under the control of a single promoter. The operon’s regulatory region includes both the promoter and the operator. If a repressor binds to the operator, then the structural genes will not be transcribed. Alternatively, activators may bind to the regulatory region, enhancing transcription.

Each operon includes DNA sequences that influence its own transcription these are located in a region called the regulatory region. The regulatory region includes the promoter and the region surrounding the promoter, to which transcription factors, proteins encoded by regulatory genes, can bind. Transcription factors influence the binding of RNA polymerase to the promoter and allow its progression to transcribe structural genes. A repressor is a transcription factor that suppresses transcription of a gene in response to an external stimulus by binding to a DNA sequence within the regulatory region called the operator, which is located between the RNA polymerase binding site of the promoter and the transcriptional start site of the first structural gene. Repressor binding physically blocks RNA polymerase from transcribing structural genes. Conversely, an activator is a transcription factor that increases the transcription of a gene in response to an external stimulus by facilitating RNA polymerase binding to the promoter. An inducer, a third type of regulatory molecule, is a small molecule that either activates or represses transcription by interacting with a repressor or an activator.

In prokaryotes, there are examples of operons whose gene products are required rather consistently and whose expression, therefore, is unregulated. Such operons are constitutively expressed, meaning they are transcribed and translated continuously to provide the cell with constant intermediate levels of the protein products. Such genes encode enzymes involved in housekeeping functions required for cellular maintenance, including DNA replication, repair, and expression, as well as enzymes involved in core metabolism. In contrast, there are other prokaryotic operons that are expressed only when needed and are regulated by repressors, activators, and inducers.

Prokaryotic operons are commonly controlled by the binding of repressors to operator regions, thereby preventing the transcription of the structural genes. Such operons are classified as either repressible operonsor inducible operons. Repressible operons, like the tryptophan (trp) operon, typically contain genes encoding enzymes required for a biosynthetic pathway. As long as the product of the pathway, like tryptophan, continues to be required by the cell, a repressible operon will continue to be expressed. However, when the product of the biosynthetic pathway begins to accumulate in the cell, removing the need for the cell to continue to make more, the expression of the operon is repressed. Conversely, inducible operons, like the lac operon of E. coli, often contain genes encoding enzymes in a pathway involved in the metabolism of a specific substrate like lactose. These enzymes are only required when that substrate is available, thus expression of the operons is typically induced only in the presence of the substrate.

The trp Operon: A Repressible Operon

E. coli can synthesize tryptophan using enzymes that are encoded by five structural genes located next to each other in the trp operon (Figure 13.2). When environmental tryptophan is low, the operon is turned on. This means that transcription is initiated, the genes are expressed, and tryptophan is synthesized. However, if tryptophan is present in the environment, the trp operon is turned off. Transcription does not occur and tryptophan is not synthesized.

When tryptophan is not present in the cell, the repressor by itself does not bind to the operator therefore, the operon is active and tryptophan is synthesized. However, when tryptophan accumulates in the cell, two tryptophan molecules bind to the trp repressor molecule, which changes its shape, allowing it to bind to the trp operator. This binding of the active form of the trp repressor to the operator blocks RNA polymerase from transcribing the structural genes, stopping expression of the operon. Thus, the actual product of the biosynthetic pathway controlled by the operon regulates the expression of the operon.

Figure 13.2 The Trp Operon. The five structural genes needed to synthesize tryptophan in E. coli are located next to each other in the trp operon. When tryptophan is absent, the repressor protein does not bind to the operator, and the genes are transcribed. When tryptophan is plentiful, tryptophan binds the repressor protein at the operator sequence. This physically blocks the RNA polymerase from transcribing the tryptophan biosynthesis genes.

The Lac Operon: An Inducible Operon

The lac operon is an example of an inducible operon that is also subject to activation in the absence of glucose. The lac operon encodes three structural genes, lacZ, lacY, and lacA, necessary to acquire and process the disaccharide lactose from the environment (Fig 13.3A).

Figure 13.3 Biological Activity of the lac Operon. (A) Schematic representation of the lac operon in E. coli. The lac operon has three structural genes, lacZ, lacY, and lacA that encode for β-galactosidase, permease, and galactoside acetyltransferase, respectively. The promoter (p) and operator (o) sequences that control the expression of the operon are shown. Upstream of the lac operon is the lac repressor gene, lacI, controlled by the lacI promoter (p). (B) Shows the lac repressor inhibition of the lac operon gene expression in the absence of lactose. The lac repressor binds with the operator sequence of the operon and prevents the RNA polymerase enzyme which is bound to the promoter (p) from initiating transcription. (C) In the presence of lactose, some of the lactose is converted into allolactose, which binds and inhibits the activity of the lac repressor. The lac repressor-allolactose complex cannot bind with the operator region of the operon, freeing the RNA polymerase and causing the initiation of transcription. Expression of the lac operon genes enables the breakdown and utilization of lactose as a food source within the organism.

The lacZ gene encodes the β-galactosidase (β-gal) enzyme responsible for the hydrolysis of lactose into simple sugars glucose and galactose (Fig. 13.4A). The β-gal enzyme can also mediate the breakdown of the alternate substrate 5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside (Xgal) (Fig. 13.4B). The breakdown product, 5-bromo-4-chloro-3-hydroxyindole – 1, spontaneously dimerizes to form the intensely blue blue product, 5,5′-dibromo-4,4′-dichloro-indigo – 2. Thus, Xgal has been a valuable research tool, not only in the study of the enzymatic activity of β-gal, but also in the development of the commonly used blue-white DNA cloning system that utilizes the β-gal enzyme as a marker in molecular cloning experiments.

The lac operon contains two more genes, in addition to lacZ (Fig. 13.3A). The lacY gene encodes a permease that increases the uptake of lactose into the cell and lacA encodes a galactoside acetyltransferase (GAT) enzyme. The exact function of GAT during lactose metabolism has not been conclusively elucidated but acetylation is thought to play a role in the transport of the modified sugars.

Figure 13.4 Reactions Controlled by the Expression of the Lac Operon. (A) Expression of the β-galactosidase enzyme enables the breakdown of lactose into the simple sugars, glucose and galactose for E. coli to use as a food resource. (B) The β-galactosidase enzyme also mediates the breakdown of the non-native substrate 5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside (Xgal). Breakdown product (1) 5-bromo-4-chloro-3-hydroxyindole quickly dimerizes into the intensely blue product (2) 5,5′-dibromo-4,4′-dichloro-indigo making it a useful tool for molecular biology. (C) β-D-1-thiogalactopyranoside (IPTG) can serve as a non-native inducer of the lac operon. It mimics the structure of lactose and binds with the Lac Repressor.

For the lac operon to be expressed, lactose must be present. This makes sense for the cell because it would be energetically wasteful to create the enzymes to process lactose if lactose was not available.

In the absence of lactose, the lacI gene is constituitively expressed, expressing the lac repressor protein (Fig. 13.3 B). The lac repressor binds with an operator region of the lac operon and physically prevents RNA polymerase from transcribing the structural genes (Fig. 13.3 B). However, when lactose is present, the lactose inside the cell is converted to allolactose. Allolactose serves as an inducer molecule, binding to the repressor and changing its shape so that it is no longer able to bind to the operator DNA (Fig. 13.3 C). Removal of the repressor in the presence of lactose allows RNA polymerase to move through the operator region and begin transcription of the lac structural genes. In addition to lactose, laboratory experiments have revealed that the non-natural compound Isopropyl β-D-1-thiogalactopyranoside (IPTG) can also bind with the lac repressor and cause the expression of lac operon (Figure 13.4 C). Similar to Xgal, this compound has also been used as a research tool for molecular cloning.

The Lac Operon: Activation by Catabolite Activator Protein

Bacteria typically have the ability to use a variety of substrates as carbon sources. However, because glucose is usually preferable to other substrates, bacteria have mechanisms to ensure that alternative substrates are only used when glucose has been depleted. Additionally, bacteria have mechanisms to ensure that the genes encoding enzymes for using alternative substrates are expressed only when the alternative substrate is available. In the 1940s, Jacques Monod was the first to demonstrate the preference for certain substrates over others through his studies of E. coli’s growth when cultured in the presence of two different substrates simultaneously. Such studies generated diauxic growth curves, like the one shown in Figure 13.5. Although the preferred substrate glucose is used first, E. coli grows quickly and the enzymes for lactose metabolism are absent. However, once glucose levels are depleted, growth rates slow, inducing the expression of the enzymes needed for the metabolism of the second substrate, lactose. Notice how the growth rate in lactose is slower, as indicated by the lower steepness of the growth curve.

Figure 13.5. Utilization of Glucose in E. Coli.When grown in the presence of two substrates, E. coli uses the preferred substrate (in this case glucose) until it is depleted. Then, enzymes needed for the metabolism of the second substrate are expressed and growth resumes, although at a slower rate.

The ability to switch from glucose use to another substrate like lactose is a consequence of the activity of an enzyme called Enzyme IIA (EIIA). When glucose levels drop, cells produce less ATP from catabolism and EIIA becomes phosphorylated. Phosphorylated EIIA activates adenylyl cyclase, an enzyme that converts some of the remaining ATP to cyclic AMP (cAMP), a cyclic derivative of AMP and important signaling molecule involved in glucose and energy metabolism in E. coli (Fig. 13.6). As a result, cAMP levels begin to rise in the cell. This is an indicator to the cell, that overall energy levels are low and that ATP is being depleted.

Figure 13.6. Conversion of ATP to cAMP. When ATP levels decrease due to depletion of glucose, some remaining ATP is converted to cAMP by adenylyl cyclase. Thus, increased cAMP levels signal glucose depletion.

The lac operon also plays a role in this switch from using glucose to using lactose. When glucose is scarce, the accumulating cAMP caused by increased adenylyl cyclase activity binds to catabolite activator protein (CAP), also known as cAMP receptor protein (CRP). The complex binds to the promoter region of the lac operon (Figure 13.7). In the regulatory regions of these operons, a CAP binding site is located upstream of the RNA polymerase binding site in the promoter. Binding of the CAP-cAMP complex to this site increases the binding ability of RNA polymerase to the promoter region to initiate the transcription of the structural genes. Thus, in the case of the lac operon, for transcription to occur, lactose must be present (removing the lac repressor protein) and glucose levels must be depleted (allowing binding of an activating protein). When glucose levels are high, there is catabolite repression of operons encoding enzymes for the metabolism of alternative substrates. Because of low cAMP levels under these conditions, there is an insufficient amount of the CAP-cAMP complex to activate transcription of these operons.

Figure 13.7 Effect of CAP on the Lac Operon. (a) In the presence of cAMP, CAP binds to the promoters of operons, like the lac operon, that encode genes for enzymes for the use of alternate substrates. (b) For the lac operon to be expressed, there must be activation by cAMP-CAP as well as removal of the lac repressor from the operator.

Global Responses of Prokaryotes

In prokaryotes, there are also several higher levels of gene regulation that have the ability to control the transcription of many related operons simultaneously in response to an environmental signal. A group of operons all controlled simultaneously is called a regulon.


When sensing impending stress, prokaryotes alter the expression of a wide variety of operons to respond in coordination. They do this through the production of alarmones, which are small intracellular nucleotide derivatives, such as guanosine pentaphosphate (pppGpp) (Fig. 13.8).

Figure 13.8 Structure of Guanosine Pentaphosphate (pppGpp)

Alarmones change which genes are expressed and stimulate the expression of specific stress-response genes. For example, pppGpp signaling is involved in the stringent response in bacteria, causing the inhibition of RNA synthesis when there is a shortage of amino acids present. This causes translation to decrease and the amino acids present are therefore conserved. Furthermore, pppGpp causes the up-regulation of many other genes involved in stress response such as the genes for amino acid uptake (from surrounding media) and biosynthesis.

The use of alarmones to alter gene expression in response to stress appears to be important in pathogenic bacteria, as well. On encountering host defense mechanisms and other harsh conditions during infection, many operons encoding virulence genes are upregulated in response to alarmone signaling. Knowledge of these responses is key to being able to fully understand the infection process of many pathogens and to the development of therapies to counter this process.

Quorum Sensing

Quorum sensing (QS) is an intercellular communication mechanism of bacteria used to coordinate the activities of individual cells in population level in response to surroundings through production and perception of diffusible signal molecules such as Acyl Homoserine Lactones or small singaling peptides (Fig. 13.9). The signal synthase, signal receptor, and signal molecules are three essential elements of the basic QS circuit machinery (Fig. 13.9). Genes encoding signal generating proteins are also included among the QS target genes. This forms an autoinduction feedback loop to modulate generation of signal molecules. Several bacterial behaviors including virulence factors expression, secondary metabolites production, biofilm formation, motility, and luminescence are regulated by QS. Through complex regulatory networks bacteria are capable of expressing corresponding genes according to their own population size and of behaving in a coordinated manner.

Figure 13.9 Examples of Quorum Sensing Pathways. (Left panel) Typical Gram-negative quorum sensing mechanism. Acyl homoserine lactone molecules, synthesized by LuxI, passively pass the bacterial cell membrane and when a sufficient concentration is reached (threshold level) activate the intracellular LuxR which subsequently activates target gene expression in a coordinated way. Note that a single cell is shown for simplicity. However, acyl homoserine lactones will commonly diffuse and target neighboring cells within the colony to mediate a communal or population response within the bacterial colony. (Right panel) Quorum sensing peptides are synthesized by the bacterial ribosomes as pro-peptidic proteins and undergo posttranslational modifications during excretion by active transport. The quorum sensing peptides bind membrane associated receptors which get autophosphorylated and activate intracellular response regulators via phosphor-transfer. These phosphorylated response regulators induce increased target gene expression.

For example, some microbial species, such as Staphylococcus aureus, can encase their community within a self-produced matrix of hydrated extracellular polymeric substances that include polysaccharides, proteins, nucleic acids, and lipid molecules. These encasements are known as biofilms. The formation of the biofilm on solid surfaces is a step-wise process comprising several stages (Fig. 13.10). It starts with the conditioning of the surface through the coating with macromolecules from the aqueous surrounding, which enables initial reversible adhesion of microorganisms. The next step is a formation of stronger, irreversible attachments to the surface, followed by the proliferation and aggregation of microorganisms into multicellular and multilayered clusters, which actively produce extracellular matrix. Some cells in the mature biofilms continuously detach and separate from the aggregates, representing a continuous source of planktonic bacteria that can subsequently spread and form new microcolonies.

Figure 13.10 Schematic drawing of biofilm formation.

Biofilms are a common cause of chronic, nosocomial and medical device-related infections, due to the fact that they can develop either on vital or necrotic tissue as well as on the inert surfaces of different implanted materials. Moreover, biofilms are linked with high-level resistance to antimicrobials, frequent treatment failures, increased morbidity and mortality. As a consequence, biofilm infections and accompanying diseases have become a major health concern and a serious challenge for both modern medicine and pharmacy. The rough estimation shows that more than 60% of hospital-associated infections are attributable to the biofilms formed on indwelling medical devices, which result in more than one million cases of infected patients annually and more than $1 billion of hospitalization costs per year in the USA.

Biofilm infections share some common characteristics: slow development in one or more hot-spots, delayed clinical manifestation, persistency for months or years, usually with interchanging periods of acute exacerbations and absence of clinical symptoms. Even though they are less aggressive than acute infections, their treatment is challenging to a greater extent. The main reason for the aforesaid is up to 1000-fold decrease in susceptibility of biofilms to antimicrobial agents and disinfectants as well as resistance to host immune response. Thus, ways to reduce or inhibit biofilm formation are highly sought. The majority of the proposed biofilm-control methods focuses on: (i) prevention and minimization of biofilm formation by selection and surface modifications of anti-adhesive materials (ii) debridement techniques including ultrasound and surgical procedures (iii) disruption of biofilm QS-signaling system or (iv) achieving proper drug penetration and delivery to formed biofilms by the use of electromagnetic field, ultrasound waves, photodynamic activation or specific drug delivery systems.

Alternate σ Factors

Since the σ subunit of bacterial RNA polymerase confers specificity as to which promoters should be transcribed, altering the σ factor used is another way for bacteria to quickly and globally change what regulons are transcribed at a given time. The σ factor recognizes sequences within a bacterial promoter, so different σ factors will each recognize slightly different promoter sequences. In this way, when the cell senses specific environmental conditions, it may respond by changing which σ factor it expresses, degrading the old one and producing a new one to transcribe the operons encoding genes whose products will be useful under the new environmental condition. For example, in sporulating bacteria of the genera Bacillus and Clostridium (which include many pathogens), a group of σ factors controls the expression of the many genes needed for sporulation in response to sporulation-stimulating signals.

Prokaryotic Attenuation and Riboswitches

Although most gene expression is regulated at the level of transcription initiation in prokaryotes, there are also mechanisms to control both the completion of transcription, as well as translation, concurrently. Since their discovery, these mechanisms have been shown to control the completion of transcription and translation of many prokaryotic operons. Because these mechanisms link the regulation of transcription and translation directly, they are specific to prokaryotes, because these processes are physically separated in eukaryotes.

One such regulatory system is attenuation, whereby secondary stem-loop structures formed within the 5’ end of an mRNA being transcribed determine if transcription to complete the synthesis of this mRNA will occur and if this mRNA will be used for translation. Beyond the transcriptional repression mechanism already discussed, attenuation also controls expression of the trp operon in E. coli (Fig. 13.11). The trp operon regulatory region contains a leader sequence called trpL between the operator and the first structural gene, which has four stretches of RNA that can base pair with each other in different combinations. When a terminator stem-loop forms, transcription terminates, releasing RNA polymerase from the mRNA. However, when an antiterminator stem-loop forms, this prevents the formation of the terminator stem-loop, so RNA polymerase can transcribe the structural genes.

Figure 13.11. Attenuation of Transcription and Translation. When tryptophan is plentiful, translation of the short leader peptide encoded by trpL proceeds, the terminator loop between regions 3 and 4 forms, and transcription terminates. When tryptophan levels are depleted, translation of the short leader peptide stalls at region 1, allowing regions 2 and 3 to form an antiterminator loop, and RNA polymerase can transcribe the structural genes of the trp operon.

A related mechanism of concurrent regulation of transcription and translation in prokaryotes is the use of a riboswitch, a small region of noncoding RNA found within the 5’ end of some prokaryotic mRNA molecules (Figure 13.12). A riboswitch may bind to a small intracellular molecule to stabilize certain secondary structures of the mRNA molecule. The binding of the small molecule determines which stem-loop structure forms, thus influencing the completion of mRNA synthesis and protein synthesis.

Figure 13.12. Riboswitch Form and Function. Riboswitches found within prokaryotic mRNA molecules can bind to small intracellular molecules, stabilizing certain RNA structures, influencing either the completion of the synthesis of the mRNA molecule itself (left) or the protein made using that mRNA (right).

13.2 Eukaryotic Gene Regulation

As seen in Chapter 10, the initiation of transcription requires the assembly of a multitude of transcription factors (TF) localized at the promoter region. Transcription can also utilize far reaching interactions of enhancers, that bind at a distant DNA site and loop back around to stabilize the RNA polymerase at the promoter. Control of transcriptional initiation is dependent on TF factor activation, TF binding with specific DNA recognition sequences, and chromatin remodeling.

Transcription Factor (TF) Activation

Many TF are expressed within cells and held in an inactive conformation until the right environmental stimulus is present within the cell. Cellular signaling pathways can cause post-translational protein modifications leading to TF activation or small molecules may physically bind and allosterically modify the protein structure to mediate activation. Here we will use examples from the cell cycle signaling cascade and steroid hormone receptor pathways to highlight some mechanisms of TF activation. A key element to take away from this section is that transcription factor activation is often highly pleiotropic and has many cellular affects. Depending on the cell type and the environmental conditions, different combinations of downstream target genes may be activated or inactivated. Teasing apart these intricacies and the physiological effects that they have within an organism is a major goal of ongoing research.

Cell Cycle Regulation by p53

p53 is one of the most studied proteins in science. To date, over 68,000 papers appear in PubMed containing p53 or TP53 in the title and/or abstract. Originally described as an oncogene (since a mutated, functionally altered form of the protein was first characterized), p53 is now recognized as the most frequently inactivated tumor suppressors in human cancers. It is a transcription factor that controls the expression of genes and miRNAs affecting many important cellular processes including proliferation, DNA repair, programmed cell death (apoptosis), autophagy, metabolism, and cell migration (Fig. 13.13). Many of those processes are critical to a variety of human pathologies and conditions extending beyond cancer, including ischemia, neurodegenerative diseases, stem cell renewal, aging, and fertility. Notably, p53 also has non-transcriptional functions, ranging from intrinsic nuclease activity to activation of mitochondrial Bak (Bcl-2 homologous antagonist killer) and caspase-independent apoptosis.

As a transcription factor, p53 responds to various genotoxic insults and cellular stresses (e.g., DNA damage or oncogene activation) by inducing or repressing the expression of over a hundred different genes. p53 transcriptional regulation plays a dominant role in causing the arrest of damaged cells, facilitating their repair and survival, or inducing cell death when DNA is damaged irreparably. p53 can also cause cells to become permanently growth arrested, and there is compelling in vivo evidence that these “senescent” cells secrete factors that enhance their clearance by the immune system, leading to tumor regression. Through these mechanisms, p53 helps maintain genomic stability within an organism, justifying its long-held nickname “guardian of the genome”. Other p53 gene targets are involved in inhibiting tumor cell angiogenesis, migration, metastasis and other important processes (such as metabolic reprogramming) that normally promote tumor formation and progression

Figure 13.13. Cellular stress leads to p53 transcriptional activation of downstream targets. Normally, p53 levels are kept low by its major antagonist, Mdm2, an E3 ubiquitin ligase that is itself a transcriptional target of p53. Stress signals, such as DNA damage, oncogene activation and hypoxia, promote p53 stability and activity by inducing post-translational modifications (PTMs) and tetramerization of p53. p53 functions as a transcription factor that binds to specific p53 response elements upstream of its target genes. p53 affects many important cellular processes linked to tumor suppression, including the induction (green) of senescence, apoptosis, and DNA repair as well as inhibition (red) of metabolism, angiogenesis, and cell migration. These functions are largely mediated through transcriptional regulation of its targets (examples given).

p53 protein function is regulated post-translationally by coordinated interaction with signaling proteins including protein kinases, acetyltransferases, methyl-transferses, and ubiquitin-like modifying enzymes (Figure 13.14). The majority of the sites of covalent modification occur at intrinsically unstructured linear peptide docking motifs that flank the DNA-binding domain of p53 which play a role in anchoring or in allosterically activating the enzymes that mediate covalent modification of p53. In undamaged cells, p53 protein has a relatively short half-life and is degraded by a ubiquitin-proteasome dependent pathway through the action of E3 ubiquitin ligases, such as MDM2 (Fig 13.13). Following stress, p53 is phosphorylated at multiple residues, thereby modifying its biochemical functions required for increased activity as a transcription factor. Post-translational modifications help to stabilize the tetramer formation of the protein and enhance the translocation of the protein from the cytoplasm into the nucleus. The tetrameric form of p53 is then functional to bind to DNA in a sequence-specific manner and either activate or repress transcription, depending on the target sequence. Some post-translational modifications, such as acetylation, are DNA-dependent and can play a role in chromatin remodeling and activation of p53 target gene expression.

Figure 13.14 Sites of Post-Translational Modification on p53. Schematic representation of the 393 amino acid domain structure of human p53 showing the sites of post-translational modification including phosphorylation, acetylation, ubiquitination, methylation, neddylation, and sumoylation. Abbreviations: N-terminal transactivation domain (TAD) proline-rich domain (PRD) tetramerisation domain (TET) C-terminal regulatory domain (REG) arginine (R) lysine (K) serine (S) threonine (T).

It should be noted that single point mutations that modify the ability of the protein to be phosphorylated in one position, typically do not show a decrease in the stabilization or activation of the protein following a damage or stress event. Thus, multiple modifications likely allow for redundancy within this pathway and ensure the activation of the protein following a stress event. Furthermore, the environment within the cell can lead to different p53 phenotypes, such as the activation of growth arrest and DNA repair processes (ie if there is not a lot of damage) or it can lead to the activation of apoptosis or programmed cell death pathways (ie if damage is too extensive to be repaired).

Steroid Hormone Receptors

Steroid hormone receptors (SHRs) belong to the superfamily of nuclear receptors (NRs),which are one of the essential classes of transcriptional factors. NRs play a critical role in all aspects of human development, metabolism and physiology. Since they generally act as ligand-activated transcription factors, they are an essential component of cell signaling. NRs form an ancient and conserved family that arose early in the metazoan lineage. NR molecular evolution is characterized by major events of gene duplication and gene losses. Phylogenetic analysis revealed a distinct separation of NR ligand binding domains (LBDs) into 4 monophyletic branches, the steroid hormone receptor-like cluster, the thyroid hormone-like receptors cluster, the retinoid X-like and steroidogenic factor-like receptor cluster and the nerve growth factor-like/HNF4 receptor cluster (Fig. 13.15).

Figure 13.15 Phylogenetic tree of the nuclear receptors’ ligand binding domain. Four distinct monophyletic branches are visible. Those monophyletic branches are divided into subcategories. The phylogenetic trees confidently separate the steroid hormone-like (branch colored green), the retinoid X-like and steroidogenic factor-like receptors cluster (branch colored orange), the thyroid hormone-like receptors cluster (branch colored blue) and the nerve growth factor-like/hepatocyte nuclear factor-4 receptors cluster (branch colored yellow).

Here we will focus on the Steroid Hormone-Like Receptors branch (SHRs). SHRs plays a key role in many important physiological processes like organ development, metabolite homeostasis, and response to external stimuli. The estrogen receptor comes in two major forms, ERα and ERβ. Other members of this subgroup include the cortisol binding glucocorticoid receptor (GR), the aldosterone binding mineralocorticoid receptor (MR), the progesterone receptor (PR), and the dihydrotestosterone (DHT) binding androgen receptor (AR) (Fig. 13.16).

Figure 13.16 Overview of Steroid Hormone Receptor Family (SHR). A. Phylogenetic tree of the Steroid Hormone Receptor (SHR) family showing the evolutionary interrelationships and distance between the various receptors. Based on alignments available at The NucleaRDB [Horn et al., 2001]. B. All steroid receptors are composed of a variable N-terminal domain (A/B) containing the AF-1 transactivation region, a highly conserved DNA Binding Domain (DBD), a flexible hinge region (D), and a C-terminal Ligand Binding Domain (LBD, E) containing the AF-2 transactivation region. The estrogen receptor α is unique in that it contains an additional C-terminal F domain. Numbers represent the length of the receptor in amino acids.

The members of the Steroid Hormone Receptor family share a similar, modular architecture, consisting of a number of independent functional domains (Fig. 13.16B). Most conserved is the centrally located DNA binding domain (DBD) containing the characteristic zinc-finger motifs. The DBD is followed by a flexible hinge region and a moderately conserved Ligand Binding Domain (LBD), located at the carboxy-terminal end of the receptor. The estrogen receptor α is unique in that it contains an additional F domain of which the exact function is unclear. The LBD is composed of twelve α-helices (H1-H12) that together fold into a canonical α-helical sandwich. Besides its ligand binding capability, the LBD also plays an important role in nuclear translocation, chaperone binding, receptor dimerization, and coregulator recruitment through its potent ligand-dependent transactivation domain, referred to as AF-2. A second, ligand independent, transactivation domain is located in the more variable N-terminal part of the receptor, designated as AF-1. To date, no crystal structure of a full-length SHR exists, though structures of the DBD and LBD regions of most SHRs are available. These have helped significantly in understanding the molecular aspects of DNA and ligand binding, but have to some extent also led to biased attention to these parts of the receptor only. For example, many coregulator interaction studies are still performed with the LBD only, while numerous studies have demonstrated that the AF-2 domain often tells only part of the story. With the help of biophysical techniques, however, it is feasible to study the full-length receptor in its native environment (Figure 13.16).

Most SHRs remain in the cytoplasm of the cell until they are bound with the appropriate steroid (Fig 13.17). Steroid binding causes the dimerization of SHRs and localization to the cell nucleus, where the SHRs interact with the DNA at sequence specific motifs known as Hormone Response Elements (HREs) (Fig. 13.17, Step 5). Many SHRs can also interact with membrane-bound receptors and affect cellular signaling pathways, in addition to the activation of gene expression (Fig. 13.17, step 6).

Figure 13.17 Steroid Hormone Receptors (SHR) act as hormone dependent nuclear transcription factors. Upon entering the cell by passive diffusion, the hormone (H) binds the receptor, which is subsequently released from heat shock proteins, and translocates to the nucleus. There, the receptor dimerizes, binds specific sequences in the DNA, called Hormone Responsive Elements or HREs, and recruits a number of coregulators that facilitate gene transcription.

Steroid Hormones, such as the estrogens, reach their target cells via the blood, where they are bound to carrier proteins. Naturally occurring estrogens include estradiol, estrone, estriol, and estretrol and differ primarily in structure on the presence of hydroxyl-groups (Fig. 13.18). Estradiol is the predominant estrogen during reproductive years both in terms of absolute serum levels as well as in terms of estrogenic activity. During menopause, estrone is the predominant circulating estrogen and during pregnancy estriol is the predominant circulating estrogen in terms of serum levels. Another type of estrogen called estetrol (E4) is produced also produced predominantly during pregnancy (Fig 13.18). Estrogens function in many physiological processes, including the regulation of the menstrual cycle and reproduction, maintaining bone density, brain function, cholesterol mobilization, maturation of reproductive organs during development, and they play a role in controlling inflammation.

Figure 13.18 Naturally Occurring Estrogens.

Because of their lipophilic nature it is thought that steroid hormones, such as estrogen, pass the cell membrane by simple diffusion, although some evidence exists that they can also be actively taken up by endocytosis of carrier protein bound hormones. For a long time it has been assumed that binding of the ligand resulted in a simple on/off switch of the receptor (Fig. 13.17, step 1). While this is likely the case for typical agonists like estrogen and progesterone, this is not always correct for receptor antagonists, used in drug therapy. These antagonists come in two kinds, so-called partial antagonists (for the estrogen receptors known as SERMs for Selective Estrogen Receptor Modulators) and full antagonists. The partial antagonist can, depending on cell type, act as a SHR agonist or antagonist. In contrast, full antagonists (for ER known as SERDs for Selective Estrogen Receptor Downregulators) always inhibit the receptor, independent of cell type, in part by targeting the receptor for degradation. Binding of either type of antagonist results in major conformational changes within the LBD and in release from heat shock proteins that thus far had protected the unliganded receptor from unfolding and aggregation (Fig. 13.17 step 2).

Trancription Factor (TF) Recognition and Binding to DNA

TF control gene expression by binding to their target DNA site to recruit, or block, the transcription machinery onto the promoter region of the gene of interest. Their function relies on the ability to find their target site quickly and selectively. In living cells TFs are present in nM concentrations and bind the target site with comparable affinity, but they also bind any DNA sequence (nonspecific binding), resulting in millions of low affinity (i.e., >10 −6 M) competing sites. Nonspecific binding facilitates the search for the target site by three major mechanisms (Fig. 13.19). One of the main scenarios involves a ‘sliding’ mechanism, in which the protein moves from its initial non-specific site to its actual target site by sliding along the DNA (also known as 1-dimensional (1D) sliding) (Fig. 13.19). When the TF starts to move and shift counterions from the phosphate backbone, the same number of counterions binds to the site left free by the protein. The sliding rate is also dependent on the hydrodynamic radius of the protein the required rotational movement over the DNA backbone is greater for larger proteins, that tend to slide slowly. The second scenario is a ‘hopping’ mechanism, in which a TF might hop from one site to another in 3D space by dissociating from its original site and subsequently binding to the new site. This may happen within the same chain and re-association occurs adjacent to the former dissociated site. A third search mechanism is described as ‘intersegmental transfer’. In this scenario, the protein moves between two sites via an intermediate ‘loop’ formed by the DNA and subsequently bind at two different DNA sites. This mechanism is applicable to TFs with two DNA-binding sites. Proteins with two DNA-binding sites can occasionally bind non-specifically to two locations situated far apart within the DNA strand, that are brought into close contact through the formation of these loops. Such TFs transfer across a point of close contact without dissociating from the DNA.

Figure 13.19 Protein-DNA recognition mechanisms. The main three protein-DNA recognition mechanisms are shown. When the transcription factor (pink ring) moves from one site to another by means of sliding along the DNA and is transferred from one base pair to another without dissociating from the DNA, this mechanism is called sliding (top). Hopping occurs when the transcription factor moves on the DNA by dissociating from one site and re-associating with another site (center). Intersegmental transfer describes the mechanism by which the transcription factor gets transferred through DNA bending or the formation of a DNA loop, resulting in the protein being bound transiently to both sides and subsequently moving from on site to the other (bottom).

Each eukaryotic TF controls tens to hundreds of genes scattered throughout the genome, and expressing each gene needs various TFs simultaneously binding to their sites to form the transcription complex, an extremely rare event in probabilistic terms. As result, the in vivo site occupancy patterns of eukaryotic TFs are more complex than predicted by their in vitro site-specific binding profiles and do not strongly correlate with the actual levels of gene expression. An interesting feature highlighted by genome analysis is an accumulation of potential TF binding sites in regions flanking eukaryotic genes. Such clusters of degenerate recognition sites are assumed to be key for transcription control, and thus are generally classified as gene regulatory regions (RR). For example, the affinity of the Drosophila TF Engrailed to the RRs of its target genes is strongly amplified by long tracts of degenerate consensus repeats that are present in such regions.

Histone Modification and Chromatin Remodeling

Regulation of transcription involves dynamic rearrangements of chromatin structure. Recall that eukaryotic DNA is complexed with histone octamers, which are composed of dimers of the core histones H2A, H2B, H3 and H4. 147 bp of DNA are wrapped 1.65 times around each octamer forming nucleosomes, the basic packaging units of chromatin. Nucleosomes, connected by linker DNA of variable length as “beads on a string”, generate the 11 nm linear structure. The linker histone H1 is positioned at the top of the core histone octamer and enables higher organized compaction of DNA into transcriptionally inactive 30 nm fibres.

To understand the role of chromatin for regulation of transcription it is important to know where nucleosomes are positioned and how positioning is achieved. Basically there are four groups of activities which change chromatin structure during transcription: (1) histone modifications, (2) eviction and repositioning of histones, (3) chromatin remodeling and (4) histone variant exchange. Histone modifiers introduce post-translational, covalent modifications to histone tails and thereby change the contact between DNA and histones. These modifications govern access of regulatory factors. Histone chaperones aid eviction and positioning of histones. A third class of chromatin restructuring factors are ATP dependent chromatin remodelers. These multi-subunit complexes utilize energy from ATP hydrolysis for various chromatin remodeling activities including nucleosome sliding, nucleosome displacement and the incorporation and exchange of histone variants.

Post-translational modifications (PTMs) of histone proteins is a primary mechanism that controls chromatin architencture. Over 20 distinct types of histone PTMs have been described, among which the most abundant ones are acetylation and methylation of lysine residues. Histone PTMs can be deposited on and removed from chromatin by different enzymes, known as histone PTM ‘writers’ and ‘erasers’. Histone PTMs exert their regulatory effects via two main mechanisms. First, histone PTMs serve as docking sites for various nuclear proteins––histone PTM ‘readers’––that specifically recognize modified histone residues through their modification-binding domains. Recruitment of these proteins at specific genomic loci promotes key chromatin processes, such as transcriptional regulation and DNA damage repair. Second, some histone PTMs, such as acetylation, directly affect chromatin higher-order structure and compaction, thereby controlling chromatin accessibility to protein machineries such as those involved in transcriptiion. Chromatin may adopt one of two major states in an interchangeable manner. These states are heterochromatin and euchromatin. Heterochromatin is a compact form that is resistant to the binding of various proteins, such as transcriptional machinery. In contrast, euchromatin is a relaxed form of chromatin that is open to modifications and transcriptional processes (Fig. 13.20). Histone methylation promotes the formation of Heterochromatin whereas, histone acetylation promotes euchromatin.

Figure 13.20 Schematic drawing of histone methylation and acetylation in relation to chromatin remodeling. Addition of methyl groups to the tails of histone core proteins leads to histone methylation, which in turn leads to the adoption of a condensed state of chromatin called ‘heterochromatin.’ Heterochromatin blocks transcription machinery from binding to DNA and results in transcriptional repression. The addition of acetyl groups to lysine residues in the N-terminal tails of histones causes histone acetylation, which leads to the adoption of a relaxed state of chromatin called ‘euchromatin.’ In this state, transcription factors and other proteins can bind to their DNA binding sites and proceed with active transcription.

Chromatin remodeling can also be an ATP-dependent process and involve histone dimer ejection, full nucleosome ejection, nucleosome sliding, and histone variant exchange (Fig 13.21). ATP-dependent chr omatin remodeling complexes bind to nucleosome cores and the surrounding DNA, and, using energy from A TP hydrolysis, they disrupt the DNA-histone interactions, slide or eject nucleosomes, alter nucleosome structures, and modulate the access of transcription factors to the DNA (Figure 13.21 ). In addition to modulating gene expression, some of the complexes are involved in nucleosome assembly and organization, following transcription at locations in which nucleosomes have been ejected, packing of DNA, following replication and DNA repair.

Figure 13.21 Overview of the functions of ATP-dependent chromatin remodeling complexes. (a) A subset of ISWI and CHD complexes are involved in nucleosome assembly, maturation, and spacing. (b) SWI/SNF complexes are primarily involved in histone dimer ejection, nucleosome ejection, and nucleosome repositioning through sliding, thus modulating chromatin access. (c) INO80 complexes are involved in histone exchange. It should be noted that the complexes might be involved in other chromatin remodeling functions.

Another level of chromatin regulation is accomplished by a dynamic exchange of canonical histones with specific histone variants. Histone variants are non-allelic isoforms of canonical histones that differ in their primary sequence and functional properties. For example, the histone variant H3.3 has been found to progressively accumulate in various mouse somatic tissues with age, resulting in near complete replacement of the canonical H3.1/2iso-forms by the age of 18 months. Deletion of H3.3 in mice is lethal and in the fruit fly, Drosophila, causes sterility. Within the nematode, C. elegans, loss of H3.3 exhibit a significant ‘bagging’ phenotype which involves eggs hatching inside the animal body. Furthermore, in organisms that had deficient insulin signaling, loss of H3.3 caused a reduction in lifespan (although this phenotype is not observed in animals with a wildtype insulin signaling pathway) (Fig. 13.22). H3.3 also appears to acculumate with age in humans, and its accumulation is often absent in tumor cells. Overall, histone variant replacement is associated with changes in post translational modifications (such as methylation), and has multiple effects on overall chromosome structure.

Figure 13.22 The Effects of Histone Variant H3.3 on C. elegans Lifespan. H3.3 expression increases over time in C. elegans during their normal lifespan. In organisms with impaired Inulin/IGF-1 signaling, germline deficiency of H3.3 resulted in significant decreases in lifespan.

13.3 Protein-DNA Interactions

Proteins use a wide range of DNA-binding structural motifs, such as homeodomain (HD), helix-turn-helix (HTH), and high-mobility group box (HMG) to recognize DNA. HTH is the most common binding motif and can be found in several repressor and activator proteins (Fig. 13.23). Despite their structural diversity, these domains participate in a variety of functions that include acting as substrate interaction mediators, enzymes to operate DNA, and transcriptional regulators. Several proteins also contain flexible segments outside the DNA-binding domain to facilitate specific and non-specific interactions. For example, many HD proteins use N-terminal arms and a linker region to interact with DNA. The Encyclopedia of DNA Elements (ENCODE) data suggest that about 99.8% of putative binding motifs of TFs are not bound by their respective TFs in the genome. It is, therefore, clear that the presence of a single binding motif per TF is not adequate for TF binding.

Figure 13.23 Representative figures of the transcription factor binding domains. The figure shows the crystal structures of different types of TF domains (3l1p, 4m9e, 5d5v, 1lbg, 1gt0, and 1nkp). The structures were obtained from the Protein Data Bank (PDB) and redrawn using chimera. The respective domains and important regions have been labeled. HTH stands for helix-turn-helix domain. bHLH stands for basic helix-loop-helix motif. HD and HMG stand for homeodomain and high-mobility group box domain, respectively.

Most of the searching mechanism studies that try to determine how TFs find their binding sites are limited to naked DNA-protein complexes, which do not reflect the actual crowded environment of a cell. Studies with naked DNA and transcription factors have shown that many DNA-binding proteins travel a long distance by 1D diffusion. However, the search process for eukaryotes must occur in the presence of chromatin, which has the ability to hinder protein mobility. In this case, the protein must dissociate from the DNA, enter a 3D mode of diffusion state, and continue the target site searching process.

The sliding and intersegmental transfer mechanisms can be explained through the example of the lac repressor. The lac repressor contains 4 identical monomers (a dimer of dimers) for its DNA-binding. The binding sequence of these dimers is symmetric or pseudo-symmetric, and each half is identified by these identical monomers. The HTH domain of the lac repressor is the DNA-binding domain that facilitates the interaction with its target site on DNA (Fig. 13.24). As a result of a rapid search (sliding) along the DNA molecule and intersegmental transfer between distant DNA sequences, the lactose repressor finds its target sites faster than the diffusion limit. The section comprised between residues 1–46 of the HTH protein domain, characterized by three α-helices, maintains its secondary structure through specific and non-specific binding (Fig 13.24). When the repressor binds to a non-specific site, the HTH domain interacts with the DNA backbone and maintains the interaction with its helix region in the major groove juxtaposition. This arrangement facilitates the interaction of the recognition helix with the edges of the DNA bases, enabling the repressor to walk or search for its specific site on the DNA. The C-terminal residues of the DNA-binding domain, residues 47–62, form the hinge region, and are normally disordered during non-specific recognition however, during specific site recognition, residues 50–58 acquire an α-helix configuration (hinge helix) (Fig. 13.24). The disordered hinge region and the flexibility of the HTH domain allow the protein to move freely along the DNA to search for its target site. In specific binding complexes, the hinge helix of each monomer is located at the symmetrical center of the binding site, thereby causing the hinge helices to interact with each other (intersegmental transfer) to allow better stability. Moreover, DNA bends at the symmetrical center of the specific binding site (37° angle), thereby supporting monomer-monomer interactions (Fig 13.24).

Figure 13.24. The Helix-Turn-Helix Motif of the Lac Repressor. Lac repressor binds to DNA non-specifically, enabling it to slide rapidly along the DNA double helix until it encounters the lac operator sequence. The DNA-binding domain employs a helix-turn-helix (HTH) motif ( Alpha Helices , Turns ). During non-specific binding, the hinge region is disordered. The DNA double helix is depicted as straight in the model when the Lac Repressor binds non-specifically. Upon recognizing the specific operator sequence, the non-specific binding converts to specific binding . During this conversion, the hinge region changes from disordered loops to Alpha Helices , which bind to the minor groove of the DNA. As explained below, this binding stabilizes a kinked (“bent”) DNA double helix conformation.

In addition to the helix-turn-helix structure, the zinc finger motif is also very common, especially in eukaryotic TFs (Fig. 13.25). Proteins that contain zinc fingers (zinc finger proteins) are classified into several different structural families. Unlike many other clearly defined supersecondary structures such as Greek keys or β hairpins, there are a number of types of zinc fingers, each with a unique three-dimensional architecture. A particular zinc finger protein’s class is determined by this three-dimensional structure, but it can also be recognized based on the primary structure of the protein or the identity of the ligands coordinating the zinc ion. In spite of the large variety of these proteins, however, the vast majority typically function as interaction modules that bind DNA, RNA, proteins, or other small, useful molecules, and variations in structure serve primarily to alter the binding specificity of a particular protein. The most common type of zinc finger motif utilizes two Cys and two His residues (CCHH) coordinating the Zn(II) ion to adopt a ββα fold with three hydrophobic residues responsible for the formation of a small hydrophobic core which offers additional stabilization of the zinc finger domain (Fig. 13.25).

Figure 13.25 Sequence alignments of the CCHH zinc fingers and a representative structure. (a) Alignment of the TFIIIA-like zinc finger domains from different organisms. Green color denotes residues that are responsible for the hydrophobic core formation in most CCHH zinc fingers (L17, F11 and L2). Yellow and blue indicate the coordinating Cys and His residues, respectively. (b) The 3D NMR structure of 15-th ZF from zinc finger protein 478 [PDB: 2YRH].

Overall, zinc finger motifs display considerable versatility in binding modes, even between members of the same class (e.g., some bind DNA, others protein), suggesting that they are stable scaffolds that have evolved specialised functions. For example, zinc finger-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organization, epithelial development, cell adhesion, protein folding, chromatin remodeling, and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.

The last binding domain that we will consider in detail here is the helix-loop-helix domains found in Leucine zipper-containing proteins. Specifically, bZIPs (Basic-region leucine zippers) are a class of eukaryotic transcription factors. The bZIP domain is 60 to 80 amino acids in length with a highly conserved DNA binding basic region and a more diversified leucine zipper dimerization region. The two regions form α-helical structures that are connected together via a looped region. This forms a core helix-loop-helix (HLH) structure within each monomer of the protein. Two monomers then join through the fomation of a leucine zipper junction forming a heterodimeric protein structure. The resulting heterodimer can bind with DNA in a sequence-specific manner through the basic α-helices (Fig. 13.26).

Specifically, basic residues, such as lysines and arginines, interact in the major groove of the DNA, forming sequence-specific interactions (Fig 13.26). Most bZIP proteins show high binding affinity for the ACGT motifs. The bZIP heterodimers exist in a variety of eukaryotes and are more common in organisms with higher evolution complexity .

Figure 13.26 Leucine Zipper Transcription Factors from the bZIP family. The monomer subunits of a heterodimeric bZIP protien contain a Helix-loop-Helix (HLH) core structure, where one helix forms the leucine zipper with the other monomer, and the basic helices of each monomer interact with the major groove of the target DNA. The helices are held together by a flexible loop region. (One monomer is shown in blue and one monomer is shown in green).

13.4 Epigenetics and Transgenerational Inheritence

Even though all somatic cells of a multicellular organism have the same genome, different cell types have different transcriptomes (set of all expressed RNA molecules), different proteomes (set of all proteins) and, hence, different functions. Cell differentiation during embryonic development requires the activation and repression of specific sets of genes by the action of cell lineage defining transcription factors. Within a cell lineage, gene activity states are often maintained over several rounds of cell divisions (a phenomenon called “cellular memory” or “cellular inheritance”). Since the rediscovery of epigenetics some 30 years ago (it was originally proposed by Conrad Hal Waddington in the early 1940s), cellular inheritance has been attributed to gene regulatory feedback loops, chromatin modifications (DNA methylation and histone modifications) as well as long-lived non-coding RNA molecules, which collectively are called the “epigenome”. Among the different chromatin modifications, DNA methylation and polycomb-mediated silencing are probably the most stable ones and endow genomes with the ability to impose silencing of transcription of specific sequences even in the presence of all of the factors required for their expression.

Defining Transgenerational Epigenetic Inheritance

The metastability of the epigenome explains why development is both plastic and canalized, as originally proposed by Waddington. Although epigenetics deals only with the cellular inheritance of chromatin and gene expression states, it has been proposed that epigenetic features could also be transmitted through the germline and persist in subsequent generations. The widespread interest in “transgenerational epigenetic inheritance” is nourished by the hope that epigenetic mechanisms might provide a basis for the inheritance of acquired traits. Yes, Lamarck has never been dead and every so often raises his head, this time with the help of epigenetics.

Although acquired traits concerning body or brain functions can be written down in the epigenome of a cell, they cannot easily be transmitted from one generation to the next. For this to occur, these epigenetic changes would have to manifest in the germ cells as well, which in mammals are separated from somatic cells by the so-called Weismann barrier. Further, the chromatin is extensively reshaped during germ cell differentiation as well as during the development of totipotent cells after fertilization, even though some loci appear to escape epigenetic reprogramming in the germline . Long-lived RNA molecules appear to be less affected by these barriers and therefore more likely to carry epigenetic information across generations , although the mechanisms are largely unsolved.

Evidence for Transgenerational Epigenetic Inheritance

In the past 10 years, numerous reports on transgenerational responses to environmental or metabolic factors in mice and rats have been published. The factors include endocrine disruptors, high fat diet, obesity, diabetes, undernourishment as well as trauma. These studies investigated DNA methylation, sperm RNA or both. For example, when male mice are made prediabetic by treatment with streptozotocin it affects the DNA methylation patterns in their resulting sperm, as well as the pancreatic islets of F1 and F2 of the resulting offspring. Furthermore, studies have shown that traumatic stress in early life altered behavioral and metabolic processes in the progeny and that injection of sperm RNAs from traumatized males into fertilized wild-type oocytes reproduced the alterations in the resulting offspring.

In humans, epidemiological studies have linked food supply in the grandparental generation to health outcomes in the grandchildren. An indirect study based on DNA methylation and polymorphism analyses has suggested that sporadic imprinting defects in Prader–Willi syndrome are due to the inheritance of a grandmaternal methylation imprint through the male germline. Because of the uniqueness of these human cohorts these findings still await independent replication. Most cases of segregation of abnormal DNA methylation patterns in families with rare diseases, however, turned out to be caused by an underlying genetic variant. Thus, it is important that studies of this nature rule out the effects of traditional genetic inheritence as being a factor of the observed phenotypes.

Genetic inheritance alone cannot fully explain why we resemble our parents. In addition to genes, we inherited from our parents the environment and culture, which in parts have been constructed by the previous generations (Fig. 13.27). A specific form of the environment is our mother’s womb, to which we were exposed during the first 9 months of our life. The maternal environment can have long-lasting effects on our health. In the Dutch hunger winter, for example, severe undernourishment affected pregnant women, their unborn offspring and the offspring’s fetal germ cells. The increased incidence of cardiovascular and metabolic disease observed in F1 adults, is not due to the transmission of epigenetic information through the maternal germline, but a direct consequence of the exposure in utero, a phenomenon called “fetal programming” or—if fetal germ cells and F2 offspring are affected—“intergenerational inheritance”.

Figure 13.27. Transgenerational inheritance systems. a Offspring inherit from their parents genes (black), the environment (green) and culture (blue). Genes and the environment affect the epigenome (magenta) and the phenotype 22 . Culture also affects the phenotype, but at present there is no evidence for a direct effect of culture on the epigenome (broken blue lines). It is a matter of debate, how much epigenetic information is inherited through the germline (broken magenta lines). G genetic variant, E epigenetic variant. b An epimutation (promoter methylation and silencing of gene B in this example) often results from aberrant read-through transcription from a mutant neighboring gene, either in sense orientation as shown here or in antisense orientation. The presence of such a secondary epimutation in several generations of a family mimics transgenerational epigenetic inheritance, although it in fact represents genetic inheritance. Black arrow, transcription black vertical bar, transcription termination signal broken arrow, read-through transcription

Epigenetics and transcription regulation during eukaryotic diversification: the saga of TFIID

The basal transcription factor TFIID is central for RNA polymerase II-dependent transcription. Human TFIID is endowed with chromatin reader and DNA-binding domains and protein interaction surfaces. Fourteen TFIID TATA-binding protein (TBP)-associated factor (TAF) subunits assemble into the holocomplex, which shares subunits with the Spt-Ada-Gcn5-acetyltransferase (SAGA) coactivator. Here, we discuss the structural and functional evolution of TFIID and its divergence from SAGA. Our orthologous tree and domain analyses reveal dynamic gains and losses of epigenetic readers, plant-specific functions of TAF1 and TAF4, the HEAT2-like repeat in TAF2, and, importantly, the pre-LECA origin of TFIID and SAGA. TFIID evolution exemplifies the dynamic plasticity in transcription complexes in the eukaryotic lineage.

Keywords: SAGA TFIID basal transcription phylogenetic analyses.

© 2019 Antonova et al. Published by Cold Spring Harbor Laboratory Press.


Structural variation between human (h)…

Structural variation between human (h) and yeast (y) TFIID and SAGA complexes. Shared…

Inferred evolutionary history of TAF1…

Inferred evolutionary history of TAF1 and TAF2. ( A ) TAF1 is duplicated…

Inferred evolutionary history of TAF3,…

Inferred evolutionary history of TAF3, TAF8, and SPT7. ( A ) TAF3 arises…

Evolutionary history of the relative…

Evolutionary history of the relative invariable TFIID subunits. ( A ) TAF5 duplicated…

Inferred evolutionary history of TAF4/Ada1…

Inferred evolutionary history of TAF4/Ada1 and the TAF12 HF partner. ( A )…

Inferred evolutionary history of TAF11/TAF13/SPT3.…

Inferred evolutionary history of TAF11/TAF13/SPT3. ( A ) SPT3 is the ancestral protein…

Model of TFIID and SAGA evolutionary divergence from pre-LECA until fungal and metazoan…

Interplay between genome organization and epigenomic alterations of pericentromeric DNA in cancer

In eukaryotic genome biology, the genomic organization inside the three-dimensional (3D) nucleus is highly complex, and whether this organization governs gene expression is poorly understood. Nuclear lamina (NL) is a filamentous meshwork of proteins present at the lining of inner nuclear membrane that serves as an anchoring platform for genome organization. Large chromatin domains termed as lamina-associated domains (LADs), play a major role in silencing genes at the nuclear periphery. The interaction of the NL and genome is dynamic and stochastic. Furthermore, many genes change their positions during developmental processes or under disease conditions such as cancer to activate certain sorts of genes and/or silence others. Pericentromeric heterochromatin (PCH) is mostly in the silenced region within the genome, which localizes at the nuclear periphery. Studies show that several genes located at the PCH are aberrantly expressed in cancer. The interesting question is that despite being localized in the pericentromeric region, how these genes still manage to overcome pericentromeric repression. Although epigenetic mechanisms control the expression of the pericentromeric region, recent studies about genome organization and genome-nuclear lamina interaction have shed light on a new aspect of pericentromeric gene regulation through a complex and coordinated interplay between epigenomic remodeling and genomic organization in cancer.

Keywords: Cancer Epigenetics Gene regulation Genome organization Heterochromatin LADs Pericentromere.

Copyright © 2021 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. Published by Elsevier Ltd. All rights reserved.


In osteosarcoma and other solid tumors with high rates of metastases, therapeutic targets that may most improve patient outcomes have been recognized to be those that target metastatic progression and, as such, may not have substantial activity on measurable primary tumors. Furthermore, the fact that osteosarcoma lesions are associated with a rich bone stroma that may not immediately regress concurrent with a tumor response to therapy complicates the conventional use of tumor response to identify agents that may be active in osteosarcoma. For both of these reasons, drugs with potential clinical efficacy may reasonably fail to show activity in standard phase II clinical trials, which rely on shrinkage of the primary tumor as the key metric of therapeutic response. In light of this, the osteosarcoma drug development community has recently outlined the types of preclinical data that should be prioritized as a novel therapeutic agent is considered for inclusion in the treatment of patients with osteosarcoma as a means to prevent metastatic progression, realizing that response data in human patients may not be available. 206 To help assess the potential clinical utility of novel therapeutic agents, an important model/tool is provided by pet dogs that develop osteosarcoma. Indeed, studies of dogs with osteosarcoma are now underway to best define the activity of agents with the greatest promise to improve outcomes for patients. An additional resource available to the osteosarcoma preclinical/clinical research community is the data generated by the Pediatric Preclinical Testing Program, a program to systematically evaluate new agents against childhood leukemia and solid-tumor models (including osteosarcoma). Pediatric Preclinical Testing Program data are publicly available online (

Table 1 presents a list of therapeutic agents that may be reasonably considered to improve treatment outcomes for patients with osteosarcoma. These agents were selected for inclusion based on their specificity for targeting the genetic and epigenetic alterations identified in osteosarcoma and presented in this article, for targeting other key osteosarcoma pathways, or for their promise in preclinical and clinical studies. For each agent listed, a subjective measure of the strength of evidence (based on our assessment) is included.


Candidate Osteosarcoma Therapeutic Agents

AgentTarget(s)Mechanism of
Strength of
Evidence *
and small-molecule
agent Fas
upregulates Fas
Inhibited metastasis in
osteosarcoma xenograft
models 207 effect
abolished in
FasL-deficient mice 157
inhibitor of
Evidence for
dysregulation of
p53/Mdm2 in most
osteosarcoma (see text)
inhibited osteosarcoma
tumor growth in
xenograft models 208
Met inhibitor
Evidence for
overexpression in
osteosarcoma tissues
overexpression linked to
metastasis biology
reduced primary tumor
growth and metastasis in
xenograft models 209
interaction inhibitors,
specific kinase
Expression associated
with a less favorable
outcome 210 knockdown
inhibited metastasis in
xenograft models 210
small-molecule inhibitor
reduced invasive
phenotype in vitro 211
Hedgehog (HH)
Smoothened receptor
(SMO) antagonist
Known role in stem cell
differentiation during
normal bone
development known role
in metastasis in other
cancers 212 pathway
inhibition inhibits tumor
growth in xenograft
model 213 FDA approved
for treatment of other
cancers 214
SrcSelective Src kinase
Reduced cell motility in
vitro, no reduction of
metastasis in mouse
models 215 phase II.5
clinical trial underway
identifier <"type":"clinical-trial","attrs":<"text":"NCT00752206","term_id":"NCT00752206">> NCT00752206)
Signaling pathway active
in osteosarcoma
tissues 216 expression
correlated with metastasis
and survival 216 currently
in clinical trials in dogs
(COTC020) prevented
metastasis in xenograft
models 217
modulators and
antibody conjugates
hu14.18K322AGD2Humanized anti-GD2
Ubiquitously expressed in
osteosarcoma cell lines
and tissues 218 currently
being tested in phase I
clinical trials in
identifier <"type":"clinical-trial","attrs":<"text":"NCT00743496","term_id":"NCT00743496">> NCT00743496)
ADXS31-164Her2/neuVaccineExpression in
osteosarcoma tumors is
associated with poor
survival outcomes 219,220
targeted immunotherapy
reduced tumor-initiating
cells 221 currently in dog
clinical trials (
vedotin (CDX-011)
Variably expressed on
surface of osteosarcoma
xenografts 222 significant
improvement in
event-free survival in
osteosarcoma xenograft
models 222
CREG1, p14ARF,
p21, RASSF1
DNMTiNumerous genes
associated with promoter
hypermethylation in
osteosarcoma (see text)
phase I clinical trials
completed 222 and
identifier <"type":"clinical-trial","attrs":<"text":"NCT01241162","term_id":"NCT01241162">> NCT01241162)
IbandronateRas, DNMT, FasBisphosphonate
upregulates Fas
Inhibited Ras function
and downregulated
DNMT, leading to
increased Fas
expression 160 induced
apoptosis in vitro 160
ZolendronateSmall GTPasesBisphosphonate
downregulates VEGF
Suppressed lung
metastasis and prolonged
overall survival in mouse
models 224,225 phase I
clinical trial completed 226
TranylcypromineLSD1Forms adduct with
inactive region of
Expressed in
osteosarcoma tissues 172
reduced osteosarcoma
growth in vitro 172
Pracinotat (SB939)HDACHDACiPhase I trial completed 227 Medium-
Erinostat (MS-275)HDAC, FasHDACi Fas
Upregulates Fas
expression in Fas-
metastatic osteosarcoma
cells 228 caused
regression of metastasis
in xenograft models
through upregulation Fas
expression in Fas- cells 228
Valproic acidHDACHDACiInhibited growth in vitro
and in xenograph
metastasis models in
combination with
doxorubicin 229 phase I
clinical trials ongoing
<"type":"clinical-trial","attrs":<"text":"NCT01106872","term_id":"NCT01106872">> NCT01106872
<"type":"clinical-trial","attrs":<"text":"NCT01010958","term_id":"NCT01010958">> NCT01010958)

6. Epigenetic Chromatin Regulation and DNA Repair: Synthetic Lethal Interactions and Clinical Applications

As illustrated, chromatin regulation and DNA repair have a complex interplay that we only recently have begun to understand. As a result, there is substantial effort to translate these findings for patient benefit, particularly in cancer. A big portion of cancer treatment options rely on killing cancer cells through induction of DNA damage directly by chemotherapy or irradiation, or indirectly through targeting DNA repair. However, high toxicity and refractory or recurrent disease are frequent, which calls for new treatment options and better patient selection. Epigenomic alterations are thought to play an important role in drug resistance by contributing to gene expression plasticity and tumor heterogeneity [177]. Moreover, the interplay between epigenetic regulation and DNA repair can be exploited to achieve greater therapeutic response, by synergistic and synthetic lethal interactions. In this context, chemical inhibition of epigenetic factors that modulate DDR and drug resistance is a promising and attractive avenue for anticancer therapy ( Figure 4 ).

Targeting multiple steps of DNA damage resolution leads to efficient cell killing. Chromatin regulation is necessary for efficient DNA repair. These pathways are often defective in cancer (asterisks *) and can be targeted using specific inhibitors. As a result, synthetic lethal interactions can be exploited with multiple ways, which can be very advantageous in the clinical setting, where changes in treatment are required.

Chromatin regulation is a complex process, that is often disrupted in various ways within cancer cells. Epigenetic drugs that target components of chromatin regulation, such as inhibitors of DNMTs and HDACs have been proven clinically effective mostly in hematopoietic malignancies that are particularly reliant on epigenetic deregulation of progenitor/stem cells [178]. In solid tumors, broad use of epigenetic drugs has been proven ineffective [179] and only targeted approaches such as use of EZH2 a and IDH inhibitors in selected patients seem promising [180]. A synopsis of current epigenetic drugs used in clinic is shown in Table 2 . DNA repair is also frequently disrupted in cancer, and multiple approaches based on targeting DNA repair are currently employed in the clinic as reviewed previously [181].

Table 2

Representative epigenetic drugs used in clinic.

Inhibitor TypeRepresentative DrugsTargetStatusCancer Type
HDACVorinostatAll HDACsFDA approvedT-cell Lymphoma
RomidepsinHDAC1-3FDA approvedT-cell Lymphoma
BelinostatAll HDACsFDA approvedT-cell Lymphoma
PanobinostatAll HDACsFDA approvedRefractory multiple myeloma
BETOTX015/MK-8628BRD2/3/4phase 1bNUT midline carcinoma
I-BET762BRD2/3/5phase 1/2NUT midline carcinoma & hematological cancers
DNMT5-azacitidineDNMTsFDA appovedAML, MDS
DecitabineDNMTsFDA appovedAML, MDS
HDMtranylcypromineLSD1phase 1AML
HMTtazemetostatEZH2phase 1/2B-cell Lymphoma
PinometostatDOT1Lphase 1MLL-r Leukemia

6.1. Epigenetic Inhibitors in Combination with Chemotherapy/Radiotherapy

In the context of chromatin regulation and DNA repair, the most explored therapeutic approach so far is combination of epigenetic inhibitors with chemotherapeutic agents. Such combinations have displayed synergistic effects in pre-clinical models. More specifically, HDAC inhibitors have been shown to inhibit the DDR/HR pathway and cause sensitivity to DNA damage-inducing agents in various cell types [124]. Moreover, HDAC, DNMT, and LSD1 inhibitors were shown to counteract epigenetic resistance mechanisms and restore sensitivity to chemotherapy in solid tumors [182,183,184]. A number of clinical trials were conducted to assess the efficacy of these combinations in treatment of advanced solid tumors with mixed results in terms of patient response and toxicity, likely because of the differences between regimens and cohorts [185,186]. It is likely that more targeted approaches will be more beneficial, as in the context of BRF1 or EGFR bearing mutant non-small-cell lung cancers, where EZH2 inhibition was shown to selectively sensitize these tumors to topoisomerase II inhibition [187].

Following the same rationale, epigenetic inhibitors could also potentiate radiotherapy. Preclinical evidence has shown such synergism between radiotherapy and HDAC inhibitors [188,189], BET inhibitors [190], EZH2 inhibitors [191,192], and DNA methyltransferase inhibitors [193]. Of these, only the combination of HDAC inhibitors and irradiation is under assessment in the clinic and initial results indicate high toxicity and only limited patient benefit. DNA methyltransferase inhibitors, such 5-azacytidine and decitabine, are cytidine analogs that are incorporated in DNA and are potent radiosensitizers in all contexts, so this combination is not viable due to high toxicity. Optimization of regimen and dosage will be key to increasing efficacy of these drug combinations in the clinic.

6.2. Epigenetic Inhibitors in Combination with Drugs that Target DNA Repair Components

Another therapeutic approach to exploit this interplay is to utilize synthetic lethal interactions. As mentioned above, HDAC inhibitors have been shown to inhibit expression of HR repair genes and this provides a rationale for combing HDAC and PARP inhibitors to achieve effective tumor killing [194,195]. Such synergism has been observed in pre-clinical models of prostate, breast, and ovarian cancer [196,197,198,199]. The efficacy of combining HDAC and PARP inhibitors is currently under evaluation in the clinic ( <"type":"clinical-trial","attrs":<"text":"NCT03742245","term_id":"NCT03742245">> NCT03742245). Similar effects in the expression of HR components are also observed by BET inhibition [200]. Pre-clinical studies have shown significant synergism between BET and PARP inhibitors in multiple cell types, including breast and ovarian cancer [200,201,202,203], which is also under assessment in the clinic (NCT03991469). Combination of PARP and DNA methyltransferase inhibitors has also shown synergistic activity in AML and breast cancer cells [204]. PARP1 and DNMT1 were shown to interact during repair and simultaneous inhibition led to increased PARP1 trapping and DNA damage. A clinical trial is investigating the efficacy of this combination in AML patients ( <"type":"clinical-trial","attrs":<"text":"NCT02878785","term_id":"NCT02878785">> NCT02878785). All the above combinations allow administration of low doses of each drug, which is particularly important for HDAC and DNMT inhibitors, since high toxicity has been a limiting factor to their application. Optimal patient selection and regimen will be crucial to the success of these and future trials.

6.3. Targeting DNA Repair in Tumors with Epigenetic Alterations

Another way to take advantage of the interplay between DNA repair and the epigenome is to target DNA repair components in cancer cells with specific epigenomic alterations. An example of this is H3K36me3 loss which occurs in tumors with either mutations in SETD2, the methyltransferase that deposits this mark, or mutations in histone H3 that inhibit the generation of this modification (e.g., H3K36me3), as well as in tumors that overexpress the demethylases KDM4A and KDM4B [205,206]. These events are frequent in various cancer types, including renal cell carcinoma, lung cancer and glioma [207], and have been associated with poor prognosis [208]. H3K36me3 has been implicated in many DNA repair pathways including HR, NHEJ, and MMR [209]. Pfister et al. identified a dependency of tumors with low H3K36me3 to cell cycle checkpoints, rendering them sensitive to WEE1, CHK, and ATR inhibition [210]. This discovery led to the initiation of a clinical trial assessing the use of the WEE1 inhibitor adavosertib in SETD2-deficient solid tumors ( <"type":"clinical-trial","attrs":<"text":"NCT03284385","term_id":"NCT03284385">> NCT03284385). Based on the role of H3K36me3 in other aspects of DNA repair, it would be interesting to examine other potential synthetic lethal interactions in H3K36me3-low tumors such as PARP inhibition.

Another component of chromatin regulation that has been shown to be actively involved in DNA repair pathways is the subunit of the SWI/SNF complex, ARID1A. In solid tumors, this epigenetic factor is frequently mutated and its inactivation has been linked with aggressive disease [211,212]. ARID1A-deficient tumors were shown to have cell cycle defects due to its role in DNA damage response and cell cycle checkpoint regulation [213,214]. Consequently, these tumors were found to be sensitive to PARP and ATR inhibitors. A number of clinical trials are currently investigating the efficacy of targeting ARID1A deficient tumors with these inhibitors in patients ( <"type":"clinical-trial","attrs":<"text":"NCT04065269","term_id":"NCT04065269">> NCT04065269, <"type":"clinical-trial","attrs":<"text":"NCT03207347","term_id":"NCT03207347">> NCT03207347, <"type":"clinical-trial","attrs":<"text":"NCT04042831","term_id":"NCT04042831">> NCT04042831).

79 Regulation of Gene Expression

By the end of this section, you will be able to do the following:

  • Discuss why every cell does not express all of its genes all of the time
  • Describe how prokaryotic gene regulation occurs at the transcriptional level
  • Discuss how eukaryotic gene regulation occurs at the epigenetic, transcriptional, post-transcriptional, translational, and post-translational levels

For a cell to function properly, necessary proteins must be synthesized at the proper time and place. All cells control or regulate the synthesis of proteins from information encoded in their DNA. The process of turning on a gene to produce RNA and protein is called gene expression . Whether in a simple unicellular organism or a complex multi-cellular organism, each cell controls when and how its genes are expressed. For this to occur, there must be internal chemical mechanisms that control when a gene is expressed to make RNA and protein, how much of the protein is made, and when it is time to stop making that protein because it is no longer needed.

The regulation of gene expression conserves energy and space. It would require a significant amount of energy for an organism to express every gene at all times, so it is more energy efficient to turn on the genes only when they are required. In addition, only expressing a subset of genes in each cell saves space because DNA must be unwound from its tightly coiled structure to transcribe and translate the DNA. Cells would have to be enormous if every protein were expressed in every cell all the time.

The control of gene expression is extremely complex. Malfunctions in this process are detrimental to the cell and can lead to the development of many diseases, including cancer.

Prokaryotic versus Eukaryotic Gene Expression

To understand how gene expression is regulated, we must first understand how a gene codes for a functional protein in a cell. The process occurs in both prokaryotic and eukaryotic cells, just in slightly different manners.

Prokaryotic organisms are single-celled organisms that lack a cell nucleus, and their DNA therefore floats freely in the cell cytoplasm. To synthesize a protein, the processes of transcription and translation occur almost simultaneously. When the resulting protein is no longer needed, transcription stops. As a result, the primary method to control what type of protein and how much of each protein is expressed in a prokaryotic cell is the regulation of DNA transcription. All of the subsequent steps occur automatically. When more protein is required, more transcription occurs. Therefore, in prokaryotic cells, the control of gene expression is mostly at the transcriptional level.

Eukaryotic cells, in contrast, have intracellular organelles that add to their complexity. In eukaryotic cells, the DNA is contained inside the cell’s nucleus and there it is transcribed into RNA. The newly synthesized RNA is then transported out of the nucleus into the cytoplasm, where ribosomes translate the RNA into protein. The processes of transcription and translation are physically separated by the nuclear membrane transcription occurs only within the nucleus, and translation occurs only outside the nucleus in the cytoplasm. The regulation of gene expression can occur at all stages of the process ((Figure)). Regulation may occur when the DNA is uncoiled and loosened from nucleosomes to bind transcription factors ( epigenetic level), when the RNA is transcribed ( transcriptional level), when the RNA is processed and exported to the cytoplasm after it is transcribed ( post-transcriptional level), when the RNA is translated into protein ( translational level), or after the protein has been made ( post-translational level).

The differences in the regulation of gene expression between prokaryotes and eukaryotes are summarized in (Figure). The regulation of gene expression is discussed in detail in subsequent modules.

Differences in the Regulation of Gene Expression of Prokaryotic and Eukaryotic Organisms
Prokaryotic organisms Eukaryotic organisms
Lack a membrane-bound nucleus Contain nucleus
DNA is found in the cytoplasm DNA is confined to the nuclear compartment
RNA transcription and protein formation occur almost simultaneously RNA transcription occurs prior to protein formation, and it takes place in the nucleus. Translation of RNA to protein occurs in the cytoplasm.
Gene expression is regulated primarily at the transcriptional level Gene expression is regulated at many levels (epigenetic, transcriptional, nuclear shuttling, post-transcriptional, translational, and post-translational)

Prokaryotic cells can only regulate gene expression by controlling the amount of transcription. As eukaryotic cells evolved, the complexity of the control of gene expression increased. For example, with the evolution of eukaryotic cells came compartmentalization of important cellular components and cellular processes. A nuclear region that contains the DNA was formed. Transcription and translation were physically separated into two different cellular compartments. It therefore became possible to control gene expression by regulating transcription in the nucleus, and also by controlling the RNA levels and protein translation present outside the nucleus.

Most gene regulation is done to conserve cell resources. However, other regulatory processes may be defensive. Cellular processes such as developed to protect the cell from viral or parasitic infections. If the cell could quickly shut off gene expression for a short period of time, it would be able to survive an infection when other organisms could not. Therefore, the organism evolved a new process that helped it survive, and it was able to pass this new development to offspring.

Section Summary

While all somatic cells within an organism contain the same DNA, not all cells within that organism express the same proteins. Prokaryotic organisms express most of their genes most of the time. However, some genes are expressed only when they are needed. Eukaryotic organisms, on the other hand, express only a subset of their genes in any given cell. To express a protein, the DNA is first transcribed into RNA, which is then translated into proteins, which are then targeted to specific cellular locations. In prokaryotic cells, transcription and translation occur almost simultaneously. In eukaryotic cells, transcription occurs in the nucleus and is separate from the translation that occurs in the cytoplasm. Gene expression in prokaryotes is mostly regulated at the transcriptional level (some epigenetic and post-translational regulation is also present), whereas in eukaryotic cells, gene expression is regulated at the epigenetic, transcriptional, post-transcriptional, translational, and post-translational levels.

Review Questions

Control of gene expression in eukaryotic cells occurs at which level(s)?

  1. only the transcriptional level
  2. epigenetic and transcriptional levels
  3. epigenetic, transcriptional, and translational levels
  4. epigenetic, transcriptional, post-transcriptional, translational, and post-translational levels

Post-translational control refers to:

  1. regulation of gene expression after transcription
  2. regulation of gene expression after translation
  3. control of epigenetic activation
  4. period between transcription and translation

How does the regulation of gene expression support continued evolution of more complex organisms?

  1. Cells can become specialized within a multicellular organism.
  2. Organisms can conserve energy and resources.
  3. Cells grow larger to accommodate protein production.
  4. Both A and B.

Critical Thinking Questions

Name two differences between prokaryotic and eukaryotic cells and how these differences benefit multicellular organisms.

Eukaryotic cells have a nucleus, whereas prokaryotic cells do not. In eukaryotic cells, DNA is confined within the nuclear region. Because of this, transcription and translation are physically separated. This creates a more complex mechanism for the control of gene expression that benefits multicellular organisms because it compartmentalizes gene regulation.

Gene expression occurs at many stages in eukaryotic cells, whereas in prokaryotic cells, control of gene expression only occurs at the transcriptional level. This allows for greater control of gene expression in eukaryotes and more complex systems to be developed. Because of this, different cell types can arise in an individual organism.

Describe how controlling gene expression will alter the overall protein levels in the cell.

The cell controls which proteins are expressed and to what level each protein is expressed in the cell. Prokaryotic cells alter the transcription rate to turn genes on or off. This method will increase or decrease protein levels in response to what is needed by the cell. Eukaryotic cells change the accessibility (epigenetic), transcription, or translation of a gene. This will alter the amount of RNA and the lifespan of the RNA to alter the amount of protein that exists. Eukaryotic cells also control protein translation to increase or decrease the overall levels. Eukaryotic organisms are much more complex and can manipulate protein levels by changing many stages in the process.


Structural biology-based insights into combinatorial readout and crosstalk among epigenetic marks

Epigenetic mechanisms control gene regulation by writing, reading and erasing specific epigenetic marks. Within the context of multi-disciplinary approaches applied to investigate epigenetic regulation in diverse systems, structural biology techniques have provided insights at the molecular level of key interactions between upstream regulators and downstream effectors. The early structural efforts focused on studies at the single domain-single mark level have been rapidly extended to research at the multiple domain-multiple mark level, thereby providing additional insights into connections within the complicated epigenetic regulatory network. This review focuses on recent results from structural studies on combinatorial readout and crosstalk among epigenetic marks. It starts with an overview of multiple readout of histone marks associated with both single and dual histone tails, as well as the potential crosstalk between them. Next, this review further expands on the simultaneous readout by epigenetic modules of histone and DNA marks, thereby establishing connections between histone lysine methylation and DNA methylation at the nucleosomal level. Finally, the review discusses the role of pre-existing epigenetic marks in directing the writing/erasing of certain epigenetic marks. This article is part of a Special Issue entitled: Molecular mechanisms of histone modification function.

Keywords: Combinatorial readout DNA methylation Epigenetic regulation Histone modification.

Copyright © 2014 Elsevier B.V. All rights reserved.


Structural basis for multivalent readout…

Structural basis for multivalent readout of histone marks from a single histone tail.…

Structural basis for multivalent readout…

Structural basis for multivalent readout of multiple histone tails. (A) Ribbon-representation of a…

Structural basis for multivalent readout…

Structural basis for multivalent readout of histone and DNA marks. (A) The domain…

Structural basis for recognition by…

Structural basis for recognition by histone modification-directed histone modification enzyme. (A) Ribbon representation…

Watch the video: Gene Regulation in Eukaryotes (January 2023).