LECTURE 20
EUKARYOTIC GENES AND GENOMES II
In the last lecture we considered the structure of genes in eukaryotic
organisms and went on to figure out a way to identify S. cerevisiae genes that
are transcriptionally regulated in response to a change in environment. The
ability to regulate gene expression in response to environmental cues is a
fundamental requirement for all living cells, both prokaryote and eukaryote.
We considered how many genes each organism has, about 4,000 for E. coli,
6,000 for yeast and a little over 20,000 for mouse and humans. But only a
subset of these genes is actually expressed at any one time in any particular
cell. For multicellular organisms this becomes even more apparent…it is
obvious that skin cells must be expressing a different set of genes than liver
cells, although of course there must be a common set of genes that are
expressed in both cell types; these are often called housekeeping genes.
There are a number of ways that gene regulation in eukaryotes differs from
gene regulation in prokaryotes.
? Eukaryotic genes are not organized into operons.
? Eukaryotic regulatory genes are not usually linked to the genes they
regulate.
? Some of the regulatory proteins must ultimately be
compartmentalized to the nucleus, even when signaling begins at
the cell membrane or in the cytoplasm.
? Eukaryotic DNA is wrapped around nucleosomes
Today we will consider how one can use genetics to begin to dissect the
mechanisms by which gene transcription can be regulated. For this we will
take the example of the yeast GAL genes in S. cerevisiae.
GALACTOSE METABOLISM IN YEAST
Reaction
D-galactose
D-galactose-1-phosphate
UDP-D-galactose
UDP-D-glucose
D-glucose-1-phosphate
D-glucose-6-phosphate
GLYCOLYSIS
Enzyme
Gene
GAL1
GAL7
GAL10
Galactokinase
Galactose transferase
Galactose epimerase
UDP-glucose
Phosphorylase
Phosphoglucomutase
GAL1 encoded
GAL1, GAL7, GAL10 transcription all induced in the
presence of glucose. How is this achieved.
Once a gene has been identified as being inducible under certain
inducing conditions, in this case in the presence of galactose, we can begin to
dissect the regulatory mechanism by isolating mutants; i.e., mutants that
constitutively express the GAL genes even in the absence of galactose, and
mutants that have lost the ability to induce the GAL genes in the presence of
galactose. If we were studying galactose regulation today we would probably
use a lacZ reporter system as we discussed in the last lecture. However,
when the Gal regulatory system was fist genetically dissected, it was done by
actually measuring the induction of Gal1 encoded galactokiase activity, so this
is how we will discuss the genetic dissection of the system.
Another approach is to simply measure galactokinase
activity in the presence or absence of Galactose
GAL1::Tn7lacZ fusion strain grown on:
GLYCEROL
GLYCEROL
+ X-Gal
GALACTOSE
+ X-Gal
Constitutive
Uninducible
Mutagenized
What we know is that Gal4 mutants are uninducible and that Gal80 and
Gal81 mutants constitutively express the Gal1 galactokinase gene, along
with the other Gal genes. Let’s analyze each mutant in turn:
Gal4 mutant: It was first established that, like Gal1
-
, the Gal4
-
mutant
phenotype is recessive, because heterozygous diploids generated by mating
Gal4
-
to wild type have normal regulation. It was then established that the
mutation in the Gal4
-
strain lies in a
new gene, and not simply in the GAL1
galactokinase gene; Gal1
-
mutants
don’t express galactokinase activity in
the presence of galactose, just as was
seen for the Gal4
-
mutant. That
Gal1
-
and Gal4
-
mutants have
mutations in different genes was
shown by complementation analysis,
(diploids from mating Matα Gal4
-
with Mata Gal1
-
behave like wild
type) and the fact that the GAL4 and
GAL1 genes are unlinked was established by tetrad analysis. You should
think about what the tetrads from the aforementioned diploids would look like.
Put together the simplest model
is that Gal4 is a positive
regulator of Gal1 (and the other
Gal genes). The + sign indicates
that Gal4 increases Gal
expression, but does not indicate
whether this is direct or indirect.
Gal80 mutant: The next useful
regulatory mutant isolated was Gal80
-
,
in which the Gal1 encoded galactokinase
is expressed even in the absence of
galactose and is not further induced in
its presence. Again, heterozygous
diploids (Gal80
-
/wt) showed that
Gal80
-
is recessive, Tetrad analysis
showed that Gal80 is not linked to Gal1,
Gal4 or any of the Gal genes.
If a mutant Gal80 results in constitutive Gal1 expression, the simplest model
is that Gal80 negatively regulates the Gal genes. Since Gal4 positively
regulates, and Gal80 negatively regulated Gal1 expression, we have to figure
out how these two gene products work together to achieve such regulation.
Assuming that Gal4 and Gal80 act in series there are two formal possibilities:
Model 1 is that Gal4
positively regulates Gal1,
and that Gal80 negatively
regulates Gal4; the
presence of galactose
somehow inhibits Gal80
function thus releasing
Gal4 to positively activate
Gal1 expression.
Model 2 is that Gal80
negatively regulates
Gal1, and Gal4
negatively regulates Gal80; here the presence of galactose positively
activates Gal4 which in turn negatively regulates Gal80, thus relieving
inhibition of
Model 1
Model 2
Gal1 expression.
We can distinguish between these two models by doing what’s called an
epistasis test to establish the epistatic relationship between Gal4
-
and
Gal80
-
. This involves making a double Gal4
-
/ Gal80
-
mutant strain. The
phenotype of the double mutant will indicate which of the two models is most
likely to be true….take a look at the two models to predict what phenotype the
double mutant should have. For Model 1 the double mutant would become
uninducible, for Model 2 the double mutant should be constitutive.
We could make the Gal4
-
/ Gal80
-
double mutant strain using molecular
biological approaches…but an easier way is to let yeast meiosis do the job for
you. If we mate the Gal4
-
/ Gal80
+
haploid strain with the Gal4+ / Gal80
-
haploid strain (we know these two genes are unlinked) we should obtain
double mutants among the tetratype and non-parental ditype tetrads that
result from this cross.
Parental
Ditype
Gal4- Gal80+
Gal4- Gal80+
Gal4+ Gal80-
Gal4+ Gal80-
Tetratype
Gal4- Gal80+
Gal4- Gal80-
Gal4+ Gal80-
Gal4+ Gal80+
Non Parental
Ditype
Gal4- Gal80-
Gal4- Gal80-
Gal4+ Gal80+
Gal4+ Gal80+
These results clearly support Model 1, i.e., because the double mutant is
uninducible rather than constitutive, Gal4 liklely behaves as a positive
activator of Gal1 expression, and in the absence of galactose Gal80 somehow
prevents Gal4 from activation Gal1 expression. When galactose is present
Gal80 can no longer prevent Gal4 from activating Gal1 expression.
Now lets consider a new class of mutant that turned out to be quite
informative, Gal81
-
. Gal81
-
mutants, like Gal80
-
mutants are consititutive
for Gal1 expression, but unlike Gal80
-
, Gal81
-
is dominant. (Gal81
-
/ Gal80
-
diploids are constitutive).
An obvious question is whether Gal81
-
mutants are still constitutive in a
Gal4
-
background, since it was already established that Gal4 positively
regulates Gal1 (and the other Gal genes).
Mat Gal81
-
Gal4
+
X Matα Gal4
-
Gal81
+
The surprising finding was that from this cross all the tetrads were of the
parental ditype; in other words there were no tetratypes or nonparental
ditypes, indicating that Gal81
-
and Gal4
-
are very tightly linked. Indeed, it
turns out that the Gal81
-
mutation maps to the Gal4 gene, in the coding
region. The Gal81
-
mutation was redesignated as Gal4
81
. Essentially Gal4
81
behaves as a super-activator that is impervious to the negative effects of
Gal80; Gal4
81
thus activates independently of galactose and Gal80.
So, how do all these genetic facts fit into a molecular model? Upstream of the
Gal1 gene (and other Gal genes) two cis-acting elements are needed for
transcriptional activation. First the TATA-binding protein (TBP) binds to the
TATA-concensus site, and provides a landing pad for a very large RNA
ploymerase complex (RNAP). However, just binding to TBP does not enable
transcription, the complex must
be activated by a transcriptional
activator, in this case the Gal4
protein. The Gal4 protein sits at
another cis-acting element in the
Gal1 promoter region, the
upstream activator sequence
(UAS) that tethers Gal4 to the
promoter. In the absence of
galactose, Gal80 physically
prevents Gal4 from recruiting
and activating RNAP. In the
presence of galactose the Gal80
protein changes conformation
and binds to a different region of Gal4, unveiling the ability of Gal4 to recruit
and activate RNAP.
Gal80
(Note that the mutation in the Gal4
81
allele interferes with Gal80 binding
allowing Gal4 to recruit and activate RNAP all the time, even in the absence
of galactose.)
One final comment about
the model for induction of
the Gal genes by galactose.
For many years it was
assumed that galactose (or a derivative of galactose) actually binds directly to
the Gal80 protein, thus preventing it from inhibiting the Gal4 protein from
activating Gal1 transcription. However, it now seems that one extra protein
involved in this chain of events. The Gal3 protein turns out to be directly
bound by galactose (or a derivative); this allows Gal3 to move from the
cytoplasm into the nucleus, the galactose/Gal3 moiety binds to Gal80 to
facilitate moving Gal80 to a different site on the Gal4 protein, thus allowing
Gal4 to activate transcription. Thus while the model as written in this figure
does not actually include Gal3, the models are still formally correct.
In the next lecture we will be looking at promoter elements in eukaryotic
genes in more detail.