Inquiry Question 1: How does mutation introduce new alleles into a population?
Assess the significance of 'coding' and 'non-coding' DNA segments in the process of mutation and investigate the effects of different mutations on a protein's amino acid sequence
A focused answer to the HSC Biology Module 6 dot point on how mutations alter protein products. Coding versus non-coding regions, silent missense and nonsense substitutions, frameshift consequences, splice-site mutations, and a worked sickle cell example.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
NESA wants you to trace the consequence of a mutation through the DNA, mRNA and protein levels, and explain why mutations in non-coding regions can also affect phenotype. The standard worked example is sickle cell anaemia.
The answer
A mutation's effect on protein depends on where it lands (coding vs non-coding region) and what kind of change it is (silent, missense, nonsense or frameshift).
Coding vs non-coding DNA
Coding DNA (exons). Translated into amino acids. A mutation here directly changes the protein sequence (or stops translation).
Non-coding DNA. Includes promoters, enhancers, introns, splice sites, untranslated regions (UTRs) and non-coding RNA genes. Not translated, but mutations here can still change the amount, timing or splicing of the protein.
In humans, less than 2% of the genome codes directly for protein. Most regulatory sequence is non-coding, so non-coding mutations are common and important.
Effects on amino acid sequence (coding region mutations)
Silent mutation. A substitution that does not change the amino acid because the genetic code is degenerate (e.g. GGA and GGC both code for glycine). No effect on protein sequence.
Missense mutation. A substitution that changes one amino acid for another. Effect depends on:
- Which amino acid changes. A conservative change (one hydrophobic for another) often preserves function. A non-conservative change (charged to non-polar, as in sickle cell) is more likely to disrupt folding or activity.
- Where in the protein. Changes at the active site of an enzyme or at a protein-protein interface are usually catastrophic; changes in loops or surface residues may be tolerated.
Nonsense mutation. A substitution that creates a premature stop codon (UAA, UAG, UGA). The protein is truncated and usually non-functional. Many Duchenne muscular dystrophy alleles are nonsense mutations in the dystrophin gene.
Frameshift mutation. An insertion or deletion of a number of bases not divisible by three shifts the reading frame from the mutation onward. The amino acid sequence past the mutation is essentially random, and a premature stop codon usually appears within a few codons, producing a truncated, non-functional protein.
Effects of non-coding region mutations
- Promoter mutations
- Alter transcription factor binding, increasing or decreasing transcription. A weaker promoter for a tumour suppressor reduces its expression and increases cancer risk.
- Splice-site mutations
- Disrupt the GT...AG signals at intron boundaries, causing exon skipping or intron retention. Many beta-thalassaemia and Marfan syndrome alleles are splice-site mutations.
- Enhancer and silencer mutations
- Change tissue-specific or developmental-stage expression.
- Mutations in non-coding RNA genes
- A mutation in a microRNA gene can dysregulate dozens of target mRNAs.
Worked example: sickle cell anaemia
- DNA
- Beta-globin gene, codon 6, sense strand changes from GAG to GTG (a single A to T substitution).
- mRNA
- Codon 6 changes from GAG to GUG.
- Protein
- Glutamic acid (charged, hydrophilic) is replaced by valine (uncharged, hydrophobic). This is a non-conservative missense mutation at a surface residue.
- Cell level
- The hydrophobic valine creates a sticky patch on the beta-globin surface. Under low oxygen, the deoxygenated haemoglobin (HbS) polymerises into long fibres, deforming red blood cells into rigid sickled shapes.
- Organism level
- Sickled cells block capillaries (vaso-occlusive pain crises), are destroyed by the spleen (chronic haemolytic anaemia) and have a shortened lifespan. Heterozygotes are carriers with partial resistance to malaria, which explains the high allele frequency in malarial regions.
Summary table
| Mutation type | Region | Effect on protein |
|---|---|---|
| Silent | Coding | None |
| Missense | Coding | One amino acid changed (effect depends on chemistry and location) |
| Nonsense | Coding | Premature stop; truncated, non-functional |
| Frameshift | Coding | Reading frame shifted; mostly non-functional |
| Promoter | Non-coding | Altered amount of protein |
| Splice site | Non-coding | Faulty mRNA; usually non-functional protein |
Examples in context
Example 1. Cystic fibrosis and the F508del mutation. Around 1 in 2500 babies in Australia is born with cystic fibrosis, and the most common cause is the F508del mutation in the CFTR gene. The mutation is an in-frame deletion of three nucleotides (CTT) in exon 11, which removes the codon for phenylalanine at position 508 of the CFTR chloride channel protein. Because exactly three bases are removed, the reading frame is preserved (this is not a frameshift), but the missing amino acid causes the protein to misfold and be degraded by the cell's quality control machinery before reaching the cell membrane. The lung epithelium cannot then move chloride ions, mucus thickens, and recurrent infection follows.
Example 2. A non-coding mutation in lactase persistence. Most adult mammals lose the ability to digest lactose, but roughly 35 percent of Australian adults of European ancestry can. The mutation is not in the coding region of the lactase gene (LCT) itself but in a regulatory region 14 kilobases upstream, within an intron of an adjacent gene. A single C to T substitution at position -13910 creates a stronger binding site for a transcription factor, keeping LCT transcription active in adulthood. Although the lactase protein itself is unchanged, the amount made is much higher. This is a textbook example of why non-coding DNA matters and why focussing only on coding regions misses important phenotypic variation.
Try this
Q1. A coding DNA sequence reads 5'-ATGCATCCATAA-3'. A point mutation changes the second codon from CAT to CAC. State the type of mutation and justify your answer using a codon table reference. [3 marks]
- Cue. CAT and CAC both code for histidine (His); this is a silent (synonymous) mutation that does not alter the amino acid sequence.
Q2. The mRNA sequence 5'-AUGGAAUUCUAA-3' is mutated by inserting a single G after the start codon. Predict the new amino acid sequence and identify the type of mutation. Use AUG = Met, GGA = Gly, AUU = Ile, CUA = Leu, A and other codons as needed. [3 marks]
- Cue. Insertion of one base shifts the reading frame: AUG-GGA-AUU-CUA-A... so the polypeptide becomes Met-Gly-Ile-Leu... This is a frameshift mutation.
Q3. Compare the phenotypic effects of a missense mutation, a nonsense mutation and a frameshift mutation occurring in the same coding region. (a) For each, describe the effect on the amino acid sequence. (b) Predict which is most likely to produce a non-functional protein and justify your answer. [3+3 marks]
- Cue. (a) Missense: one amino acid swapped; nonsense: premature stop truncates the protein; frameshift: changes every downstream codon. (b) Frameshift or nonsense typically more damaging because they disrupt the entire C-terminal portion of the protein.
Exam-style practice questions
Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
2024 HSC4 marksThe faulty CFTR allele is often caused by the deletion of three nucleotides. Explain how the deletion of nucleotides in the CFTR gene removes only one amino acid. Include reference to the nucleotides that code for the isoleucine and phenylalanine amino acids. (Use the codon chart provided.)Show worked answer →
Recognise that three deleted nucleotides span two codons, then use the redundancy of the genetic code. Sample answer: A triplet of mRNA nucleotides codes for one amino acid, but the three deleted nucleotides span across two codons - the codon for isoleucine and the codon for phenylalanine - so you would expect both to be affected. However, isoleucine can be coded by several different triplets. The correct coding is AUC, but after the deletion the remaining code is AUU, which also codes for isoleucine. Thus only the phenylalanine is removed. Marks: 4 = thorough explanation referring to isoleucine, phenylalanine and the final sequence; 3 = sound explanation referring to appropriate amino acids/deletion effects; 2 = some understanding of how mRNA codes for amino acids; 1 = some relevant information. Common error: not grasping that the deletion spanned two codons / that multiple codons code the same amino acid.
2022 HSC4 marksThe EGFR protein includes a receptor and an enzyme component, and its function is to help regulate cell division. EGFR mutations are present in about 32% of Non-Small Cell Lung Cancer cases. Explain how a mutation in the EGFR gene could result in changes in protein structure and function to increase the risk of lung cancer.Show worked answer →
Link mutation to amino-acid change to altered structure/function, then to uncontrolled cell division. Sample answer: A mutation of the EGFR gene is a change in its DNA base sequence/codons. If this change is in a coding region it can change the amino acid sequence of the polypeptide, altering the folding and properties of the protein and so its function. For example, a change in the enzyme region could alter the active site and therefore enzyme activity, changing the rate of DNA replication and cell division. Since EGFR helps control cell division, the mutation can lead to uncontrolled cell division, and lung cancer is the result of uncontrolled cell division. Marks: 4 = explains the link between mutation, EGFR structure and regulation of cell division AND links cancer to uncontrolled division; 3 = describes the link to protein structure and links cancer to uncontrolled division; 2 = outlines an effect of mutation on EGFR; 1 = relevant information. Use the stimulus and link the changed structure to loss of cell-division control.
2023 HSC4 marks5-Bromouracil (bU) bonds with adenine in place of thymine, then binds with guanine during replication, making a guanine-cytosine pair instead of an adenine-thymine pair. Describe the possible effects on a protein if this mutation occurred within a gene.Show worked answer →
Trace the base change through polypeptide synthesis to several possible protein outcomes. Sample answer: During polypeptide synthesis an mRNA strand is made from the DNA template, and the sequence of bases codes for specific amino acids. After the bU substitution, the mutated strand contains guanine and continues to reproduce the mutated gene. Possible effects: (1) if the new (mutated) codon still codes for the same amino acid, there is no change to the protein (silent); (2) if it codes for a different amino acid, a different polypeptide forms, which could fold differently and produce a non-functioning protein; (3) if the mutation creates a STOP codon, the chain terminates early, and if it disrupts the AUG START, translation will not begin. Marks: 4 = comprehensive understanding of how the mutation could alter the amino acid sequence AND relates a changed sequence to protein structure/function; 3 = sound understanding of one pathway and a structure/function link; 2 = identifies a possible way it alters the DNA/amino acid sequence; 1 = relevant information.
2023 HSC2 marksThe normal Huntingtin protein has 10-26 repeats of CAG; in Huntington's disease there are 37-80 repeats, altering protein structure. Using the graph (age of onset vs number of CAG repeats), explain the relationship between the number of CAG repeats and the age of onset of Huntington's disease.Show worked answer →
State the inverse relationship and support it with a data point and a structural reason. Sample answer: As the number of CAG repeats increases, the age of onset of Huntington's disease decreases - e.g. with about 60 CAG repeats the age of onset is around 20 years. The increased CAG repeats lead to the alteration of the protein responsible for the disease at an earlier age (change in protein structure occurs at an earlier stage). Marks: 2 = explains the relationship AND makes relevant reference to the graph data OR the effect of increased repeats on protein structure; 1 = makes the relationship evident OR makes some reference to the data. Common error: not identifying the independent/dependent variables or not using data to explain the relationship.
Related dot points
- Explain how a range of mutagens operate, including but not limited to: electromagnetic radiation sources, chemicals, naturally occurring mutagens; and classify different types of mutation including point, silent, frameshift and chromosomal mutations
A focused answer to the HSC Biology Module 6 dot point on classifying mutations. Covers point mutations (substitution, insertion, deletion), silent vs missense vs nonsense, frameshift effects on reading frame, and chromosomal mutations (deletion, duplication, inversion, translocation, non-disjunction).
- Explain how a range of mutagens operate, including but not limited to: electromagnetic radiation sources, chemicals, naturally occurring mutagens
A focused answer to the HSC Biology Module 6 dot point on mutagens. Physical mutagens (UV, X-rays, gamma rays), chemical mutagens (base analogues, alkylating agents, intercalators) and biological mutagens (viruses, transposons), with named examples and the molecular mechanism by which each damages DNA.
- Investigate the causes of genetic variation relating to the changes and conservation of the DNA sequence including: the use of pedigree analysis to identify patterns of inheritance and mutation
A focused answer to the HSC Biology Module 6 dot point on pedigree analysis. How to identify autosomal recessive, autosomal dominant, X-linked recessive and X-linked dominant inheritance patterns from pedigree charts, with a worked haemophilia example and rules for spotting new mutations.