Lecture 6: Genetic basis of Antibody Structure.

Chapter 6, Benjamini et al.

The following observations have to be accounted for:

  1. Each B-cell produces antibodies that are homogenous in specificity.
  2. At any given time there are about 108 different specificities present in the average human (or mouse)
  3. Diversity in amino acid sequence is generally limited to the three hypervariable regions on the light and heavy chains.
  4. The amino acid sequences in the third hypervariable regions on both the heavy and light chains are more diverse than the first two hypervariable region. The third hypervariable region on the heavy chain is more diverse than that on the light chain.
  5. The specificity is maintained, even if the isotype (e.g. IgG and IgA) of the antibody changes.
  6. The constant regions of the different isotypes are the same in all antibodies of the same isotype from a single individual.
  7. Changes in isotype follow a regular sequence.
  8. Once an isotype is no longer produced, it can never be produced by the same cell.
  9. Membrane and soluble forms of the same isotype can exist, even from the same cell.
  10. IgM and IgD are co-expressed on the surface of mature, no activated B-cells
  11. The affinity is increased during the maturation of the B-cell after stimulation with the foreign antigen.

Early Models:

  1. Each different polypeptide is produced by its own gene.
  2. There are only a few immunoglobulin genes, and the diversity is generated by a high mutation rate in the B-cell during its development.
  3. Two gene model (Dryer and Bennett, 1965). There are two genes that are used to construct a single functional immunoglobulin gene, one for the variable region and one for the constant region. Many different variable region ëcassettesí exist that can be combined with the constant region.

In 1976 Tonegawa showed that the genomic DNA that encodes light chain DNA changes it restriction map when stem cell DNA is compared to DNA from a mature B-cell.

Subsequent DNA sequencing produced the following structure of the kappa light chains in mice. Humans have a similar organization of DNA, some of which are discussed in Benjamini et al.( Figure 6.2).

 

 

 

 

 

 

 

 

A comparison of the protein sequences of k -chains to the DNA sequence revealed that the k -chain locus consists of a series of approximately 300 variable regions (with a leader exon), indicated by ëL Vk 1í, followed by 23 kb of DNA, five short segments of DNA called J segments that corresponded to CDR3, 2.5 kb of DNA and then DNA the encodes the constant region of the k light chain. (The y indicates a pseudogene)

Based on the DNA sequence in mature B-cells it became clear that the following sequence of events occur:

  1. One of the 300 L-V segments joins, at random, to one of the 4 J segments, leading to altered chromosomal DNA.
  2. mRNA splicing removes the intro between the L-V segment and the J and Ck segment
  3. The peptide encoded by the leader exon is removed by the leader peptidase as the growing light chain is synthesized on the rough endoplasmic reticulum.

 

The high diversity in CDR3 of the light chain is due to the fact that this sequence comes from the J-segment. This fact in itself does not generate diversity (there are only 4 J segments) but the DNA joining event is imprecise (see below) leading to considerable diversity.

 

The l -chain DNA and the DNA encoding the heavy chain show a similar motif. In the mouse the l -locus is less diverse, showing only two V segments and 3 functional J domains.

 

 

 

 

 

 

 

 

 

 

The heavy chain locus contains additional segments that are not seen in the light chain:

 

The sequence of events that occur to generate a viable heavy chain gene are as follows:

  1. One of the 13 D segments joins to one of the four J regions, generating a DJ junction.
  2. One of the 300-100 L-V segments joins to new DJ segment.
  3. Introns are removed between the L segment and the V segment by mRNA splicing
  4. Alternate splicing generates either an IgM (m chain) or IgD (d chain)

 

Combinatoral Diversity.

 

Number of possible l -chains: 2 (V) x 3 (J) = 6 different l light chains possible

Number of possible k -chains: 300 (V) x 4 (J) = 1200 different k light chains possible

Number of possible H-chains: 500 (V) x 13 (D) x 4 (J) =26,000 different VH possible.

 

Total diversity, assuming all light chains can pair with all heavy chains:

 

26,000 x 1206 = 3 x 107 different antibodies

Mechanism of Joining:

Need to insure that the right segments are joined. For example, you would not want to join two J segments together instead of a VJ joining event. There are two conserved recombination signal sequences (RSS) found directly adjacent to the V,J, and D regions in immunoglobulins:

 

CACAGTG - 23 bp - ACAAAAACC GGTTTTTGT - 12 bp - CACTGTG

GTGTCAC - 23 bp - TGTTTTTGG CCAAAAACA - 12 bp - GTGACAC

Two-turn sequence One-turn sequence

 

When fragments are joined, a two-turn RSS can only join with a one-turn RSS. The position of these RSS in the heavy and light chain segments are as follows:

 

l -chain: Vl -2 RSS 1 RSS Jl

k -chain: Vk -1 RSS 2 RSS Jk

H-chain: VH ñ 2 RSS 1 RSS ñ D ñ 1RSS 2 RSS ñ J

 

Diversity in Joining:

  1. The end points of the RSS are the nominal points for joining heavy and light chains. However, there is some uncertainty (up to 5 bases) where the crossover point occurs. These leads to the loss of codons, and subsequent amino acids from CDR3.
  2. Addition of up to 15 nucleotides, by the enzyme terminal transferase, adds codons to CDR3. The added nucleotides are referred to as N nucleotides.

 

The loss and gain of codons from the joining regions increases the diversity by 10-1000 fold.

 

Productive Rearrangements and Allelic Exclusion:

The mature B-cell:

  1. must produce a functional antibody (productive rearrangement)
  2. will only produce a single light and heavy chain, although the genes from both parents are present in the immature B-cell (Allelic exclusion).

The steps that insure that the above happens are as follows:

  1. Cell attempts to make a functional heavy chain, if this fails the cell dies.
  2. Once a single heavy chain is functional, rearrangement of the other heavy chain allele is inhibited.
  3. Successful rearrangement of one k -chain inhibits rearrangement of the other k allele
  4. Unsuccessful rearrangement of both k -chains prompts rearrangement of l -chains.
  5. Unsuccessful rearrangements of l -chains causes death.

Class Switching:

 

 

 

Membrane Bound or Secreated:

 

The 3í end of every constant region consists of the following structure (example shown for IgM, m gene)

 

m 4-S---polyA ------ M1 ------ M2 ñpolyA

 

  1. m 4 is the last exon, coding for the 4th constant domain in IgM (the hinge region in IgM is replaced by another immunoglobulin domain.
  2. S codes for the carboxy-terminal sequence found in soluble IgM
  3. polyA: poly A cleavage and addition sites
  4. M1, M2 exons that code for the membrane domain.

 

Alternate use of the two poly A sites control expression of soluble IgM (m 4-S) or membrane bound (m 4-M1-M2) IgM.

 

Co-expression of IgM and IgD.

 

These two isotypes are expressed at the same time in the cell. The two constant domain gene segments are adjacent to each other on the chromosome. The 3í end of the m gene and the 3í end of the d gene have two polyadenylation sites (as discussed above). Alternative use of these four sites (and alternative splicing) leads to expression of IgM or IgD.

 

Somatic Mutation:

 

After stimulation with antigen, affinity maturation occurs. The CDR1 and CDR2 regions of immunoglobulins acquire mutations. Since those B-cells with higher affinity antibodies on the surface are stimulated by antigen, the resultant B-cell population that is selected has higher affinities. The mechanism for this event is current unknown.