BiologyMolecular BiologyMedium

Exon

Also known as:Expressed sequenceCoding sequence segment

An exon is any nucleotide sequence within a gene that is represented in the mature, spliced mRNA and is ultimately translated into protein (or retained in functional non-coding RNAs). After transcription, exons are joined together following the removal of introns to form the continuous open reading frame. The modular nature of exons provides an evolutionary advantage: exon shuffling — the rearrangement of exon modules between genes — is a major mechanism for generating new protein functions and domain architectures.

Human Genome Exon Statistics

ParameterTypical ValueNotes
Average exon length~170 bpMuch shorter than average intron (~3,500 bp)
Average exons per gene~8–9Range from 1 to over 300 (titin gene)
Total exonic fraction of genome~1.5%Only ~3% includes UTRs
Coding exon content~1.0% of genomeTranslates to protein sequences
First exon (5' UTR + start)Often untranslated partiallyContains Kozak/5'UTR regulatory elements

Interactive Tools

Ensembl Genome Browser

Visualise exon-intron structure and transcript isoforms for any gene

Open Tool

Khan Academy — RNA Processing

Video and article on exons and the splicing process

Open Tool

NCBI RefSeq

Reference sequences with annotated exon coordinates for all human genes

Open Tool
Annotated diagram of a eukaryotic gene structure highlighting exons alongside introns and regulatory regions

Wikimedia Commons, CC BY-SA

Related Terms

Biology

Intron

An intron (intervening sequence) is a non-coding segment of a pre-mRNA transcript that is removed by RNA splicing before the mature mRNA is translated into protein. Introns are excised as lariat-shaped structures by the spliceosome, a large ribonucleoprotein complex that recognises conserved splice-site sequences at intron boundaries. Although historically considered "junk DNA," introns are now understood to play important roles in gene regulation, alternative splicing, and as sources of small non-coding RNAs such as snoRNAs and microRNAs.

Biology

Alternative Splicing

Alternative splicing is a regulated process during gene expression in which particular exons of a pre-mRNA transcript are included or excluded to generate multiple distinct mature mRNA isoforms from a single gene. This mechanism dramatically expands proteomic diversity; estimates suggest that over 95% of human multi-exon genes undergo alternative splicing, allowing a genome of ~20,000 genes to produce hundreds of thousands of protein variants. Mis-regulation of alternative splicing is implicated in many diseases including spinal muscular atrophy, various cancers, and neurological disorders.

Biology

Central Dogma of Biology

The Central Dogma of Molecular Biology, formulated by Francis Crick in 1958, describes the general flow of genetic information within a biological system: DNA is transcribed into RNA, and RNA is translated into protein. Crick's original statement also specified that information transfer from protein back to nucleic acid does not normally occur, establishing a directional framework for gene expression. The discovery of reverse transcriptase in retroviruses revealed that RNA can also be reverse-transcribed into DNA, representing a known exception to the standard flow while still consistent with Crick's framework.

Coined by Walter Gilbert in 1978 from "ex(pressed)" + "-on". Gilbert used the term to describe the portions of a gene that are expressed (i.e., appear) in the final mRNA, in contrast to the intervening introns that are removed.

rna-processinggene-expressionmrnacoding-sequencemolecular-biology