How to Design a Plasmid for Bacterial Expression: Six Decisions That Determine Whether You Get Protein
Designing a plasmid for bacterial expression means making a series of decisions that each constrain the next. Choose the wrong promoter and your protein is toxic before induction. Choose the wrong origin and your plasmid is unstable at high copy number. Choose the wrong tag position and your protein misfolds. This guide walks through each decision in order, with the practical thresholds and tradeoffs that determine whether you get soluble protein or inclusion bodies.
The Bacterial Expression Plasmid Design Checklist
Every expression plasmid for E. coli requires six components. The decisions are interdependent, so work through them in this order:
- Promoter system (controls when and how strongly your gene is transcribed)
- Origin of replication (controls plasmid copy number per cell)
- Selectable marker (antibiotic resistance for maintaining the plasmid)
- Ribosome binding site (Shine-Dalgarno sequence for translation initiation)
- Purification/detection tag (His, GST, SUMO, or none)
- Terminator (stops transcription downstream of your gene)
Decision 1: Choosing a Promoter System
The promoter is the single most important design choice. It determines expression level, inducibility, and whether your cells survive long enough to make protein.
| Promoter | Inducer | Strength | Leakiness | Best For |
|---|---|---|---|---|
| T7/lac (pET system) | IPTG | Very high | Moderate | High-yield expression of non-toxic proteins |
| T7/lac + pLysS | IPTG | Very high | Low | Toxic proteins (T7 lysozyme suppresses basal expression) |
| trc | IPTG | High | Low-moderate | Moderate expression without requiring DE3 lysogen |
| lac | IPTG or lactose | Moderate | Moderate | Low-level expression, metabolic engineering |
| araBAD (pBAD) | L-arabinose | Tunable | Very low | Dose-dependent expression, toxic proteins |
| rhaPBAD | L-rhamnose | Tunable | Very low | Tight regulation, orthogonal to IPTG systems |
The T7/lac promoter system (pET vectors from Novagen/MilliporeSigma) dominates bacterial expression. It requires a host strain carrying the DE3 lysogen (e.g., BL21(DE3)), which provides T7 RNA polymerase under lac control. IPTG induces T7 RNAP expression, which then drives transcription from the T7 promoter on your plasmid at very high rates.
If your protein is toxic to E. coli, leaky expression from the uninduced T7 promoter may kill cells before induction. Solutions: use BL21(DE3)pLysS (constitutive T7 lysozyme reduces basal T7 RNAP activity), switch to the tightly regulated pBAD system, or lower the copy number (see Decision 2).
Decision 2: Origin of Replication and Copy Number
The origin of replication (ori) determines how many copies of your plasmid exist per cell. Higher copy number means more template for transcription—but also more metabolic burden and more basal (leaky) expression.
| Origin | Copy Number | Common Vectors | Notes |
|---|---|---|---|
| pMB1/ColE1 (mutant) | ~500–700 | pUC, pBluescript | Very high copy; good for DNA prep, stressful for expression |
| pMB1 (wild-type) | ~15–20 | pBR322, pET | Standard for expression vectors; balances yield and stability |
| p15A | ~10–12 | pACYC184 | Compatible with pMB1 for co-expression; lower copy |
| pSC101 | ~5 | pSC101-based | Low copy; good for toxic proteins or metabolic engineering |
For most expression work, the pMB1 wild-type origin (~15–20 copies/cell, as in pET vectors) is the right choice. It provides enough template for high-level induction without overwhelming the cell during growth.
Decision 3: Selectable Marker
The antibiotic resistance marker keeps selective pressure on cells to maintain the plasmid. Without selection, plasmid-free cells outgrow plasmid-bearing cells because they lack the metabolic burden.
| Marker | Antibiotic | Mechanism | Notes |
|---|---|---|---|
| bla (AmpR) | Ampicillin | Beta-lactamase degrades ampicillin | Secreted enzyme; ampicillin degrades over time—satellite colonies appear after 16+ hours |
| aph (KanR) | Kanamycin | Aminoglycoside phosphotransferase | Stable selection; no satellite colony problem; preferred for expression |
| cat (CmR) | Chloramphenicol | Chloramphenicol acetyltransferase | Bacteriostatic; commonly used with pLysS and pACYC-based vectors |
Kanamycin resistance is preferred for expression vectors because ampicillin selection degrades over time: the beta-lactamase enzyme is secreted and destroys ampicillin in the medium, allowing plasmid-free satellite colonies to appear on overnight plates. This is a common source of contamination and low expression yields.
Decision 4: Ribosome Binding Site
In prokaryotes, translation initiation requires a Shine-Dalgarno (SD) sequence upstream of the start codon. The SD sequence (consensus: AGGAGG) base-pairs with the 16S rRNA to position the ribosome at the AUG start codon. The spacing between the SD sequence and the start codon (typically 5–10 nt, optimally ~7 nt) critically affects translation efficiency.
Most commercial expression vectors (pET, pBAD, pTrc) include an optimized RBS. If you are building a custom vector, use the RBS Calculator to design an SD sequence with a predicted translation initiation rate matched to your needs.
Decision 5: Purification and Detection Tags
Tags serve two purposes: enabling affinity purification and allowing detection (Western blot). The choice of tag, its position (N- or C-terminal), and whether to include a cleavage site all affect your downstream workflow.
| Tag | Size | Purification Method | Cleavable? | Best For |
|---|---|---|---|---|
| 6×His | ~0.8 kDa | Ni-NTA (IMAC) | Optional (TEV, thrombin site) | Routine purification; small, rarely affects function |
| GST | ~26 kDa | Glutathione resin | Yes (PreScission, thrombin) | Improved solubility; large but effective |
| MBP | ~42 kDa | Amylose resin | Yes (TEV, Factor Xa) | Best solubility enhancement for aggregation-prone proteins |
| SUMO | ~12 kDa | Ni-NTA (via His-SUMO) | Yes (SUMO protease, scarless) | Solubility + native N-terminus after cleavage |
| Strep-tag II | ~1 kDa | Strep-Tactin resin | Optional | High purity in one step; no metal ions |
For most initial expression tests, a C-terminal 6×His tag is the simplest starting point. It rarely interferes with folding and enables single-step Ni-NTA purification. If solubility is a problem, move to N-terminal MBP-His or SUMO-His fusions.
Decision 6: Terminator
A transcription terminator downstream of your gene prevents read-through transcription into the backbone, which can destabilize the plasmid and interfere with antibiotic resistance gene expression. The T7 terminator is standard in pET vectors. For non-T7 systems, the rrnB T1T2 terminator is commonly used.
Missing terminators are a frequent oversight in custom-built vectors. If you see unexpectedly low plasmid yields or unstable constructs, check for a functional terminator downstream of your insert.
Putting It Together: A pET Expression Example
Promoter: T7/lac (pET-28a) — high yield expected, non-toxic protein, IPTG-inducible
Origin: pMB1 (~20 copies/cell) — standard for pET vectors
Marker: KanR — stable selection, no satellite colonies
RBS: pET-28a built-in SD sequence (optimized spacing)
Tag: N-terminal 6×His with thrombin cleavage site — allows purification and optional tag removal
Terminator: T7 terminator (built into pET-28a backbone)
Host: BL21(DE3) — protease-deficient, DE3 lysogen for T7 RNAP
Induce at OD600 = 0.4–0.8 with 0.1–1 mM IPTG. For soluble expression, try 18°C overnight induction first (slower folding = fewer inclusion bodies). If yield is insufficient at 18°C, try 30°C for 4 hours or 37°C for 2–3 hours.
Common Design Mistakes in Bacterial Expression Plasmids
- Wrong host strain for the promoter: T7 promoter in a non-DE3 strain = zero expression. araBAD promoter in a strain that metabolizes arabinose = poor induction.
- Codon usage mismatch: Genes from eukaryotes or AT-rich organisms contain rare codons for E. coli. Consecutive rare codons stall ribosomes and truncate the protein. Use codon-optimized synthetic genes or co-express rare tRNAs (BL21-CodonPlus, Rosetta strains). See our guide on codon optimization for E. coli expression.
- No terminator: read-through transcription destabilizes the plasmid and reduces yield.
- Over-induction at 37°C: strong promoter + high IPTG + 37°C often drives protein into inclusion bodies. Lower the temperature, reduce IPTG concentration, or both.
- Ampicillin selection for overnight cultures: ampicillin degrades; satellite colonies contaminate your starter culture. Use kanamycin-resistant vectors when possible.
For more on how restriction enzyme site selection affects your cloning strategy when building expression vectors, see our guide on restriction enzyme digestion temperature and buffer decisions.
Try PlasmidStudio
AI-assisted plasmid design with automated validation. Start free — $0 to sign up.
Get started free