You have three E. coli strain variants on your shortlist: the parent strain (WT backbone), a variant with reduced acetate kinase activity (ΔackA), and a variant with overexpressed citrate synthase (gltA↑). All three have been screened at shake flask level. The WT produced 38 g/L titer in the best shake flask replicate. The ΔackA produced 42 g/L. The gltA↑ produced 35 g/L but with cleaner acetate profile. Which one do you put in the pilot?
The shake flask screen tells you the bench-scale phenotype. It does not tell you which strain's phenotype will be most advantageous when the physical environment changes at 500L — specifically, when oxygen transfer becomes limiting, substrate gradients appear, and mixing time increases 20×. Choosing the wrong strain for pilot is an expensive mistake. Choosing it for the wrong reasons — because it was best in shake flasks, not because it's most likely to maintain its advantage at scale — is a preventable mistake.
What Shake Flask Data Tells You (and Doesn't)
Shake flask screening is well-suited for rapid phenotypic comparison under conditions where oxygen transfer is not limiting (kLa in a 250mL baffled flask at 250 rpm is typically 100–200 h⁻¹, adequate for most aerobic organisms below OD600 10–15), nutrient supply is batch (no fed-batch dynamics), and mixing is essentially instantaneous relative to metabolic rates.
What shake flasks measure well:
- Specific growth rate (μ) under non-limiting conditions
- Biomass yield (Yx/s) on the primary carbon source in batch mode
- Qualitative assessment of overflow metabolism during batch exponential phase (acetate or ethanol detectable in broth at the end of batch)
- Product yield (Yp/s) under batch growth conditions
What shake flasks systematically fail to capture for scale-up decisions:
- Performance under fed-batch substrate limitation (where qs is externally controlled, not set by batch growth kinetics)
- Performance at high cell density (OD600 above 30–40) where oxygen supply becomes challenging even at bench scale
- Overflow onset threshold at different qs values (the q_glc_max parameter that determines where overflow begins)
- Robustness to transient oxygen limitation (which occurs at scale even in well-operated vessels)
The Flux Phenotype That Matters for Scale-Up
The metabolic phenotype that predicts scale-up performance is not the shake flask titer at the best replicate. It is the relationship between specific substrate uptake rate (qs) and specific oxygen uptake rate (qO2) across a range of qs values. This relationship encodes the strain's overflow threshold.
Specifically, what you want to know for each candidate strain:
- q_glc_max: The maximum specific glucose uptake rate at which the strain can maintain fully oxidative metabolism. Above this threshold, every additional glucose consumed results in overflow metabolite secretion.
- qO2 at μmax under full aeration: The peak oxygen demand the strain generates at its maximum growth rate. This determines the minimum kLa the pilot vessel must achieve to sustain the strain's maximum productivity.
- Product yield at sub-maximal qs: If you constrain qs to below q_glc_max (as you must at scale to avoid overflow), what is the product yield Yp/s and the volumetric productivity at the constrained growth rate? A strain with 10% higher shake flask titer may have lower volumetric productivity at the constrained qs relevant to scale-up conditions.
Measuring the Scale-Up-Relevant Phenotype at Bench
Extracting the scale-up-relevant flux phenotype requires bench-scale fed-batch runs, not shake flask screens. Specifically:
Fed-batch characterization protocol
- Run a batch phase to OD600 10–15 in your standard media.
- Start a glucose fed-batch at a conservative initial specific feed rate (approximately 0.5× the expected q_glc_max for your organism).
- Ramp the feed rate in steps: 0.5×, 0.8×, 1.0×, 1.2× q_glc_max (where q_glc_max is estimated from literature or a preliminary run). At each step, hold for 2 residence times (approximately 2/μ hours) and measure: OD600, glucose, acetate (offline HPLC or spectrophotometric), DO.
- Record the qs and qO2 at each feed rate step from: qs = (F × C_feed) / (V × DCW), qO2 from off-gas analysis or dynamic O₂ balance.
- The feed rate at which acetate first appears in your offline samples defines q_glc_max for that strain under those conditions.
This protocol generates a q_glc_max value, a qO2 vs qs curve, and a Yp/s vs qs curve — the minimum information set needed to predict which strain will perform best at scale.
Applying FBA to Rank Candidate Strains for Scale-Up
Once you have the bench-scale flux phenotype for each candidate strain, you can apply FBA to predict which strain will maintain the best productivity under the physical constraints of the pilot vessel.
The procedure:
- Parameterize a FBA model for each candidate strain using its measured exchange fluxes (qs, μ, qp, qO2) from the bench fed-batch characterization.
- Set the oxygen uptake bound in each model to the maximum OUR your pilot vessel can sustain at the target operating conditions: OUR_max = kLa × (DO* − DO_setpoint).
- Run FBA for each strain model at the pilot-scale oxygen constraint. What is the predicted qs, qp, and overflow metabolite secretion rate under pilot-scale oxygen limitation?
- Rank strains by predicted volumetric productivity (qp × DCW_target) at the pilot-scale oxygen constraint.
This ranking is often different from the shake flask ranking. The strain with the highest shake flask titer (highest qp at unlimited qs) may have a lower q_glc_max than a competitor strain — meaning it enters overflow metabolism at a lower feed rate, and its pilot-scale productivity under oxygen-limited conditions is lower. The strain with lower shake flask titer but higher q_glc_max may maintain its productivity better at scale because it can be fed more aggressively without overflowing at the oxygen-limited qs that pilot conditions allow.
A Concrete Example
Consider three strains with the following bench-scale characterization:
- Strain A: Shake flask titer 42 g/L. q_glc_max = 0.85 g/g·h. qO2 at qmax = 18 mmol/g·h.
- Strain B: Shake flask titer 38 g/L. q_glc_max = 1.15 g/g·h. qO2 at qmax = 15 mmol/g·h.
- Strain C: Shake flask titer 35 g/L. q_glc_max = 1.05 g/g·h. qO2 at qmax = 14 mmol/g·h.
At 500L pilot with kLa = 120 h⁻¹ and DO setpoint 30%, maximum OTR = 120 × 7.2 × 0.70 = 605 mg O₂/L·h = 18.9 mmol O₂/L·h.
At target DCW = 30 g/L, maximum sustainable qO2 = 18.9/30 = 0.63 mmol/g·h... wait, that's 0.63 mmol/g·h which is clearly below all the strains' demands at q_max. But this shows the constraint: at 30 g/L DCW, the maximum qs each strain can sustain without overflow under pilot oxygen limitation is:
Strain A: max qs at pilot = min(q_glc_max, OUR_max / (OUR/qs ratio at overflow))
= min(0.85, 0.63 mmol/g·h / (18 mmol/g·h / 0.85 × correction))
The calculation clarifies the ranking: Strain B, with the highest q_glc_max and the lowest oxygen demand per unit substrate, can be fed most aggressively under pilot-scale oxygen constraint without entering overflow. Even though its shake flask titer was the middle-ranking result, its volumetric productivity at pilot scale is highest.
References
- Varma A, Palsson BO. Stoichiometric flux balance models quantitatively predict growth and metabolic by-product secretion. Appl Environ Microbiol. 1994;60(10):3724–3731.
- Lee SY. High cell-density culture of Escherichia coli. Trends Biotechnol. 1996;14(3):98–105.
- Schaub J, Mauch K, Reuss M. Metabolic flux analysis in Escherichia coli by integrating isotopic dynamic and isotopic stationary ¹³C labeling, extracellular flux data, and biomass data. Biotechnol Bioeng. 2008;99(5):1170–1185.