Blog
Strain Dev

Strain Selection Before the Pilot: Using Flux Phenotype to Pick the Right Candidate

Not all high-titer bench strains are equally scale-up-ready. How metabolic flux phenotyping from bench data predicts which strain candidates will maintain their performance under the physical constraints of pilot scale.

Fermvyne Science Team 8 min read
Strain Selection Before the Pilot: Using Flux Phenotype to Pick the Right Candidate

You have three E. coli strain variants on your shortlist: the parent strain (WT backbone), a variant with reduced acetate kinase activity (ΔackA), and a variant with overexpressed citrate synthase (gltA↑). All three have been screened at shake flask level. The WT produced 38 g/L titer in the best shake flask replicate. The ΔackA produced 42 g/L. The gltA↑ produced 35 g/L but with cleaner acetate profile. Which one do you put in the pilot?

The shake flask screen tells you the bench-scale phenotype. It does not tell you which strain's phenotype will be most advantageous when the physical environment changes at 500L — specifically, when oxygen transfer becomes limiting, substrate gradients appear, and mixing time increases 20×. Choosing the wrong strain for pilot is an expensive mistake. Choosing it for the wrong reasons — because it was best in shake flasks, not because it's most likely to maintain its advantage at scale — is a preventable mistake.

What Shake Flask Data Tells You (and Doesn't)

Shake flask screening is well-suited for rapid phenotypic comparison under conditions where oxygen transfer is not limiting (kLa in a 250mL baffled flask at 250 rpm is typically 100–200 h⁻¹, adequate for most aerobic organisms below OD600 10–15), nutrient supply is batch (no fed-batch dynamics), and mixing is essentially instantaneous relative to metabolic rates.

What shake flasks measure well:

  • Specific growth rate (μ) under non-limiting conditions
  • Biomass yield (Yx/s) on the primary carbon source in batch mode
  • Qualitative assessment of overflow metabolism during batch exponential phase (acetate or ethanol detectable in broth at the end of batch)
  • Product yield (Yp/s) under batch growth conditions

What shake flasks systematically fail to capture for scale-up decisions:

  • Performance under fed-batch substrate limitation (where qs is externally controlled, not set by batch growth kinetics)
  • Performance at high cell density (OD600 above 30–40) where oxygen supply becomes challenging even at bench scale
  • Overflow onset threshold at different qs values (the q_glc_max parameter that determines where overflow begins)
  • Robustness to transient oxygen limitation (which occurs at scale even in well-operated vessels)

The Flux Phenotype That Matters for Scale-Up

The metabolic phenotype that predicts scale-up performance is not the shake flask titer at the best replicate. It is the relationship between specific substrate uptake rate (qs) and specific oxygen uptake rate (qO2) across a range of qs values. This relationship encodes the strain's overflow threshold.

Specifically, what you want to know for each candidate strain:

  1. q_glc_max: The maximum specific glucose uptake rate at which the strain can maintain fully oxidative metabolism. Above this threshold, every additional glucose consumed results in overflow metabolite secretion.
  2. qO2 at μmax under full aeration: The peak oxygen demand the strain generates at its maximum growth rate. This determines the minimum kLa the pilot vessel must achieve to sustain the strain's maximum productivity.
  3. Product yield at sub-maximal qs: If you constrain qs to below q_glc_max (as you must at scale to avoid overflow), what is the product yield Yp/s and the volumetric productivity at the constrained growth rate? A strain with 10% higher shake flask titer may have lower volumetric productivity at the constrained qs relevant to scale-up conditions.

Measuring the Scale-Up-Relevant Phenotype at Bench

Extracting the scale-up-relevant flux phenotype requires bench-scale fed-batch runs, not shake flask screens. Specifically:

Fed-batch characterization protocol

  1. Run a batch phase to OD600 10–15 in your standard media.
  2. Start a glucose fed-batch at a conservative initial specific feed rate (approximately 0.5× the expected q_glc_max for your organism).
  3. Ramp the feed rate in steps: 0.5×, 0.8×, 1.0×, 1.2× q_glc_max (where q_glc_max is estimated from literature or a preliminary run). At each step, hold for 2 residence times (approximately 2/μ hours) and measure: OD600, glucose, acetate (offline HPLC or spectrophotometric), DO.
  4. Record the qs and qO2 at each feed rate step from: qs = (F × C_feed) / (V × DCW), qO2 from off-gas analysis or dynamic O₂ balance.
  5. The feed rate at which acetate first appears in your offline samples defines q_glc_max for that strain under those conditions.

This protocol generates a q_glc_max value, a qO2 vs qs curve, and a Yp/s vs qs curve — the minimum information set needed to predict which strain will perform best at scale.

Applying FBA to Rank Candidate Strains for Scale-Up

Once you have the bench-scale flux phenotype for each candidate strain, you can apply FBA to predict which strain will maintain the best productivity under the physical constraints of the pilot vessel.

The procedure:

  1. Parameterize a FBA model for each candidate strain using its measured exchange fluxes (qs, μ, qp, qO2) from the bench fed-batch characterization.
  2. Set the oxygen uptake bound in each model to the maximum OUR your pilot vessel can sustain at the target operating conditions: OUR_max = kLa × (DO* − DO_setpoint).
  3. Run FBA for each strain model at the pilot-scale oxygen constraint. What is the predicted qs, qp, and overflow metabolite secretion rate under pilot-scale oxygen limitation?
  4. Rank strains by predicted volumetric productivity (qp × DCW_target) at the pilot-scale oxygen constraint.

This ranking is often different from the shake flask ranking. The strain with the highest shake flask titer (highest qp at unlimited qs) may have a lower q_glc_max than a competitor strain — meaning it enters overflow metabolism at a lower feed rate, and its pilot-scale productivity under oxygen-limited conditions is lower. The strain with lower shake flask titer but higher q_glc_max may maintain its productivity better at scale because it can be fed more aggressively without overflowing at the oxygen-limited qs that pilot conditions allow.

A Concrete Example

Consider three strains with the following bench-scale characterization:

  • Strain A: Shake flask titer 42 g/L. q_glc_max = 0.85 g/g·h. qO2 at qmax = 18 mmol/g·h.
  • Strain B: Shake flask titer 38 g/L. q_glc_max = 1.15 g/g·h. qO2 at qmax = 15 mmol/g·h.
  • Strain C: Shake flask titer 35 g/L. q_glc_max = 1.05 g/g·h. qO2 at qmax = 14 mmol/g·h.

At 500L pilot with kLa = 120 h⁻¹ and DO setpoint 30%, maximum OTR = 120 × 7.2 × 0.70 = 605 mg O₂/L·h = 18.9 mmol O₂/L·h.

At target DCW = 30 g/L, maximum sustainable qO2 = 18.9/30 = 0.63 mmol/g·h... wait, that's 0.63 mmol/g·h which is clearly below all the strains' demands at q_max. But this shows the constraint: at 30 g/L DCW, the maximum qs each strain can sustain without overflow under pilot oxygen limitation is:

Strain A: max qs at pilot = min(q_glc_max, OUR_max / (OUR/qs ratio at overflow))
         = min(0.85, 0.63 mmol/g·h / (18 mmol/g·h / 0.85 × correction))

The calculation clarifies the ranking: Strain B, with the highest q_glc_max and the lowest oxygen demand per unit substrate, can be fed most aggressively under pilot-scale oxygen constraint without entering overflow. Even though its shake flask titer was the middle-ranking result, its volumetric productivity at pilot scale is highest.

References

  • Varma A, Palsson BO. Stoichiometric flux balance models quantitatively predict growth and metabolic by-product secretion. Appl Environ Microbiol. 1994;60(10):3724–3731.
  • Lee SY. High cell-density culture of Escherichia coli. Trends Biotechnol. 1996;14(3):98–105.
  • Schaub J, Mauch K, Reuss M. Metabolic flux analysis in Escherichia coli by integrating isotopic dynamic and isotopic stationary ¹³C labeling, extracellular flux data, and biomass data. Biotechnol Bioeng. 2008;99(5):1170–1185.