DOW Experimental Design: Difference between revisions

DOW Experimental Design
State dimension:	1
Differential states:	11
Discrete control functions:	2
Path constraints:	4
Interior point equalities:	11

← Older edit

Latest revision as of 06:19, 26 August 2025

The DOW Experimental Design problem models the OED problem for the parameter estimation problem formulated by the DOW Chemical Co. in 1981. The following formulation is taken from [1].

Chemical background

The chemical species are disguised for proprietary reasons and the desired reaction is given by $H A + 2 B M \to A B + H B M H$ , where $A B$ is the desired product. The reactions are described as follows:

Slow Kinetic Reactions:

$\begin{array}{c} M^{-} + B M \underset{k_{- 1}}{\leftarrow} \overset{k_{1}}{\to} M B M^{-} \\ A^{-} + B M \overset{k_{2}}{\to} A B M^{-} \\ M^{-} + A B \underset{k_{- 3}}{\leftarrow} \overset{k_{3}}{\to} A B M^{-} \end{array}$

Acid-Base Reactions:

$\begin{array}{c} M B M H \leftarrow K_{1} \to M B M^{-} + H^{+} \\ H A \leftarrow K_{2} \to A^{-} + H^{+} \\ H A B M \leftarrow K_{3} \to A B M^{-} + H^{+} \end{array}$

In order to devise a model to account for these reactions, it is first necessary to distinguish between the overall concentration of a species and the concentration of its neutral form. Overall concentrations are defined for three components based on neutral and ionic species

$\begin{aligned} [H B M H] & = [(M B M H)_{N}] + [M B M^{-}] \\ [H A] & = [(H A)_{N}] + [A^{-}] \\ [H A B M] & = [(H A B M)_{N}] + [A B M^{-}] \end{aligned}$

Here $[]$ denotes the concentration of the species in $m o l / k g$ . By assuming the rapid acid-base reactions are at equilibrium, the equilibrium constants $K_{1}, K_{2}, K_{3}$ can be defined as

$\begin{aligned} K_{1} & = \frac{[M B M^{-}] [H^{+}]}{[(M B M H)_{N}]} \\ K_{2} & = \frac{[A^{-}] [H^{+}]}{[(H A)_{N}]} \\ K_{3} & = \frac{[A B M^{-}] [H^{+}]}{[(H A B M)_{N}]} \end{aligned}$

The anionic species may then be represented by

$\begin{aligned} [M B M^{-}] & = \frac{K_{1} [M B M H]}{K_{1} + [H^{+}]} & (a) \\ [A^{-}] & = \frac{K_{2} [H A]}{K_{2} + [H^{+}]} & (b) \\ [A B M^{-}] & = \frac{K_{3} [H A B M]}{K_{3} + [H^{+}]} & (c) \end{aligned}$

Material balance equations for the three reactants in the slow kinetic reactions yield:

$\begin{aligned} \frac{d [M^{-}]}{d t} & = - k_{1} [M^{-}] [B M] + k_{- 1} [M B M^{-}] - k_{3} [M^{-}] [A B] + k_{- 1} [A B M^{-}] & (d) \\ \frac{d [B M]}{d t} & = - k_{1} [M^{-}] [B M] + k_{- 1} [M B M^{-}] - k_{2} [A^{-}] [B M] & (e) \\ \frac{d [A B]}{d t} & = - k_{3} [M^{-}] [A B] + k_{- 3} [A B M^{-}] & (f) \end{aligned}$

From stoichiometry, rate expressions can also be written for the total species:

$\begin{aligned} \frac{d [M B M H]}{d t} & = k_{1} [M^{-}] [B M] - k_{- 1} [M B M^{-}] & (g) \\ \frac{d [H A]}{d t} & = k_{2} [A^{-}] [B M] & (h) \\ \frac{d [H A B M]}{d t} & = k_{2} [A^{-}] [B M] + k_{3} [M^{-}] [A B] - k_{- 3} [A B M^{-}] & (i) \end{aligned}$

An electroneutrality constraint gives the hydrogen ion concentration $[H^{+}]$ as

$[H^{+}] + [Q^{+}] = [M^{-}] + [M B M^{-}] + [A^{-}] + [A B M^{-}] (j)$

Based on similarities of reacting species, we assume

$k_{3} = k_{1}, k_{- 3} = \frac{1}{2} k_{- 1}$

With these assumptions, the three rate constants $k_{1}, k_{2}$ and $k_{- 1}$ must be estimated. Each of these can be fitted with two adjustable model parameters, assuming an Arrhenius temperature dependence. That is

$\begin{array}{c} k_{i} = α_{i} \exp (- E_{i} / (R T)), i \in {- 1, 1, 2} \end{array}$

Here $R \approx 1.98720425864083 c a l / (K \cdot m o l)$ is the gas constant and $T$ is the reaction temperature in Kelvins. The parameters $α_{i}$ , given in $m o l / (k g \cdot h)$ , represent the pre-exponential factors and the $E_{i}$ , with unit $c a l / m o l$ , are the activation energies.

Mathematical formulation

The chemical processes $(a) - (j)$ can be expressed mathematically as six differential equations and four algebraic equations:

$\begin{aligned} {\dot{y}}_{1} & = - k_{2} y_{8} y_{2} & (1), (h) \\ {\dot{y}}_{2} & = - k_{1} y_{6} y_{2} + k_{- 1} y_{10} - k_{2} y_{8} y_{2} & (2), (e) \\ {\dot{y}}_{3} & = k_{2} y_{8} y_{2} + k_{1} y_{6} y_{4} - \frac{1}{2} k_{- 1} y_{9} & (3), (i) \\ {\dot{y}}_{4} & = - k_{1} y_{6} y_{4} + \frac{1}{2} k_{- 1} y_{9} & (4), (f) \\ {\dot{y}}_{5} & = k_{1} y_{6} y_{2} - k_{- 1} y_{10} & (5), (g) \\ {\dot{y}}_{6} & = - k_{1} (y_{6} y_{2} + y_{6} y_{4}) + k_{- 1} (y_{10} + \frac{1}{2} y_{9}) & (6), (d) \\ y_{7} & = - [Q^{+}] + y_{6} + y_{8} + y_{9} + y_{10} & (7), (j) \\ y_{8} & = \frac{θ_{8} y_{1}}{θ_{8} + y_{7}} & (8), (b) \\ y_{9} & = \frac{θ_{9} y_{3}}{θ_{9} + y_{7}} & (9), (c) \\ y_{10} & = \frac{θ_{7} y_{5}}{θ_{7} + y_{7}} & (10), (a) \end{aligned}$

Here the letters in parentheses stand for the corresponding chemical process and the quantity $[Q^{+}] = 0.0131$ is a constant during the reaction. The nine parameters form the vector

$θ = (α_{1}, E_{1}, α_{2}, E_{2}, α_{- 1}, E_{- 1}, K_{1}, K_{2}, K_{3})$

The predicted concentrations form the vector

$y = (H A, B M, H A B M, A B, M B M H, M^{-}, H^{+}, A^{-}, A B M^{-}, M B M^{-})$

Let $f_{k} (\cdot)$ denote the right hand side of equation $(k)$ for $k = 1, \dots, 6$ .

The right hand sides of $(1) - (10)$ are summarized as the vector-valued function $f$ . Moreover, let

$\begin{aligned} f_{y, m, n} (\cdot) & = \frac{\partial f_{m} (\cdot)}{\partial y_{n}}, m, n \in {1, \dots, 10} \\ f_{θ, m, n} (\cdot) & = \frac{\partial f_{m} (\cdot)}{\partial θ_{n}}, m \in {1, \dots, 10}; n \in {1, \dots, 9} \end{aligned}$

Parameters

The initial parameter estimates are:

$α_{1}$	$2.0 \cdot 1 0^{13} m o l / (k g \cdot h)$
$α_{2}$	$2.0 \cdot 1 0^{13} m o l / (k g \cdot h)$
$α_{- 1}$	$4.3 \cdot 1 0^{15} m o l / (k g \cdot h)$
$E_{1}$	$2.0 \cdot 1 0^{4} c a l / m o l$
$E_{2}$	$2.0 \cdot 1 0^{4} c a l / m o l$
$E_{- 1}$	$2.0 \cdot 1 0^{4} c a l / m o l$
$K_{1}$	$1.0 \cdot 1 0^{- 17} m o l / k g$
$K_{2}$	$1.0 \cdot 1 0^{- 11} m o l / k g$
$K_{3}$	$1.0 \cdot 1 0^{- 17} m o l / k g$

Note that for the calculations all temperatures given in $^{\circ} C$ have to be rescaled to $K$ by adding $273.15$ .

There are three datasets for different temperatures $T$ , with corresponding starting values

	$4 0^{\circ} C$	$6 7^{\circ} C$	$10 0^{\circ} C$
$y_{1} (0)$	$1.7066$	$1.6749$	$1.5608$
$y_{2} (0)$	$8.32$	$8.2262$	$8.3546$
$y_{3} (0)$	$0.01$	$0.0104$	$0.0082$
$y_{4} (0)$	$0$	$0.0017$	$0.0086$

The initial model conditions in addition to those given in the data sets are:

$\begin{array}{l} y_{5} = 0 \\ y_{6} = [Q^{+}] \\ y_{7} = \frac{1}{2} \cdot (- K_{2} + \sqrt{K_{2}^{2} + 4 K_{2} y_{1} (0)}) \\ y_{8} = y_{7} \\ y_{9} = 0 \\ y_{10} = 0 \end{array}$

To reduce the intercorrelation between the parameters in the rate constants, we apply the following reparametrization (cf. [4].):

$\begin{aligned} k_{i} & = α_{i} \cdot \exp (- \frac{E_{i}}{R T}) \\ = k_{0, i} \cdot \exp (- \frac{E_{i}}{R} \cdot (\frac{1}{T} - \frac{1}{T_{0}})), i = 1, 2, - 1 \end{aligned}$

in which $k_{0, i} = α_{i} \cdot \exp (- \frac{E_{i}}{R T_{0}})$ . The reference temperature in $T_{0}$ is chosen as the average over all performed experiments, i.e., $T_{0} = 6 9^{\circ} C$ . Additionally, we add a logarithmic transformation, which gives rise to the following transformed starting values:

$\ln k_{0, 1}$	$1.194$
$\ln k_{0, 2}$	$1.194$
$\ln k_{0, - 1}$	$6.565$
$E_{1}$	$2.0 \cdot 1 0^{4}$
$E_{2}$	$2.0 \cdot 1 0^{4}$
$E_{- 1}$	$2.0 \cdot 1 0^{4}$
$\ln K_{1}$	$- 34.54$
$\ln K_{2}$	$- 25.33$
$\ln K_{3}$	$- 39.14$

Optimal Experimental Design Problem

To be specified.

We are interested in when to measure (with an upper bound $M_{i}$ on the measuring time for each observable). We define

$\begin{aligned} f_{y} (\cdot) & \in ℝ^{10 \times 10} & with (f_{y})_{i, j} = f_{y, i, j}, \\ f_{θ} (\cdot) & \in ℝ^{10 \times 9} & with (f_{θ})_{i, j} = f_{θ, i, j} \end{aligned}$

In this approach, we add the so-called sensitivities $G = d y / d θ$ . For the differential equations this means

$\dot{G} (t) = f_{y} (y (t), θ) G (t) + f_{θ} (y (t), θ), G (0) = \frac{\partial y (0)}{\partial θ}$

Now we formulate the OED problem as described in [2].

$\begin{array}{lll} \min_{y, G, F, z, w} & trace (F^{- 1} (t_{f})) \\ subject to \\ \dot{y} (t) & = & f (y (t), θ) \\ \dot{G} (t) & = & f_{y} (y (t), θ) G (t) + f_{θ} (y (t), θ) \\ \dot{F} (t) & = & \sum_{i = 1}^{n_{o}} w_{i} (t) (h_{y}^{i} (y (t)) G (t))^{T} (h_{y}^{i} (y (t)) G (t)) \\ \dot{z} (t) & = & w (t), \\ y (0) & = & y_{0} \\ G (0) & = & \frac{\partial y (0)}{\partial θ} \\ F (0) & = & 0, \\ z (0) & = & 0 \\ w (t) & \in & 𝒲 \\ z_{i} (t_{f}) & \leq & M_{i} \end{array}$

Here $h : ℝ^{10} \to ℝ^{n_{o}}$ is the observed function. The evolution of the symmetric matrix $F : [0, t_{f}] \to ℝ^{9 \times 9}$ is given by the weighted sum of observability Gramians $h_{y}^{i} (y (t)) G (t), i = 1, \dots, n_{o}$ for each observed function of states. The weights $w_{i} (t) \in {0, 1}, i = 1, \dots, n_{o}$ are the (binary) sampling decisions, where $w_{i} (t) = 1$ denotes the decision to perform a measurement at time $t$ .

Miscellaneous and Further Reading

To be specified.

References

[1] "Nonlinear Parameter Estimation: a Case Study Comparison" by L. T. Biegler and J. J. Damiano
[2] "Optimal Experimental Design for Universal Differential Equations" by C. Plate, C.J. Martensen and S. Sager
[3] "Parameter estimation in nonlinear systems" by W.J.H. Stortelder
[4] "Parameter Estimation in Nonlinear Dynamical Systems" by Morten Rode Kristensen