Blinded Sample Size Re-estimation for Continuous Endpoints - Part 2
- Andrew Yan

- Sep 7
- 2 min read
Updated: Oct 20
As noted in Part 1 of this series, the combined data from a two-group parallel study with a continuous endpoint can be viewed as a random sample from a two-component Gaussian mixture model with a known mixture proportion. Let š denote an observation from the combined data, then
where š is the mixture proportion, µā and µā are the component means, and ϲ is the common variance. Let šæ = µā - µā and š = 1/2, then the variance of the mixture distribution is given by
Eq. (1) indicates that blinded sample size re-estimation (SSR) can be performed using the sample variance of the combined data when either ϲ or šæĀ² is reliably obtained from historical studies. However, this reliance on historical information is a key limitation of this approach. The good news is that the (non-excess) kurtosis š of the mixture distribution is
We can use Eq. (1) and Eq. (2) to solve for šæĀ² and ϲ, then obtain
Eq. (3) implies that the ratio š = šæĀ²/ϲ can be estimated by
where š Ģ denotes the sample kurtosis of the pooled data. It's clear that this estimator is well defined only if 1 < š Ģ ā¤ 3, so it will be defined as zero if š Ģ >3 (note that it is unlikely to observe š Ģ ā¤ 1 in practice).
Since the required sample size depends on šæĀ² and ϲ only through the ratio šĀ = šæĀ²/ϲ, Eq. (4) suggests that blinded SSR can be performed using the pooled sample kurtosis, eliminating the need for historical estimates of either parameter. But how good is this estimator?
The graph below shows the effect size (šæ/Ļ) in the range of [0, 1] as a function of the kurtosis (š ), where the solid curve represents the most likely scenarios in clinical trials. It is clear that the effect size is very sensitive to the changes in the kurtosis, which has a narrow range of 2.876 to 3 when šæ/Ļ ā [0, 0.8]. This suggests that a small bias in the estimate of š can lead to a large bias in that of the effect size.

It can be shown that š Ģ has asymptotic variance
where š is the pooled sample size. Using the delta method, we have
where š'(š ) = -(š+4)³/16š is the first order derivative of š at š .
The following table reports the asymptotic variances Var(š Ģ) and Var(š^) evaluated at different š (and corresponding šæ/Ļ) values spanning a range commonly encountered in practice.

These results indicate that, despite its mathematical elegance, the estimator for šĀ in Eq. (4) is extremely unstable and therefore unsuitable for practical use.
Comments