##
**A novel design for estimating relative accuracy of screening tests when complete disease verification is not feasible.**
*(English)*
Zbl 1097.62120

Summary: The accuracy (sensitivity and specificity) of a new screening test can be compared with that of a standard test by applying both tests to a group of subjects in which disease status can be determined by a gold standard (GS) test. However, it is not always feasible to administer a GS test to all study subjects. For example, a study is planned to determine whether a new screening test for cervical cancer (“ThinPrep”) is better than the standard test (“Pap”), and in this setting it is not feasible (or ethical) to determine disease status by biopsy in order to identify women with and without disease for participation in a study. When determination of disease status is not possible for all study subjects, the relative accuracy of two screening tests can still be estimated by using a paired screen-positive (PSP) design in which all subjects receive both screening tests, but only have the GS test if one of the screening tests is positive. Unfortunately in the cervical cancer example, the PSP design is also infeasible because it is not technically possible to administer both the ThinPrep and the Pap at the same time.

We describe a randomized paired screen-positive (RPSP) design in which subjects are randomized to receive one of the two screening tests initially, and only receive the other screening test and GS if the first screening test is positive. We derive maximum likelihood estimators and confidence intervals for the relative accuracy of the two screening tests, and assess the small sample behavior of these estimators using simulation studies. Sample size formulae are derived and applied to the cervical cancer screening trial example, and the efficiency of the RPSP design is compared with other designs.

We describe a randomized paired screen-positive (RPSP) design in which subjects are randomized to receive one of the two screening tests initially, and only receive the other screening test and GS if the first screening test is positive. We derive maximum likelihood estimators and confidence intervals for the relative accuracy of the two screening tests, and assess the small sample behavior of these estimators using simulation studies. Sample size formulae are derived and applied to the cervical cancer screening trial example, and the efficiency of the RPSP design is compared with other designs.

### MSC:

62P10 | Applications of statistics to biology and medical sciences; meta analysis |

### Keywords:

cervical cancer; inverse probability; weighting; Pap; screening; sensitivity; specificity; ThinPrep; verification bias
PDF
BibTeX
XML
Cite

\textit{T. A. Alonzo} and \textit{J. M. Kittelson}, Biometrics 62, No. 2, 605--612 (2006; Zbl 1097.62120)

Full Text:
DOI

### References:

[1] | Alonzo, Verification bias-corrected estimators of the relative true and false positive rates of two binary screening tests, Statistics in Medicine 24 pp 403– (2005) |

[2] | Alonzo, Sample size calculations for comparative studies of medical tests for detecting presence of disease, Statistics in Medicine 21 pp 835– (2002) |

[3] | ASCUS-LSIL Triage Study (ALTS) Group, A randomized trial on the management of low-grade squamos intraepithelial lesion cytology interpretations, American Journal of Obstetrics and Gynecology 188 pp 1393– (2003) |

[4] | Baker, A proposed design and analysis for comparing digital and analog mammography: Special ROC methods for cancer screening, Journal of the American Statistical Association 96 pp 421– (2001) · Zbl 1031.62093 |

[5] | Baker, The partial testing design: A less costly way to test equivalence for sensitivity and specificity, Statistics in Medicine 17 pp 2219– (1998) |

[6] | Begg, Assessment of diagnostic tests when disease is subject to selection bias, Biometrics 39 pp 207– (1983) |

[7] | Cheng, Comparison of the accuracy of two tests with a confirmatory procedure limited to positive results, Epidemiology 8 pp 104– (1997) |

[8] | Cox, Impact of Liquid-Based Cytology on the National Cervical Cancer Screening Programme (2004) |

[9] | Horvitz, A generalization of sampling without replacement from a finite universe, Journal of the American Statistical Association 47 pp 663– (1952) · Zbl 0047.38301 |

[10] | Little, Statistical Analysis with Missing Data (1987) |

[11] | McCrory, Evaluation of cervical cytology. Evidence report/technology assessment no. 5 (1999) |

[12] | Medical Services Advisory Committee, Liquid Based Cytology for Cervical Screening (2002) |

[13] | Pepe, The Statistical Evaluation of Medical Tests for Classification and Prediction (2003) · Zbl 1039.62105 |

[14] | Pepe, Comparing disease screening tests when true disease status is ascertained only for screen positives, Biostatistics 2 pp 249– (2001) · Zbl 1097.62566 |

[15] | Pepe, Phases of biomarker development for early detection of cancer, Journal of the National Cancer Institute 93 pp 1054– (2001) |

[16] | Reis, SEER Cancer Statistics Review, 1971-1999 (2002) |

[17] | Schatzkin, Comparing new and old screening tests when a reference procedure cannot be performed on all screenees, American Journal of Epidemiology 125 pp 672– (1987) |

[18] | Schiffman, Cancer Epidemiology and Prevention pp 1090– (1996) |

[19] | Steering Group on the Feasibility of Introducing Liquid Based Cytology, Steering Group Report on the Feasibility of Introducing Liquid Based Cytology (2002) |

This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.