Diagnostic Efficiency of Home Pregnancy Test Kits

Objective  To assess the diagnostic efficiency of homepregnancy test (HPT) kits.
Data Sources  A literature search of English-language studieswas performed with MEDLINE and a review of bibliographies.
Study Selection  Studies were included if HPT kits werecompared with a criterion standard (laboratory testing), ifthey used appropriate controls, and if data were available todetermine sensitivity and specificity.
Data Extraction  Two investigators independently extracteddata, and disagreement was resolved by consensus. Sensitivity,specificity, and an effectiveness score (a measure of the discriminatorypower of the test, with higher scores implying greater effectiveness)were calculated.
Data Synthesis  Five studies evaluating 16 HPT kits metthe inclusion criteria. The range of sensitivities for HPT kitswas 0.52 to 1.0. In studies where urine samples obtained bythe investigators were tested by volunteers, sensitivity was0.91 (95% confidence interval [CI], 0.84-0.96). However, thesensitivity was less in studies where subjects were actual patientswho performed the test on their own urine samples (sensitivity,0.75 [95% CI, 0.64-0.85]). The test effectiveness score was2.75 (95% CI, 2.3-3.2) for studies where subjects were volunteersbut deteriorated to 0.82 (95% CI, 0.4-1.2) for studies withactual patients.
Conclusions  The diagnostic efficiency of HPT kits is greatlyaffected by characteristics of the users. Despite the popularityof these kits, the relatively low effectiveness scores of thesekits when used by actual patients are of concern. We suggestthat manufacturers of HPT kits publish results of trials inactual patients before marketing them to the general public.

INTRODUCTION

Jump to Section
•	Top
•	Introduction
•	Materials and methods
•	Results
•	Comment
•	Conclusions
•	Author information
•	References

HOME PREGNANCY test (HPT) kits have become increasingly popularsince the first kit was released in the mid-1970s. These kitscurrently make up the fastest-growing segment of the home-diagnostictesting market.^1-2 In the United States, approximately 33% ofwomen have used an HPT kit to determine their pregnancy statusbefore seeking professional health care.^3-6 Most studies havefound that women choose to use HPT kits because of the speedof obtaining results and the convenience of testing at home.⁴Another advantage of the HPT kit is that the woman is the firstperson to know that she is pregnant. Since some women preferto wait until they are sure they are pregnant before visitingtheir physician, HPT kits may lead to an earlier pregnancy diagnosis.An earlier diagnosis provides an opportunity for health careproviders to counsel women about pregnancy options and to discouragepotentially harmful behaviors, such as smoking and use of alcoholor drugs.^5, 7
The history of HPT kits parallels the development of laboratorytests for urinary human chorionic gonadotropin (HCG). The firstkits used chemical and hemagglutination-inhibition methods,⁸but most current kits use HCG-directed monoclonal antibodies.^9-11The active ingredients in monoclonal-based kits are the HCG ${alpha}$ -chain–specific monoclonal antibodies, the ${beta}$ -chain–specificantibody/enzyme conjugate, the chromogenic substrate solution,and buffer solution.¹¹ In the presence of urine HCG, the monoclonalantibody binds the hormone and produces a reaction, usuallya color change because of the chromogenic substrate and buffersolutions.² A reaction should not occur when HCG is absent,because the antibody adheres only to HCG. The accuracy of HPTkits is claimed to be 97% to 99% by the manufacturers.¹ Thenewer products, such as Advance (Advanced Care Products, OrthoPharmaceutical Corp, Raritan, NJ), Answer (Carter Products,Carter-Wallace, Inc, New York, NY), Clearblue (Unipath DiagnosticsCo, New York), e.p.t. (Warner-Lambert Co, Morris Plains, NJ),First Response (Carter Products), and Daisy 2 (manufacturedby Bio-Dynamic Home Health Care, Inc, Indianapolis, Ind, until1982, then by Advanced Care Products), are reported by the manufacturersto be even more accurate than earlier kits, such as Daisy 1(Bio-Dynamic Home Health Care, Inc) and the first-generatione.p.t.(Warner-Lambert).¹²
The first HPT kit was released and marketed for general saleswithout Food and Drug Administration approval, because its releasepredated the 1976 Medical Device Amendment of the Food, Drug,and Cosmetic Act. ^13-16 This amendment allowed the marketingof new HPT kits that the Food and Drug Administration consideredto be "substantially equivalent" to the first product withoutapplying for approval. Criticism of this Food and Drug Administrationpolicy, which has been described as too lenient, was expressedin editorials and studies that showed poor performance of thesekits by individual consumers.¹⁶ Despite these concerns thatHPT kits require more testing in actual patients, most consumersand clinicians make decisions on the basis of the excellentsensitivity and specificity reported by the kit manufacturers.We reviewed the available literature and explored the variabilityin diagnostic efficiency among HPT kits.

MATERIALS AND METHODS

Jump to Section
•	Top
•	Introduction
•	Materials and methods
•	Results
•	Comment
•	Conclusions
•	Author information
•	References

DATA SOURCES
We searched the MEDLINE and HEALTHSTAR databases for English-languagearticles concerning the diagnosis of pregnancy that were publishedbetween 1966 and 1996. The key words used were pregnancy, diagnosis,pregnancy tests, and home tests. The HPT kits were not developedbefore 1966. References cited in articles and those listed inthe bibliographies of standard obstetric texts were also retrieved.Articles were systematically reviewed by 2 of us (L.A.B. andK.N.) and given a grade of A through C based on the study designand level of evidence.¹⁷ Studies were included if the resultsof the HPT kit under investigation were compared to a criterionstandard (laboratory tests) and used appropriate controls, andif data were available to determine sensitivity and specificitywith total sample size greater than 20 (attempts were made toreach authors of potential articles to obtain additional informationneeded to determine sensitivity and specificity). We also attemptedto obtain information from manufacturers of HPT kits but wereunsuccessful.
STUDY SELECTION

Through the MEDLINE, textbook reference, and bibliography searches,we initially identified 55 articles; 45 were excluded eitherbecause the article was a review or because the HPT kit wasnot compared with a criterion standard laboratory-based urineor serum HCG test. The remaining 10 articles (Table 1) werethen analyzed and 5 more were excluded.^{5, 16, 18-25} The additionalexclusions were because the study had no control group of nonpregnantpatients,⁵ there were insufficient data for determining sensitivityand specificity,^18-19 the kit was no longer available becauseof its demonstrated poor performance,²⁰ or the study had aninadequate sample size.²¹

DATA EXTRACTION
Data were abstracted independently by 2 of us (L.A.B. and K.N.)by means of structured forms that were pretested. Disagreementswere resolved by consensus. Sensitivity, specificity, and atest effectiveness score were calculated.
The test effectiveness score has been used in previous meta-analysesbecause it allows comparison of the relative and absolute abilityof tests to discriminate those with from those without the targetcondition.²⁶ A logistic odds transformation of the sensitivityand specificity allows creation of more normally distributedfrequency plots of the test results for pregnant vs nonpregnantwomen. The effectiveness score quantifies the degree of overlapbetween the 2 plots and is interpreted directly as the numberof SDs separating the means of the 2 curves. Tests that leadto considerable overlap between the 2 plots would have effectivenessscores of 1.0 or less and would not effectively distinguishpregnant from nonpregnant women.²⁷ An effectiveness score of1.0 means that 27% of pregnant women have tests results equivalentto those of women who are not pregnant. Tests that lead to minimaloverlap between the 2 plots would be highly efficient in distinguishingpregnant from nonpregnant women and would have effectivenessscores approaching 3.0 and greater. Thus, a pregnancy test withan effectiveness score of 3.0 would yield results for pregnantwomen that are 3 SDs away from those of a population of nonpregnantwomen; the overlap in frequency plots for tests with an effectivenessscore of 3.0 is only 3% of the patient sample.
A test of homogeneity of both sensitivity and test effectivenessscore was performed to evaluate consistency of findings acrossstudies in these 2 categories. Because studies were found tobe heterogeneous (P<.05), data were analyzed statisticallyby empirical Bayesian methods to arrive at summary statisticsof sensitivity and test effectiveness score with 95% confidenceintervals (CIs).²⁸

RESULTS

Jump to Section
•	Top
•	Introduction
•	Materials and methods
•	Results
•	Comment
•	Conclusions
•	Author information
•	References

General characteristics of the 10 studies retrieved initiallyare presented in Table 1. The sensitivity, specificity, andtest effectiveness scores of the 5 retained studies (16 kits)are presented categorized by the HPT kit in Table 2. These studiesachieved methodologic quality scores of either A or B. Threeof the studies evaluated volunteers who performed the pregnancytests on study samples obtained previously by the investigators.Two of the studies evaluated pregnancy tests performed by womenwho collected their own urine samples according to the kit instructionsand performed the pregnancy test on their own samples.

The summary sensitivity was 0.91 (95% CI, 0.84-0.96) for studieswhere subjects were volunteers. Test performance deterioratedin studies where subjects were women who collected and testedtheir own samples, as demonstrated by a decreased summary sensitivityof 0.75 (95% CI, 0.64-0.85). Effectiveness scores of HPT kitsalso differed between these 2 groups. Figure 1 shows the effectivenessscores with 95% CIs, stratified by whether the study used volunteersor patients as subjects. The pooled test effectiveness scoreapproached the desired benchmark value of 3.0 for studies inwhich volunteers performed the test (pooled effectiveness score,2.75 [95% CI, 2.3-3.2]). However, kits were inefficient whenwomen collected their own urine and performed the tests themselves(pooled test effectiveness score, 0.82 [95% CI, 0.4-1.2]).

COMMENT

Jump to Section
•	Top
•	Introduction
•	Materials and methods
•	Results
•	Comment
•	Conclusions
•	Author information
•	References

These findings demonstrate differences in performance of theHPT kits. The low sensitivity and effectiveness score when HPTkits were used by women evaluating their own samples suggeststhat consumers and physicians should be concerned about thediagnostic efficiency of these kits, especially when the testresult is negative for pregnancy. Despite the potential forproblems with the HPT kits, most women and their physiciansconsider them reliable. Clinicians routinely advise patientsto use these kits before scheduling prenatal appointments.⁵Some physicians also rely on the results of HPT kits beforetreating patients with potentially teratogenic medications.
The overall marketing success of HPT kits led to the developmentof other home testing kits, such as an ovulation test kit anda human immunodeficiency virus test kit.²⁹ On the basis of ourreview, we suggest that manufacturers of all home testing kitsfor any target condition should publish results of trials inactual patients before marketing them to the general public.If there are differences between volunteers and actual users,then modifications should be required until performance meetsacceptable standards.
With one third of pregnant women using HPT kits, the low sensitivity(high rate of false-negative results) is a public health concern.False-negative results, even if they occur 10% of the time,may result in a delay in obtaining proper prenatal care anda missed opportunity to potentially motivate change in behaviorssuch as smoking or use of alcohol or drugs.¹³ A false-negativeresult may affect the feasibility and safety of pregnancy termination.
Two major reasons exist for the high false-negative rate whentesting is performed by women on their own samples. First, womenmay be obtaining their samples before the recommended numberof days after their first missed menstrual period (usually 9days), when HCG levels become reliably detectable by the kits.Although many kits advertise their effectiveness at 9 days afterthe user's last menstrual period, the sensitivities reportedby manufacturers ( $>=$ 90%) are not applicable until 2 weeks afterthe last menstrual period.^19, 25
Another reason for false-negative results is operator error.Operator errors result from failure to read or follow instructions,or difficult procedures inherent to the kit. In 1 study, pharmacystudents evaluated the instruction leaflets in 9 popular kits.²⁵Although they did not rate the instruction leaflets significantlydifferently across kits, they did determine that the resultsof the 3 kits that used a color change were easier to interpretthan the other kits. In a 1993 study from France, 27 HPT kitswere studied for their diagnostic efficiency.¹⁸ The investigatorsfound a sensitivity range for all kits in France of 3% to 100%.They then tested the best 11 kits in 638 inexperienced volunteers.Of the 478 positive (pregnant) urine samples distributed, 230were falsely interpreted as negative (sensitivity, 48%). Themain explanation for the high rate of false-negative resultswas difficulty in understanding the instructions in the HPTkits, regardless of the socioeconomic situation (age, education,and employment) of the subjects. Valanis and Perlman,⁵ who studiedonly pregnant women, found that only 32% of users complied withall test kit instructions. The incidence of false-negative resultsin this study was 24.3%.
The fewer false-positive results with current monoclonal-basedkits have been attributed to ectopic sources of HCG or elevatedlevels of circulating luteinizing hormone. Ectopic HCG productionmay rarely occur with certain tumors, such as small-cell carcinomaof the lung.^8, 30 High levels of luteinizing hormone may occurin postmenopausal women and in women just before ovulation.³⁰Proteinuria does not interfere with monoclonal-based kits, butit can result in inconclusive readings in the hemagglutination-inhibitiontests.^31-32 A few medications, such as methadone hydrochloride,carbamazepine, and aspirin, as well as medical conditions, suchas ovarian cysts, abscesses, and pelvic inflammatory disease,may also interfere with the hemagglutination-inhibition testresults.³³False-positive results are not thought to be as significanta public health concern as false-negative results, as they shouldlead to a prenatal appointment and follow-up laboratory testing.^7,16, 30 False-positive results, however, can have extremely devastatingpsychological effects on the woman and her significant other.
A population of particular concern with regard to using HPTkits is teenagers. A recent study of teenagers requesting pregnancytests in health departments showed that 28% of adolescents hadused an HPT kit before their visit.⁶ Of those teenagers whowere pregnant, one third had at least 1 negative pregnancy testbefore their positive result. The decision by a sexually activeteenager to test herself for pregnancy marks the need for counselingabout contraception, even when the result is negative. Thishas led others to recommend discouraging teenagers from usingHPT kits so that those with negative results performed in clinicswill be afforded the opportunity to discuss health behaviorsintended to reduce the rate of teenage pregnancy.^6, 34 It isunlikely that we can prevent teenagers from using HPT kits.An alternative suggestion would be to encourage manufacturersto label kits with a warning suggesting that teenagers talkwith an adult about their pregnancy test result, even if itis negative.
Publication bias may impair our ability to assess home testingkits because manufacturers of self-diagnostic kits do not publishtheir results. An editorial by the vice president of an HPTmanufacturer cited that extensive, but unpublished, clinicaltrials in hundreds of women were conducted before the kit wasmarketed.³⁵ We attempted to obtain these unpublished data frommanufacturers of HPT kits but were unsuccessful. We do not knowif publication bias would change our findings, but usually publicationbias results in the increased publication rates of studies withgood results. We also are unaware of whether the issue of actualpatient use vs testing by volunteers is adequately evaluatedin premarketing trials.

CONCLUSIONS

Jump to Section
•	Top
•	Introduction
•	Materials and methods
•	Results
•	Comment
•	Conclusions
•	Author information
•	References

Researchers have been concerned about the differences in diagnostictest characteristics when HPT kits were used outside of a controlledlaboratory setting. In the hands of experienced technicians,the HPT kits have been proven to be almost as accurate (97.4%)as professional laboratory testing.¹⁶ However, they are lessaccurate when performed by consumers. The limitation of self-testingis the ability of the users to perform the test. It is essentialthat HPT kits provide adequate instructions that are easy toread and understand. Our study suggests that HPT kit instructionsshould be reviewed to (1) make sure women understand them; (2)encourage women to wait at least 2 weeks after a missed periodbefore performing the test; and (3) notify women of the potentialfor false-negative results.
Clinicians should be concerned about the diagnostic efficiencyof HPT kits, given the relatively low effectiveness score whenused by actual patients. When a patient calls reporting a negativeresult, she should be encouraged to repeat the test 1 week laterif she remains amenorrheic, and to call her provider if thetest result remains negative. When a patient calls reportinga positive result, she should be encouraged to schedule an appointmentfor her first prenatal visit to confirm that she is pregnant.Because most manufacturers do not publish the results of theirtrials in actual patients, we are not able to report sensitivity,specificity, and test effectiveness scores for all HPT kitscurrently on the market. Consumer Reports ranked these kitson the basis of the manufacturer's reported accuracy, ease ofuse, and cost and determined Answer to be the best value.⁹ Withoutmore information from the manufacturers, we cannot recommend1 specific HPT kit. Further research is needed in this area.

AUTHOR INFORMATION

Jump to Section
•	Top
•	Introduction
•	Materials and methods
•	Results
•	Comment
•	Conclusions
•	Author information
•	References

Accepted for publication October 6, 1997.
Presented in part at the annual sessions of the Society of GeneralInternal Medicine, Washington, DC, May 1, 1997.
Reprints: Lori A. Bastian, MD, MPH, Durham Veterans AffairsMedical Center, 508 Fulton St (152), Durham, NC 27705 (e-mail:LBastian@acpub.duke.edu).

From the Departments of Internal Medicine (Drs Bastian, Nanda, and Simel), Obstetrics and Gynecology (Dr Bastian), and Center for Health Policy and Research (Dr Hasselblad), Duke University Medical Center, the Center for Health Services Research in Primary Care (Drs Bastian and Simel), and the Women Veterans Comprehensive Health Center, Durham Veterans Affairs Medical Center (Drs Bastian, Nanda, and Simel), Durham, NC.

Huynhluu

Friday, September 8, 2017

Diagnostic Efficiency of Home Pregnancy Test Kits

Featured Post

Teste

Popular Posts

Total de visualizações

Followers