Errors in reported degrees and respondent driven sampling: Implications for bias
File(s)
Author(s)
Mills, HL
Johnson, S
Hickman, M
Jones, NS
Colijn, C
Type
Journal Article
Abstract
Background
Respondent Driven Sampling (RDS) is a network or chain sampling method designed to access individuals from hard-to-reach populations such as people who inject drugs (PWID). RDS surveys are used to monitor behaviour and infection occurence over time; these estimations require adjusting to account for over-sampling of individuals with many contacts. Adjustment is done based on individuals’ reported total number of contacts, assuming these are correct.
Methods
Data on the number of contacts (degrees) of individuals sampled in two RDS surveys in Bristol, UK, show larger numbers of individuals reporting numbers of contacts in multiples of 5 and 10 than would be expected at random. To mimic these patterns we generate contact networks and explore different methods of mis-reporting degrees. We simulate RDS surveys and explore the sensitivity of adjusted estimates to these different methods.
Results
We find that inaccurate reporting of degrees can cause large and variable bias in estimates of prevalence or incidence. Our simulations imply that paired RDS surveys could over- or under-estimate any change in prevalence by as much as 25%. These are particularly sensitive to inaccuracies in the degree estimates of individuals with who have low degree.
Conclusions
There is a substantial risk of bias in estimates from RDS if degrees are not correctly reported. This is particularly important when analysing consecutive RDS samples to assess trends in population prevalence and behaviour. RDS questionnaires should be refined to obtain high resolution degree information, particularly from low-degree individuals. Additionally, larger sample sizes can reduce uncertainty in estimates.
Respondent Driven Sampling (RDS) is a network or chain sampling method designed to access individuals from hard-to-reach populations such as people who inject drugs (PWID). RDS surveys are used to monitor behaviour and infection occurence over time; these estimations require adjusting to account for over-sampling of individuals with many contacts. Adjustment is done based on individuals’ reported total number of contacts, assuming these are correct.
Methods
Data on the number of contacts (degrees) of individuals sampled in two RDS surveys in Bristol, UK, show larger numbers of individuals reporting numbers of contacts in multiples of 5 and 10 than would be expected at random. To mimic these patterns we generate contact networks and explore different methods of mis-reporting degrees. We simulate RDS surveys and explore the sensitivity of adjusted estimates to these different methods.
Results
We find that inaccurate reporting of degrees can cause large and variable bias in estimates of prevalence or incidence. Our simulations imply that paired RDS surveys could over- or under-estimate any change in prevalence by as much as 25%. These are particularly sensitive to inaccuracies in the degree estimates of individuals with who have low degree.
Conclusions
There is a substantial risk of bias in estimates from RDS if degrees are not correctly reported. This is particularly important when analysing consecutive RDS samples to assess trends in population prevalence and behaviour. RDS questionnaires should be refined to obtain high resolution degree information, particularly from low-degree individuals. Additionally, larger sample sizes can reduce uncertainty in estimates.
Date Issued
2014-09-01
Date Acceptance
2014-06-05
Citation
Drug and Alcohol Dependence, 2014, 142, pp.120-126
ISSN
1879-0046
Publisher
Elsevier
Start Page
120
End Page
126
Journal / Book Title
Drug and Alcohol Dependence
Volume
142
Copyright Statement
Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported (CC BY-NC-ND 3.0)
Sponsor
Wellcome Trust
Engineering & Physical Science Research Council (EPSRC)
Engineering & Physical Science Research Council (EPSRC)
Engineering & Physical Science Research Council (E
Grant Number
093488/Z/10/Z
EP/I031626/1
EP/K026003/1
EP/I005765/1- UEL 2501/R100547
Subjects
Science & Technology
Life Sciences & Biomedicine
Substance Abuse
Psychiatry
Respondent driven sampling
At-risk populations
Contact network size
INJECTING DRUG-USERS
NEW-YORK-CITY
HIDDEN POPULATIONS
METHODOLOGICAL CHALLENGES
SEXUAL PARTNERS
NETWORKS
MEN
SEROPREVALENCE
RECRUITMENT
PREVALENCE
Publication Status
Published