Survey data collection
With the application of probability sampling in the 1930s, surveys became a standard tool for empirical research in the social sciences, marketing, and official statistics. Survey data collection methods are the ways in which information is gathered systematically from a sample of individuals for a statistical survey. These methods have changed over time: first came the shift from traditional paper-and-pencil interviewing (PAPI) to computer-assisted interviewing (CAI), and now face-to-face surveys (CAPI), telephone surveys (CATI), and mail surveys (CASI, CSAQ) are increasingly being replaced by web surveys.
Modes of data collection
There are several ways of administering a survey. Within a survey, different methods can be used for different parts. For example, interviewer administration can be used for general topics but self-administration for sensitive topics. The choice between administration modes is influenced by several factors, including 1) costs, 2) coverage of the target population, 3) flexibility of asking questions, 4) respondents’ willingness to participate and 5) response accuracy. Different methods create mode effects that change how respondents answer. The most common modes of administration are listed under the following headings.
Mobile data collection, or mobile surveying, is an increasingly popular method of data collection. Over 50% of surveys today are opened on mobile devices. The survey, form, app or collection tool runs on a mobile device such as a smartphone or a tablet. These devices offer innovative ways to gather data and eliminate laborious data entry (transcribing paper forms into a computer), which delays analysis and understanding. By eliminating paper, mobile data collection can also dramatically reduce costs: one World Bank study in Guatemala found a 71% decrease in cost when using mobile data collection compared with the previous paper-based approach.
SMS surveys can reach any handset, in any language and in any country. Because they do not depend on internet access and answers can be sent when it is convenient, they are a suitable mobile data collection channel for many situations that require fast, high-volume responses. As a result, SMS surveys can deliver 80% of responses in less than 2 hours, often at much lower cost than face-to-face surveys because travel and personnel costs are eliminated.
Apart from high mobile phone penetration, further advantages are quicker response times and the possibility of reaching previously hard-to-reach target groups. In this way, mobile technology allows marketers, researchers and employers to create real and meaningful mobile engagement in environments other than the traditional one in front of a desktop computer. However, even when using mobile devices to answer web surveys, most respondents still answer from home.
Online (Internet) surveys are becoming an essential research tool for a variety of research fields, including marketing, social and official statistics research. According to ESOMAR, online survey research accounted for 20% of global data-collection expenditure in 2006. Online surveys offer capabilities beyond those available for any other type of self-administered questionnaire. Online consumer panels are also used extensively for carrying out surveys, but their quality is often considered inferior because the panelists are regular contributors and tend to be fatigued. However, when estimating measurement quality (defined as the product of reliability and validity) using a multitrait-multimethod (MTMM) approach, some studies found quite reasonable quality, and even that the quality of a series of questions in an online opt-in panel (Netquest) was very similar to the measurement quality of the same questions asked in the European Social Survey (ESS), which is a face-to-face survey.
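The definition of measurement quality as the product of reliability and validity can be illustrated with a few lines of Python. This is only a sketch: the function name and the coefficient values are hypothetical and are not taken from any of the studies mentioned, and real MTMM analyses estimate these coefficients from a structural model rather than receiving them as inputs.

```python
def measurement_quality(reliability: float, validity: float) -> float:
    """Return measurement quality as the product of reliability and validity."""
    for name, value in (("reliability", reliability), ("validity", validity)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1, got {value}")
    return reliability * validity


# Hypothetical coefficients for the same question asked in two modes.
web_quality = measurement_quality(reliability=0.80, validity=0.90)
phone_quality = measurement_quality(reliability=0.70, validity=0.90)
print(f"web: {web_quality:.2f}, telephone: {phone_quality:.2f}")
```

With these made-up inputs, the lower reliability in the second mode translates directly into lower measurement quality, which is the kind of comparison the studies above report.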
Some studies have compared the quality of face-to-face and/or telephone surveys with that of online surveys, both for single questions and for more complex concepts measured with more than one question (also called composite scores or indices). Focusing only on probability-based surveys (including the online ones), they found overall that face-to-face surveys (using show-cards) and web surveys have quite similar levels of measurement quality, whereas telephone surveys performed worse. Other studies comparing paper-and-pencil questionnaires with web-based questionnaires showed that employees preferred online survey approaches to the paper-and-pencil format. There are also concerns about what has been called "ballot stuffing", in which employees make repeated responses to the same survey. Some employees are also concerned about privacy. Even if they do not provide their names when responding to a company survey, can they be certain that their anonymity is protected? Such fears prevent some employees from expressing an opinion.
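One common way to limit repeated responses without asking for names is to issue single-use invitation tokens. The sketch below is a minimal illustration under stated assumptions: the `TokenGate` class and its methods are hypothetical names, tokens are held in memory only, and answers are stored without the token so they cannot be linked back to an invitation. It does not by itself guarantee anonymity (for example, submission timing could still identify someone).

```python
import secrets


class TokenGate:
    """Accept each invitation token once; store answers without the token."""

    def __init__(self, n_invitations: int) -> None:
        # Random, hard-to-guess tokens; one would be sent to each invitee.
        self.unused = {secrets.token_urlsafe(16) for _ in range(n_invitations)}
        self.responses = []

    def submit(self, token: str, answers: dict) -> bool:
        if token not in self.unused:
            return False  # unknown token, or token already used
        self.unused.remove(token)
        self.responses.append(dict(answers))  # the token is not stored
        return True


gate = TokenGate(n_invitations=3)
token = next(iter(gate.unused))
first = gate.submit(token, {"satisfaction": 4})
second = gate.submit(token, {"satisfaction": 1})  # repeat attempt is rejected
print(first, second, len(gate.responses))
```

The design choice here is to discard the token at submission time, trading the ability to send targeted reminders for a weaker link between invitee and answer.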
Advantages of online surveys
- Web surveys are generally faster, simpler, and cheaper than other modes. In practice, however, the cost advantage is less clear-cut, because costs are closely tied to survey errors: response rates for online surveys usually compare unfavourably with those of other modes, so efforts to achieve a higher response rate (e.g., with traditional solicitation methods) may substantially increase costs.
- The entire data collection period is significantly shortened, as all data can be collected and processed in little more than a month.
- Interaction between the respondent and the questionnaire is more dynamic compared to e-mail or paper surveys. Online surveys are also less intrusive, and they suffer less from social desirability effects.
- Complex skip patterns can be implemented in ways that are mostly invisible to the respondent.
- Pop-up instructions can be provided for individual questions to provide help with questions exactly where assistance is required.
- Questions with long lists of answer choices can be used to provide immediate coding of answers to certain questions that are usually asked in an open-ended fashion in paper questionnaires.
- Online surveys can be tailored to the situation (e.g., respondents may be allowed to save a partially completed form, the questionnaire may be preloaded with already available information, etc.).
- Online questionnaires may be improved by applying usability testing, where usability is measured with reference to the speed with which a task can be performed, the frequency of errors and user satisfaction with the interface.
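The point about skip patterns being invisible to the respondent can be sketched in Python. This is an illustrative toy, not the design of any particular survey tool: the `Question` class, the `show_if` field, and the example questions are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Question:
    qid: str
    text: str
    # Shown only when this returns True for the answers given so far.
    show_if: Optional[Callable[[Dict[str, str]], bool]] = None


QUESTIONS = [
    Question("employed", "Are you currently employed? (yes/no)"),
    Question("hours", "How many hours do you work per week?",
             show_if=lambda a: a.get("employed") == "yes"),
    Question("seeking", "Are you looking for work? (yes/no)",
             show_if=lambda a: a.get("employed") == "no"),
    Question("age", "What is your age?"),
]


def questions_asked(scripted_answers: Dict[str, str]) -> List[str]:
    """Walk the questionnaire and return the ids of questions actually shown."""
    answers: Dict[str, str] = {}
    shown: List[str] = []
    for q in QUESTIONS:
        if q.show_if is not None and not q.show_if(answers):
            continue  # skipped silently; the respondent never sees it
        shown.append(q.qid)
        answers[q.qid] = scripted_answers[q.qid]
    return shown


print(questions_asked({"employed": "no", "seeking": "yes", "age": "41"}))
```

A respondent who answers "no" to the first question is routed past the hours question without ever seeing it, whereas a paper form would need an explicit "go to question 3" instruction.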
Key methodological issues of online surveys
- Sampling. The difference between probability samples (where the inclusion probabilities for all units of the target population are known in advance) and non-probability samples (which often require less time and effort but generally do not support statistical inference) is crucial. Probability samples are highly affected by problems of non-coverage (not all members of the general population have Internet access) and frame problems (online survey invitations are most conveniently distributed by e-mail, but there are no e-mail directories of the general population that might be used as a sampling frame). Because coverage and frame problems can significantly impact data quality, they should be adequately reported when disseminating the research results.
- Invitations to online surveys. Due to the lack of sampling frames, many online survey invitations are published as a URL on web sites or in other media, which leads to non-probability samples and to sample selection bias that is outside the researcher's control. Traditional solicitation modes, such as telephone or mail invitations to web surveys, can help overcome probability sampling issues in online surveys. However, such approaches face problems of dramatically higher costs and questionable effectiveness.
- Non-response. Online survey response rates are generally low and vary widely, from less than 1% in enterprise surveys with e-mail invitations to almost 100% in specific membership surveys. In addition to refusing to participate, breaking off partway through, or skipping certain questions, several other non-response patterns can be observed in online surveys, such as lurking respondents and a combination of partial and item non-response. Response rates can be increased by offering monetary or other incentives to respondents, by contacting respondents several times (follow-up), and by keeping the questionnaire as easy as possible. Incentives have drawbacks, however: whether responses given in return for a reward are unbiased can be questioned. Another way to encourage feedback is to publicize what is done with the results: taking concrete action based on feedback and showing it to the customer base strongly motivates customers to keep making their voices heard.
- Platform issues. Lack of familiarity with the platform used can confuse participants and clients.
- Questionnaire design. While modern web questionnaires offer a range of design features (different question types, images, multimedia), the use of such elements should be limited to what is necessary for respondents to understand the questions or to stimulate response. Design elements should not affect the answers themselves, because that would lower the validity and reliability of the data. Appropriate questionnaire design can help lower the measurement error that can also arise from the respondents or the survey mode itself (respondent’s motivation, computer literacy, abilities, privacy concerns, etc.).
- Post-survey adjustments. Various robust procedures have been developed for situations where sampling deviates from probability selection, or where non-coverage and non-response problems arise. The standard statistical inference procedures (e.g. confidence interval calculations and hypothesis testing) still require a probability sample. Actual survey practice, particularly in marketing research and public opinion polling, massively neglects the principles of probability sampling, and this increasingly requires the statistical profession to specify the conditions under which non-probability samples may work.
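As one example of a post-survey adjustment, the sketch below applies simple post-stratification weighting, so that the weighted sample matches known population shares on one variable. The age groups, population shares, and answers are invented for illustration, and the function name is hypothetical. Weighting of this kind only corrects for the variables used to form the groups; it does not turn a non-probability sample into a probability sample.

```python
from collections import Counter


def poststratification_weights(sample_groups, population_shares):
    """Return one weight per respondent so weighted group shares match the population."""
    counts = Counter(sample_groups)
    n = len(sample_groups)
    weights = []
    for group in sample_groups:
        sample_share = counts[group] / n
        weights.append(population_shares[group] / sample_share)
    return weights


# Hypothetical web sample that over-represents younger respondents.
groups = ["18-34"] * 6 + ["35-64"] * 3 + ["65+"] * 1
population = {"18-34": 0.30, "35-64": 0.50, "65+": 0.20}
answers = [1, 1, 1, 1, 0, 0, 1, 0, 0, 0]  # 1 = yes, 0 = no (made-up item)

weights = poststratification_weights(groups, population)
unweighted = sum(answers) / len(answers)
weighted = sum(w * y for w, y in zip(weights, answers)) / sum(weights)
print(f"unweighted: {unweighted:.3f}, weighted: {weighted:.3f}")
```

In this toy data the younger group answers "yes" more often and is over-represented, so the weighted estimate is lower than the unweighted one.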
Telephone
- Use of interviewers encourages sample persons to respond, leading to higher response rates.
- Interviewers can increase comprehension of questions by answering respondents' questions.
- Fairly cost efficient, depending on local call charge structure
- Good for large national (or international) sampling frames
- Some potential for interviewer bias (e.g., some people may be more willing to discuss a sensitive issue with a female interviewer than with a male one)
- Cannot be used for non-audio information (graphics, demonstrations, taste/smell samples)
- Three types:
  - Traditional telephone interviews
  - Computer assisted telephone dialing
  - Computer assisted telephone interviewing (CATI)
Mail
- The questionnaire may be handed to the respondents or mailed to them, but in all cases they are returned to the researcher via mail.
- Cost is very low, since bulk postage is cheap in most countries
- Long delays, often several months, before the surveys are returned and statistical analysis can begin
- Not suitable for issues that may require clarification
- Respondents can answer at their own convenience (allowing them to break up long surveys; also useful if they need to check records to answer a question)
- No interviewer bias
- Non-response bias can be corrected by extrapolation across waves
- Large amount of information can be obtained: some mail surveys are as long as 50 pages
- Response rates can be improved by using mail panels
- Response rates can be improved by using monetary incentives
- Response rates are affected by the class of mail through which the survey was sent
- Members of the panel have agreed to participate
- Panels can be used in longitudinal designs where the same respondents are surveyed several times
- Suitable for locations where telephone or mail are not developed
- Potential for interviewer bias
- Easy to manipulate by completing multiple times to skew results
Researchers can combine several of the above methods for data collection. For example, researchers can invite shoppers at malls and send questionnaires by e-mail to those willing to participate. With the introduction of computers to the survey process, survey mode now includes combinations of different approaches, or mixed-mode designs. Some of the most common methods are:
- Computer-assisted personal interviewing (CAPI): The computer displays the questions on screen, the interviewer reads them to the respondent, and then enters the respondent's answers.
- Audio computer-assisted self-interviewing (audio CASI): The respondent operates the computer, which displays each question on the screen and plays a recording of it to the respondent, who then enters their answers.
- Computer-assisted telephone interviewing (CATI)
- Interactive voice response (IVR): The computer plays recordings of the questions to respondents over the telephone, who then respond by using the keypad of the telephone or speaking their answers aloud.
- Web surveys: The computer administers the questions online.