Combining and Analyzing Complex Data
When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan .
- Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
- Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.
About this course: In this course you will learn how to use survey weights to estimate descriptive statistics, like means and totals, and more complicated quantities like model parameters for linear and logistic regressions. Software capabilities will be covered with R® receiving particular emphasis. The course will also cover the basics of record linkage and statistical matching—both of which are becoming more important as ways of combining data from different sources. Combining of datasets raises ethical issues which the course reviews. Informed consent may have to be obtained from persons to allow their data to be linked. You will learn about differences in the legal requirements in …

There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.
When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan .
- Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
- Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.
About this course: In this course you will learn how to use survey weights to estimate descriptive statistics, like means and totals, and more complicated quantities like model parameters for linear and logistic regressions. Software capabilities will be covered with R® receiving particular emphasis. The course will also cover the basics of record linkage and statistical matching—both of which are becoming more important as ways of combining data from different sources. Combining of datasets raises ethical issues which the course reviews. Informed consent may have to be obtained from persons to allow their data to be linked. You will learn about differences in the legal requirements in different countries.
Created by: University of Maryland, College Park-
Taught by: Richard Valliant, Ph.D., Research Professor
Joint Program in Survey Methodology
Each course is like an interactive textbook, featuring pre-recorded videos, quizzes and projects.
Help from your peersConnect with thousands of other learners and debate ideas, discuss course material, and get help mastering concepts.
CertificatesEarn official recognition for your work, and share your success with friends, colleagues, and employers.
University of Maryland, College Park The University of Maryland is the state's flagship university and one of the nation's preeminent public research universities. A global leader in research, entrepreneurship and innovation, the university is home to more than 37,000 students, 9,000 faculty and staff, and 250 academic programs. Its faculty includes three Nobel laureates, three Pulitzer Prize winners, 47 members of the national academies and scores of Fulbright scholars. The institution has a $1.8 billion operating budget, secures $500 million annually in external research funding and recently completed a $1 billion fundraising campaign.Syllabus
WEEK 1
Basic Estimation
After completing Modules 1 and 2 of this course you will understand how to estimate descriptive statistics, overall and for subgroups, when you deal with survey data. We will review software for estimation (R, Stata, SAS) with examples for how to estimate things like means, proportions, and totals. You will also learn how to estimate parameters in linear, logistic, and other models and learn software options with emphasis on R. Module 3 and 4 discuss how you can add additional data to your analysis. This requires knowing about record linkage techniques, and what it takes to get permission to link data.
7 videos, 6 readings expand
- Video: Overview
- Reading: Slides
- Video: Basic R examples
- Video: Basic R examples (continued)
- Reading: Slides
- Video: Degrees of Freedom
- Reading: Slides
- Video: Estimating Means
- Video: Multistage samples
- Reading: Slides
- Reading: Slides (continued)
- Video: Quantile estimation in R
- Reading: Slides
Graded: Course 6 Module 1
WEEK 2
Models
Module 2 covers how to estimate linear and logistic model parameters using survey data. After completing this module, you will understand how the methods used differ from the ones for non-survey data. We also cover the features of survey data sets that need to be accounted for when estimating standard errors of estimated model parameters.
8 videos, 8 readings expand
- Video: Introduction
- Reading: Slides
- Video: Estimation Method
- Reading: Slides
- Video: Linear Models
- Reading: Slides
- Video: Diagnostics in R
- Reading: Slides
- Video: Linear Models in Stata
- Reading: Slides
- Video: Logistic Models in R
- Reading: Slides
- Video: Odds Ratios
- Reading: Slides
- Video: Logistic Regression in Stata
- Reading: Slides
Graded: Course 6 Module 2
WEEK 3
Record Linkage
Module starts with the current debate on using more (linked) administrative records in the U.S. Federal Statistical System, and a general motivation for linking records. Several examples will be given on why it is useful to link data. Challenges of record linkage will be discussed. A brief overview over key linkage techniques is included as well.
4 videos, 12 readings expand
- Reading: Improving Federal Statistics Using Multiple Data Sources
- Reading: Longitudinal Employer-Household Dynamics (LEHD)
- Reading: Impact of Research on Innovation, Competition and Science
- Discussion Prompt: Country specific examples
- Video: Why we link records
- Reading: Slides
- Video: Gentle Introduction
- Reading: Slides - Introduction
- Reading: Technical Overview - Software
- Video: Challenges
- Reading: Slides: Challenges
- Video: Key Techniques
- Reading: Slides
- Reading: Record Linkage (Herzog/Scheuren/Winkler 2010)
- Reading: Febrl - A Freely Available Record Linkage System (Christen)
- Reading: Machine Learning and Record Linkage (Winkler 2011)
- Reading: Privacy Preserving Record Linkage (Schnell et al. 2009)
Graded: Quiz 3 - Record Linkage
WEEK 4
Ethics
This module will discuss key issues in obtaining consent to record linkage. Failure to consent can lead to bias estimates. Current research examples will be given as well as practical suggestions on how to obtain linkage consent.
5 videos, 3 readings expand
- Video: Privacy and Confidentiality
- Video: Linkage Consent and Consent Bias
- Video: Correlates of Consent
- Video: Bias in Administrative Estimates
- Video: Optimizing Linkage Consent
- Reading: Slides
- Reading: Assessing the Magnitude of Non-Consent Biases (Sakshaug & Kreuter 2012)
- Reading: Placement, Wording and Interviewers (Sakshaug et al.)
Graded: Quiz - Linkage Consent
There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.
