Practical Predictive Analytics: Models and Methods
Description
When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan .
- Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
- Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.
About this course: Statistical experiment design and analytics are at the heart of data science. In this course you will design statistical experiments and analyze the results using modern methods. You will also explore the common pitfalls in interpreting statistical arguments, especially those associated with big data. Collectively, this course will help you internalize a core set of practical and effective machine learning methods and concepts, and apply them to solve some real world problems. Learning Goals: After completing this course, you will be able to: 1. Design effective experiments and analyze the results 2. Use resampling methods to make clear and bulletproof statistical arg…
Frequently asked questions
There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.
When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan .
- Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
- Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.
About this course: Statistical experiment design and analytics are at the heart of data science. In this course you will design statistical experiments and analyze the results using modern methods. You will also explore the common pitfalls in interpreting statistical arguments, especially those associated with big data. Collectively, this course will help you internalize a core set of practical and effective machine learning methods and concepts, and apply them to solve some real world problems. Learning Goals: After completing this course, you will be able to: 1. Design effective experiments and analyze the results 2. Use resampling methods to make clear and bulletproof statistical arguments without invoking esoteric notation 3. Explain and apply a core set of classification methods of increasing complexity (rules, trees, random forests), and associated optimization methods (gradient descent and variants) 4. Explain and apply a set of unsupervised learning concepts and methods 5. Describe the common idioms of large-scale graph analytics, including structural query, traversals and recursive queries, PageRank, and community detection
Created by: University of Washington-
Taught by: Bill Howe, Director of Research
Scalable Data Analytics
Each course is like an interactive textbook, featuring pre-recorded videos, quizzes and projects.
Help from your peersConnect with thousands of other learners and debate ideas, discuss course material, and get help mastering concepts.
CertificatesEarn official recognition for your work, and share your success with friends, colleagues, and employers.
University of Washington Founded in 1861, the University of Washington is one of the oldest state-supported institutions of higher education on the West Coast and is one of the preeminent research universities in the world.Syllabus
WEEK 1
Practical Statistical Inference
Learn the basics of statistical inference, comparing classical methods with resampling methods that allow you to use a simple program to make a rigorous statistical argument. Motivate your study with current topics at the foundations of science: publication bias and reproducibility.
28 videos expand
- Video: Appetite Whetting: Bad Science
- Video: Hypothesis Testing
- Video: Significance Tests and P-Values
- Video: Example: Difference of Means
- Video: Deriving the Sampling Distribution
- Video: Shuffle Test for Significance
- Video: Comparing Classical and Resampling Methods
- Video: Bootstrap
- Video: Resampling Caveats
- Video: Outliers and Rank Transformation
- Video: Example: Chi-Squared Test
- Video: Bad Science Revisited: Publication Bias
- Video: Effect Size
- Video: Meta-analysis
- Video: Fraud and Benford's Law
- Video: Intuition for Benford's Law
- Video: Benford's Law Explained Visually
- Video: Multiple Hypothesis Testing: Bonferroni and Sidak Corrections
- Video: Multiple Hypothesis Testing: False Discovery Rate
- Video: Multiple Hypothesis Testing: Benjamini-Hochberg Procedure
- Video: Big Data and Spurious Correlations
- Video: Spurious Correlations: Stock Price Example
- Video: How is Big Data Different?
- Video: Bayesian vs. Frequentist
- Video: Motivation for Bayesian Approaches
- Video: Bayes' Theorem
- Video: Applying Bayes' Theorem
- Video: Naive Bayes: Spam Filtering
WEEK 2
Supervised Learning
Follow a tour through the important methods, algorithms, and techniques in machine learning. You will learn how these methods build upon each other and can be combined into practical algorithms that perform well on a variety of tasks. Learn how to evaluate machine learning methods and the pitfalls to avoid.
26 videos, 1 reading expand
- Video: Statistics vs. Machine Learning
- Video: Simple Examples
- Video: Structure of a Machine Learning Problem
- Video: Classification with Simple Rules
- Video: Learning Rules
- Video: Rules: Sequential Covering
- Video: Rules Recap
- Video: From Rules to Trees
- Video: Entropy
- Video: Measuring Entropy
- Video: Using Information Gain to Build Trees
- Video: Building Trees: ID3 Algorithm
- Video: Building Trees: C.45 Algorithm
- Video: Rules and Trees Recap
- Video: Overfitting
- Video: Evaluation: Leave One Out Cross Validation
- Video: Evaluation: Accuracy and ROC Curves
- Video: Bootstrap Revisited
- Video: Ensembles, Bagging, Boosting
- Video: Boosting Walkthrough
- Video: Random Forests
- Video: Random Forests: Variable Importance
- Video: Summary: Trees and Forests
- Video: Nearest Neighbor
- Video: Nearest Neighbor: Similarity Functions
- Video: Nearest Neighbor: Curse of Dimensionality
- Reading: R Assignment: Classification of Ocean Microbes
Graded: R Assignment: Classification of Ocean Microbes
WEEK 3
Optimization
You will learn how to optimize a cost function using gradient descent, including popular variants that use randomization and parallelization to improve performance. You will gain an intuition for popular methods used in practice and see how similar they are fundamentally.
11 videos expand
- Video: Optimization by Gradient Descent
- Video: Gradient Descent Visually
- Video: Gradient Descent in Detail
- Video: Gradient Descent: Questions to Consider
- Video: Intuition for Logistic Regression
- Video: Intuition for Support Vector Machines
- Video: Support Vector Machine Example
- Video: Intuition for Regularization
- Video: Intuition for LASSO and Ridge Regression
- Video: Stochastic and Batched Gradient Descent
- Video: Parallelizing Gradient Descent
WEEK 4
Unsupervised Learning
A brief tour of selected unsupervised learning methods and an opportunity to apply techniques in practice on a real world problem.
4 videos expand
- Video: Introduction to Unsupervised Learning
- Video: K-means
- Video: DBSCAN
- Video: DBSCAN Variable Density and Parallel Algorithms
Graded: Kaggle Competition Peer Review
Share your review
Do you have experience with this course? Submit your review and help other people make the right choice. As a thank you for your effort we will donate $1.- to Stichting Edukans.There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.