Communicating Data Science Results

Product type

Communicating Data Science Results

Coursera (CC)
Logo Coursera (CC)
Provider rating: starstarstarstar_halfstar_border 7.2 Coursera (CC) has an average rating of 7.2 (out of 6 reviews)

Need more information? Get more details on the site of the provider.

Description

When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan

  • Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
  • Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.

About this course: Important note: The second assignment in this course covers the topic of Graph Analysis in the Cloud, in which you will use Elastic MapReduce and the Pig language to perform graph analysis over a moderately large dataset, about 600GB. In order to complete this assignment, you will need to make use of Amazon Web Services (AWS). Amazon has generously offered to provide up to $50 in free AWS credit to each learner in this course to allow you to complete the assignment. Further details regarding the process of receiving this credit are available in the welcome message for the course, as well as in the assignment itself. Please note that Amazon, University of Washington, a…

Read the complete description

Frequently asked questions

There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.

Didn't find what you were looking for? See also: Science, Software / System Engineering, English (FCE / CAE / CPE), Teaching Skills, and Biology.

When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan

  • Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
  • Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.

About this course: Important note: The second assignment in this course covers the topic of Graph Analysis in the Cloud, in which you will use Elastic MapReduce and the Pig language to perform graph analysis over a moderately large dataset, about 600GB. In order to complete this assignment, you will need to make use of Amazon Web Services (AWS). Amazon has generously offered to provide up to $50 in free AWS credit to each learner in this course to allow you to complete the assignment. Further details regarding the process of receiving this credit are available in the welcome message for the course, as well as in the assignment itself. Please note that Amazon, University of Washington, and Coursera cannot reimburse you for any charges if you exhaust your credit. While we believe that this assignment contributes an excellent learning experience in this course, we understand that some learners may be unable or unwilling to use AWS. We are unable to issue Course Certificates for learners who do not complete the assignment that requires use of AWS. As such, you should not pay for a Course Certificate in Communicating Data Results if you are unable or unwilling to use AWS, as you will not be able to successfully complete the course without doing so. Making predictions is not enough! Effective data scientists know how to explain and interpret their results, and communicate findings accurately to stakeholders to inform business decisions. Visualization is the field of research in computer science that studies effective communication of quantitative results by linking perception, cognition, and algorithms to exploit the enormous bandwidth of the human visual cortex. In this course you will learn to recognize, design, and use effective visualizations. Just because you can make a prediction and convince others to act on it doesn’t mean you should. In this course you will explore the ethical considerations around big data and how these considerations are beginning to influence policy and practice. You will learn the foundational limitations of using technology to protect privacy and the codes of conduct emerging to guide the behavior of data scientists. You will also learn the importance of reproducibility in data science and how the commercial cloud can help support reproducible research even for experiments involving massive datasets, complex computational infrastructures, or both. Learning Goals: After completing this course, you will be able to: 1. Design and critique visualizations 2. Explain the state-of-the-art in privacy, ethics, governance around big data and data science 3. Use cloud computing to analyze large datasets in a reproducible way.

Created by:  University of Washington
  • Taught by:  Bill Howe, Director of Research

    Scalable Data Analytics
Basic Info Course 3 of 4 in the Data Science at Scale Specialization Language English How To Pass Pass all graded assignments to complete the course. User Ratings 3.5 stars Average User Rating 3.5See what learners said Coursework

Each course is like an interactive textbook, featuring pre-recorded videos, quizzes and projects.

Help from your peers

Connect with thousands of other learners and debate ideas, discuss course material, and get help mastering concepts.

Certificates

Earn official recognition for your work, and share your success with friends, colleagues, and employers.

University of Washington Founded in 1861, the University of Washington is one of the oldest state-supported institutions of higher education on the West Coast and is one of the preeminent research universities in the world.

Syllabus


WEEK 1


Visualization



Statistical inferences from large, heterogeneous, and noisy datasets are useless if you can't communicate them to your colleagues, your customers, your management and other stakeholders. Learn the fundamental concepts behind information visualization, an increasingly critical field of research and increasingly important skillset for data scientists. This module is taught by Cecilia Aragon, faculty in the Human Centered Design and Engineering Department.


14 videos expand


  1. Video: 01 Introduction: What and Why
  2. Video: 02 Introduction: Motivating Examples
  3. Video: 03 Data Types: Definitions
  4. Video: 04 Mapping Data Types to Visual Attributes
  5. Video: 05 Data Types Exercise
  6. Video: 06 Data Types and Visual Mappings Exercises
  7. Video: 07 Data Dimensions
  8. Video: 08 Effective Visual Encoding
  9. Video: 09 Effective Visual Encoding Exercise
  10. Video: 10 Design Criteria for Visual Encoding
  11. Video: 11 The Eye is not a Camera
  12. Video: 12 Preattentive Processing
  13. Video: 13 Estimating Magnitude
  14. Video: 14 Evaluating Visualizations

Graded: Crime Analytics: Visualization of Incident Reports

WEEK 2


Privacy and Ethics



Big Data has become closely linked to issues of privacy and ethics: As the limits on what we *can* do with data continue to evaporate, the question of what we *should* do with data becomes paramount. Motivated in the context of case studies, you will learn the core principles of codes of conduct for data science and statistical analysis. You will learn the limits of current theory on protecting privacy while still permitting useful statistical analysis.


14 videos expand


  1. Video: Motivation: Barrow Alcohol Study
  2. Video: Barrow Study Problems
  3. Video: Reifying Ethics: Codes of Conduct
  4. Video: ASA Code of Conduct: Responsibilities to Stakeholders
  5. Video: Other Codes of Conduct
  6. Video: Examples of Codified Rules: HIPAA
  7. Video: Privacy Guarantees: First Attempts
  8. Video: Examples of Privacy Leaks
  9. Video: Formalizing the Privacy Problem
  10. Video: Differential Privacy Defined
  11. Video: Global Sensitivity
  12. Video: Laplacian Noise
  13. Video: Adding Laplacian Noise and Proving Differential Privacy
  14. Video: Weaknesses of Differential Privacy


WEEK 3


Reproducibility and Cloud Computing



Science is facing a credibility crisis due to unreliable reproducibility, and as research becomes increasingly computational, the problem seems to be paradoxically getting worse. But reproducibility is not just for academics: Data scientists who cannot share, explain, and defend their methods for others to build on are dangerous. In this module, you will explore the importance of reproducible research and how cloud computing is offering new mechanisms for sharing code, data, environments, and even costs that are critical for practical reproducibility.


17 videos, 1 practice quiz expand


  1. Video: Reproducibility and Data Science
  2. Video: Reproducibility Gold Standard
  3. Video: Anecdote: The Ocean Appliance
  4. Video: Code + Data + Environment
  5. Video: Cloud Computing Introduction
  6. Video: Cloud Computing History
  7. Video: Code + Data + Environment + Platform
  8. Video: Cloud Computing for Reproducible Research
  9. Video: Advantages of Virtualization for Reproducibility
  10. Video: Complex Virtualization Scenarios
  11. Video: Shared Laboratories
  12. Video: Economies of Scale
  13. Video: Provisioning for Peak Load
  14. Video: Elasticity and Price Reductions
  15. Video: Server Costs vs. Power Costs
  16. Video: Reproducibility for Big Data
  17. Video: Counter-Arguments and Summary
  18. Practice Quiz: AWS Credit Opt-in Consent Form

Graded: Graph Analysis in the Cloud
There are no reviews yet.
  • View related products with reviews: Science.

Share your review

Do you have experience with this course? Submit your review and help other people make the right choice. As a thank you for your effort we will donate $1.- to Stichting Edukans.

There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.