Flash cards
Review the key moves
What is the main idea behind Data Science - Intro to Statistics?
Lesson checks
Practice each idea before moving on
Short Mimo-style checks built from this lesson's code, terms, and sequence.
Which statement best captures the main point of this lesson?
Complete the missing token from the example code.
___ (full_health_data.describe())Put the learning moves in the order that makes the concept easiest to apply.
Before charting or modeling a dataset, which move should come first?
Introduction to Statistics
Statistics is the science of analyzing data.
When we have created a model for prediction, we must assess the prediction's reliability.
After all, what is a prediction worth, if we cannot rely on it?
Descriptive Statistics
We will first cover some basic descriptive statistics.
Descriptive statistics summarizes important features of a data set such as:
- Count
- Sum
- Standard Deviation
- Percentile
- Average
- Etc..
It is a good starting point to become familiar with the data.
We can use the describe() function in Python to summarize the data:
Example
print (full_health_data.describe())Output
Do you see anything interesting here?