Machine Learning - Normal Data Distribution

Normal Data Distribution

In the previous chapter we learned how to create a completely random array, of a given size, and between two given values.

In this chapter we will learn how to create an array where the values are concentrated around a given value.

In probability theory this kind of data distribution is known as the normal data distribution, or the Gaussian data distribution, after the mathematician Carl Friedrich Gauss, who came up with the formula for it.
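For reference, the formula in question is usually written as the probability density below. The symbols are not used elsewhere in this lesson and are introduced here only for illustration: \mu stands for the mean and \sigma for the standard deviation.

```latex
f(x) = \frac{1}{\sigma \sqrt{2\pi}} \exp\left( -\frac{(x - \mu)^2}{2\sigma^2} \right)
```

The density is highest at x = \mu and falls off symmetrically on both sides, which is what produces the concentration around a given value.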

Example

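import numpy
import matplotlib.pyplot as plt

# Draw 100000 values from a normal distribution: mean 5.0, standard deviation 1.0
x = numpy.random.normal(5.0, 1.0, 100000)

# Plot the values as a histogram with 100 bars
plt.hist(x, 100)
plt.show()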

Note

A normal distribution graph is also known as the bell curve because of its characteristic bell shape.

Histogram Explained

We use the array returned by the numpy.random.normal() function, with 100000 values, to draw a histogram with 100 bars.
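To see what "100 bars" means in numbers rather than in a picture, the counts can be computed directly. This is a minimal sketch, not part of the original lesson: it assumes numpy.histogram() is an acceptable stand-in for the binning that plt.hist() draws, and it uses a seeded generator (the seed value is arbitrary) so that runs are repeatable.

```python
import numpy as np

# Seeded generator for repeatable output; the seed is an arbitrary choice.
rng = np.random.default_rng(0)
x = rng.normal(5.0, 1.0, 100_000)

# 100 equal-width bins between the smallest and largest value:
# this yields 100 counts and 101 bin edges.
counts, edges = np.histogram(x, bins=100)

# Index of the tallest bar
tallest = int(np.argmax(counts))

print("number of bars:", len(counts))
print("number of bin edges:", len(edges))
print("total values counted:", counts.sum())
print("tallest bar spans", edges[tallest], "to", edges[tallest + 1])
```

Each entry in counts is the height of one bar, and the tallest bar should sit close to the mean of 5.0.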

We specify that the mean value is 5.0 and the standard deviation is 1.0.

This means that the values are concentrated around 5.0: about 68% of them fall within 1.0 of the mean, and about 95% fall within 2.0.

As you can see from the histogram, most values are between 4.0 and 6.0, with a peak at approximately 5.0.
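The spread described above can also be checked numerically instead of read off the plot. The sketch below is an illustration under stated assumptions: the helper fraction_within() is a hypothetical name introduced here, and a seeded generator replaces the lesson's numpy.random.normal() call so the numbers are repeatable.

```python
import numpy as np

# Seeded generator for repeatable output; the seed is an arbitrary choice.
rng = np.random.default_rng(42)
x = rng.normal(loc=5.0, scale=1.0, size=100_000)


def fraction_within(values, mean, distance):
    """Share of values no further than `distance` from `mean` (hypothetical helper)."""
    return float(np.mean(np.abs(values - mean) <= distance))


within_1 = fraction_within(x, 5.0, 1.0)  # values between 4.0 and 6.0
within_2 = fraction_within(x, 5.0, 2.0)  # values between 3.0 and 7.0

print(f"sample mean: {x.mean():.3f}")
print(f"sample standard deviation: {x.std():.3f}")
print(f"share within 1.0 of the mean: {within_1:.3f}")
print(f"share within 2.0 of the mean: {within_2:.3f}")
```

With this many samples, the first share should come out near 0.68 and the second near 0.95, so roughly a third of the values still land more than one standard deviation from the mean.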
