bugl
bugl
HomeLearnPatternsPathsSearch
HomeLearnPatternsPathsSearch

Loading lesson path

Learn/Python

Python

Data Science and Scientific Python

NumPy, Pandas, SciPy, plotting, and machine learning topics grouped into one practical track.

Lesson 1

NumPy Tutorial

NumPy is a Python library.

Read lesson →Loading…
Lesson 2

Matplotlib Tutorial

Matplotlib is a low level graph plotting library in python that serves as a visualization utility.

Read lesson →Loading…
Lesson 3

Machine Learning

Machine Learning is making the computer learn from studying data and statistics.

Read lesson →Loading…
Lesson 4

Pandas Tutorial

Pandas is a Python library.

Read lesson →Loading…
Lesson 5

Matplotlib Getting Started

If you have Python and PIP already installed on a system, then installation of Matplotlib is very easy.

Read lesson →Loading…
Lesson 6

Machine Learning - Mean Median Mode

What can we learn from looking at a group of numbers?

Read lesson →Loading…
Lesson 7

Matplotlib Pyplot

Most of the Matplotlib utilities lies under the pyplot submodule, and are usually imported under the plt alias:

Read lesson →Loading…
Lesson 8

Machine Learning - Standard Deviation

Standard deviation is a number that describes how spread out the values are.

Read lesson →Loading…
Lesson 9

Matplotlib Plotting

The plot() function is used to draw points (markers) in a diagram.

Read lesson →Loading…
Lesson 10

Machine Learning - Percentiles

Percentiles are used in statistics to give you a number that describes the value that a given percent of the values are lower than.

Read lesson →Loading…
Lesson 11

Matplotlib Markers

You can use the keyword argument marker to emphasize each point with a specified marker:

Read lesson →Loading…
Lesson 12

Machine Learning - Data Distribution

In the real world, the data sets are much bigger, but it can be difficult to gather real world data, at least at an early stage of a project.

Read lesson →Loading…
Lesson 13

Matplotlib Line

You can use the keyword argument linestyle , or shorter ls , to change the style of the plotted line:

Read lesson →Loading…
Lesson 14

Machine Learning - Normal Data Distribution

In probability theory this kind of data distribution is known as the normal data distribution , or the Gaussian data distribution , after the mathematician Carl Friedrich Gauss who came up with the f…

Read lesson →Loading…
Lesson 15

Matplotlib Labels and Title

With Pyplot, you can use the xlabel() and ylabel() functions to set a label for the x- and y-axis.

Read lesson →Loading…
Lesson 16

Machine Learning - Scatter Plot

A scatter plot is a diagram where each value in the data set is represented by a dot.

Read lesson →Loading…
Lesson 17

Matplotlib Adding Grid Lines

With Pyplot, you can use the grid() function to add grid lines to the plot.

Read lesson →Loading…
Lesson 18

Machine Learning - Linear Regression

The term regression is used when you try to find the relationship between variables.

Read lesson →Loading…
Lesson 19

Matplotlib Subplot

With the subplot() function you can draw multiple plots in one figure:

Read lesson →Loading…
Lesson 20

Machine Learning - Polynomial Regression

If your data points clearly will not fit a linear regression (a straight line through all data points), it might be ideal for polynomial regression.

Read lesson →Loading…
Lesson 21

Matplotlib Scatter

With Pyplot, you can use the scatter() function to draw a scatter plot.

Read lesson →Loading…
Lesson 22

Machine Learning - Multiple Regression

Multiple regression is like linear regression , but with more than one independent value, meaning that we try to predict a value based on two or more variables.

Read lesson →Loading…
Lesson 23

Matplotlib Bars

With Pyplot, you can use the bar() function to draw bar graphs:

Read lesson →Loading…
Lesson 24

Machine Learning - Scale

When your data has different values, and even different measurement units, it can be difficult to compare them. What is kilograms compared to meters? Or altitude compared to time?

Read lesson →Loading…
Lesson 25

Matplotlib Histograms

A histogram is a graph showing frequency distributions.

Read lesson →Loading…
Lesson 26

Machine Learning - Train/Test

In Machine Learning we create models to predict the outcome of certain events, like in the previous chapter where we predicted the CO2 emission of a car when we knew the weight and engine size.

Read lesson →Loading…
Lesson 27

Matplotlib Pie Charts

With Pyplot, you can use the pie() function to draw pie charts:

Read lesson →Loading…
Lesson 28

Machine Learning - Decision Tree

In the example, a person will try to decide if he/she should go to a comedy show or not.

Read lesson →Loading…
Lesson 29

Machine Learning - Confusion Matrix

It is a table that is used in classification problems to assess where errors in the model were made.

Read lesson →Loading…
Lesson 30

Machine Learning - Hierarchical Clustering

Hierarchical clustering is an unsupervised learning method for clustering data points. The algorithm builds clusters by measuring the dissimilarities between data. Unsupervised learning means that a…

Read lesson →Loading…
Lesson 31

Machine Learning - Logistic Regression

Logistic regression aims to solve classification problems. It does this by predicting categorical outcomes, unlike linear regression that predicts a continuous outcome.

Read lesson →Loading…
Lesson 32

Machine Learning - Grid Search

The majority of machine learning models contain parameters that can be adjusted to vary how the model learns. For example, the logistic regression model, from sklearn , has a parameter C that control…

Read lesson →Loading…
Lesson 33

Preprocessing - Categorical Data

When your data has categories represented by strings, it will be difficult to use them to train machine learning models which often only accepts numeric data.

Read lesson →Loading…
Lesson 34

Machine Learning - K-means

K-means is an unsupervised learning method for clustering data points. The algorithm iteratively divides data points into K clusters by minimizing the variance in each cluster.

Read lesson →Loading…
Lesson 35

Machine Learning - Bootstrap Aggregation (Bagging)

Methods such as Decision Trees, can be prone to overfitting on the training set which can lead to wrong predictions on new data.

Read lesson →Loading…
Lesson 36

Machine Learning - Cross Validation

When adjusting models we are aiming to increase overall model performance on unseen data. Hyperparameter tuning can lead to much better performance on test sets. However, optimizing parameters to the…

Read lesson →Loading…
Lesson 37

Machine Learning - AUC - ROC Curve

In classification, there are many different evaluation metrics. The most popular is accuracy , which measures how often the model is correct. This is a great metric because it is easy to understand a…

Read lesson →Loading…
Lesson 38

Machine Learning - K-nearest neighbors (KNN)

KNN is a simple, supervised machine learning (ML) algorithm that can be used for classification or regression tasks - and is also frequently used in missing value imputation. It is based on the idea…

Read lesson →Loading…