Notation and Definitions
Notation
Scalars
A scalars is a simple numerical value like 19, or -3,45. Variable or constant that takes scalar values are denoted by an italic letter like
Vectors
A vector is an ordered list of scalar values, called attributes. We denote a vector as a bold character, for example,
Set
A set is an unordered collection of unique elements. We denote a set as a calligraphic capital character, for example,
When an element x belongs to a set
We can obtain a new set
Capital Sigma Notation
The summation over a collection
The notation
Capital Pi Notation
notation analogous to capital sigma is the capital pi notation. It denotes a product of elements in a collection or attributes of a vector:
where
Operation of Sets
A derived set creation operator looks like this:
Operations of Vectors
Sum of two vectors
Difference of two vectors
Scalar Multiplication
Functions
A function is a relation that associates each element x of a set
Max and Arg Max
Given a set of values
returns the highest value f(a) for all elements in the set A. On the other hand, the operator,
returns the element of the set A that maximizes f(a).
Sometimes, when the set is implicit or infinite, we can write min and arg min operate in a similar manner.
Assignment Operator
The expression
Derivative and Gradient
A derivative
Random Variable
A
- Discreate Random Variable
- Continuous Random Variable
Unbiased Estimators
Bayesâ Rule
The conditional probability
Parameter Estimation
Classification vs. Regression
Classification is a problem of automatically assigning a label to an unlabeled example. Spam detection is a famous example of classification.
In machine learning, the classification problem is solved by a classification learning algorithm that takes a collection of labeled examples as inputs and produces a model that can take an unlabeled example as input and either directly output a label. In a classification problem, a label is a member of finite set of classes.
- Binary classification : When the size of the set of classes is two.
- Multiclass classification : When the size of the set of classes is three or more.
Regression is a problem of predicting a continuous real-valued label (target) given an unlabeled example. Estimating house price valuation based on house features is a famous example of regression. The regression problem is solved by a regression learning algorithm that takes a collection of labeled examples as inputs and produces a model that can take an unlabeled example as input and output a target
Model-Based vs. Instance-Based Learning
Most supervised learning algorithms are model-based. Model-Based learning algorithms use the training data to create a model that has parameters learned from the training data. In SVM the two parameters were w* and b*. After model is built, the training data can be discarded.
Instance-based learning algorithms use the whole dataset as the model. One instance-based algorithm frequently used in practice is K-Nearest Neighbors (KNN). In classification, to predict a label for an input example the KNN algorithmlooks at the close neighborhood of the input example in the space of feature vectors and outputs the label that it saw the most often in this close neighborhood
Shallow vs. Deep Learning
A shallow learning algorithm learns the parameters of the model directly from the features of the training examples. Most supervised learning algorithms are shallow. The notorious exceptions are neural network learning algorithms, specifically those that build neural networks with more than one layer between input and output. Such neural networks are called deep neural networks. In deep neural network learning (or, simply, deep learning), contrary to shallow learning, most model parameters are learned not directly from the features of the training examples, but from the outputs of the preceding layers.