17 notes · co-occurs with 12 tags · last updated May 18, 2026

Notes tagged #algorithm
01
Decision Tree
A Decision Tree is a supervised learning algorithm used for both classification and regression tasks.
May 18, 2026
General ML
02
Gradient boosting
Gradient Boosting is an ensemble machine learning technique (boosting) that combines weak learners (typically shallow Decision Tree) sequentially to create a strong predictive model.
May 18, 2026
General ML
03
K-Nearest Neighbors
K-Nearest Neighbors (KNN) is a simple, non-parametric algorithm used for both classification and regression tasks.
May 18, 2026
General ML
04
K-means clustering
K-means is an unsupervised machine learning algorithm used for partitioning a dataset into K distinct, non-overlapping subgroups (clusters).
May 18, 2026
General ML
05
Linear Regression
Linear regression is a supervised algorithm or statistical method that learns to model a dependent variable (target) as a function of some independent variables (features) by finding a line (or surface) that best “fits” the data.
May 18, 2026
General ML
06
Logistic regression
Logistic regression is a statistical method used for binary classification problems, modeling the probability of an instance belonging to a particular class.
May 18, 2026
General ML
07
Multi-armed bandits
A multi-armed bandit is a sequential decision problem where a learner repeatedly chooses among k actions (arms), observes a stochastic reward for the chosen arm only, and adapts future choices to balance exploration (sampling under-tested arms to learn their value) against exploitation (sampling the...
May 18, 2026
General ML
08
Random Forest
Random Forest is an ensemble learning method (bagging) that constructs a multitude of decision trees at training time and outputs the class that is the mode of the classes (classification) or mean prediction (regression) of the individual trees.
May 18, 2026
General ML
09
SVM
Support Vector Machines (SVM) is a supervised learning algorithm used for classification, regression, and outlier detection.
May 18, 2026
General ML
10
BERT
Most of the information is available in the BERT paper. Key details: Multi-head attention. Transformer encoder.
May 18, 2026
NLP
11
BERTopic
BERTopic is a modular topic modeling pipeline.
May 18, 2026
NLP
12
GloVe
GloVe (Global Vectors for Word Representation) is a word embedding technique developed in 2014.
May 18, 2026
NLP
13
LDA
Latent Dirichlet Allocation (Blei, Ng, Jordan, 2003) is the canonical probabilistic topic model.
May 18, 2026
NLP
14
RNN
Recurrent Neural Networks are a type of neural networks designed for processing sequential data.
May 18, 2026
NLP
15
Term Frequency-Inverse Document Frequency
Term Frequency-Inverse Document Frequency (TF-IDF) is a numerical statistic used in NLP to show how important a word (term) is to a document in a corpus.
May 18, 2026
NLP
16
Word2Vec
Word2Vec is a approach for learning word embeddings.
May 18, 2026
NLP
17
fastText
fastText is a library for efficient text classification and word representation learning by Facebook.
May 18, 2026
NLP