26 notes · co-occurs with 14 tags · last updated May 18, 2026

Notes tagged #concept
01
LoRA
LoRA (Low-Rank Adaptation) is a parameter-efficient fine-tuning technique that allows to adapt large pre-trained models to specific tasks while minimizing computational resources.
May 18, 2026
Deep Learning
02
Negative sampling
Negative sampling trains a model by contrasting each observed positive with a small set of sampled alternatives instead of every item in the catalog.
May 18, 2026
Deep Learning
03
logQ correction
LogQ correction is a bias correction technique used in recommendation systems to account for non-uniform sampling during training.
May 18, 2026
Deep Learning
04
AB Tests
A/B testing (online controlled experimentation) randomly assigns units to a treatment or a control variant and compares aggregate outcomes, attributing the observed difference to the change.
May 18, 2026
General ML
05
Bias-Variance Trade-off
The bias-variance trade-off is a fundamental concept in machine learning that describes the balance between a model’s ability to fit the training data (low bias) and its ability to generalize to new, unseen data (low variance).
May 18, 2026
General ML
06
CUPED
CUPED makes A/B tests detect smaller effects without needing more users.
May 18, 2026
General ML
07
Cold start
Cold start is the problem of producing useful recommendations when a user, item, or market has little or no reliable interaction history.
May 18, 2026
General ML
08
Dimensionality Reduction
Dimensionality reduction is the process of reducing the number of features (dimensions) in a dataset while preserving as much relevant information as possible.
May 18, 2026
General ML
09
Distance calculation
Cosine similarity measures the cosine of the angle between two vectors, effectively capturing their orientation similarity while ignoring their magnitude.
May 18, 2026
General ML
10
Multi-armed bandits
A multi-armed bandit is a sequential decision problem where a learner repeatedly chooses among k actions (arms), observes a stochastic reward for the chosen arm only, and adapts future choices to balance exploration (sampling under-tested arms to learn their value) against exploitation (sampling the...
May 18, 2026
General ML
11
Regularization
Regularization is a technique used in machine learning to prevent overfitting by adding a penalty term to the loss function.
May 18, 2026
General ML
12
Time-series validation
This is one of the types of Validation, which deserves a special explanation due to the sheer variability and complexity.
May 18, 2026
General ML
13
Training-serving skew
Training-serving skew is a mismatch between the data representation used to train and evaluate a model and the representation available at serving time.
May 18, 2026
General ML
14
Validation
Model validation is the process of assessing how well a trained machine learning model performs on unseen data.
May 18, 2026
General ML
15
Behavioral interviews
Behavioral interview is a stage of the job interview process in which candidates are asked to describe specific situations from their past experiences to demonstrate their skills, abilities, and character traits.
May 18, 2026
Interview_preparation
16
Leetcode code templates
Two pointers: one input, opposite ends def fn(arr): left = ans = 0 right = len(arr) - 1 while left < right: # do some logic here with left and right if CONDITION: left += 1 else: right -= 1 return ans Two pointers: two inputs, exhaust both def fn(arr1, arr2): i = j = ans = 0 while i < len(arr1...
May 18, 2026
Interview_preparation
17
ML System design
ML System Design interview is a stage of the job interview process focused on assessing a candidate’s ability to design and implement machine learning systems at scale.
May 18, 2026
Interview_preparation
18
Questions to ask the interviewers
During the interview process, it is important not just to answer questions but also to ask your own questions.
May 18, 2026
Interview_preparation
19
Calibration
A classifier is calibrated if its predicted probabilities match observed frequencies: among examples assigned a 0.7 score, roughly 70% should be positive.
May 18, 2026
Metrics and losses
20
Confusion matrix
A Confusion Matrix is a table used to evaluate the performance of a classification model on a set of data for which the true values are known.
May 18, 2026
Metrics and losses
21
General losses
Loss functions (also called objective functions or cost functions) are mathematical measures of the error between predicted and actual values.
May 18, 2026
Metrics and losses
22
RAG
Retrieval-Augmented Generation (RAG) is a hybrid approach that combines LLM’s generative abilities with real-time information retrieval from external knowledge sources​.
May 18, 2026
NLP
23
Topic Modeling Methods
A survey of the main topic modeling methods, ordered roughly by historical development (matrix factorization → probabilistic generative models → neural → embedding-based).
May 18, 2026
NLP
24
Topic Modeling
Topic modeling is an unsupervised technique for discovering abstract themes in a document collection, where a document is whatever unit of text the project treats as one (article, review, tweet, paragraph, support ticket).
May 18, 2026
NLP
25
Word Embeddings
Word embedding is a representation of a word, usually with a vector of values.
May 18, 2026
NLP
26
Recommendation system
This note covers how recommendation systems are designed, built, evaluated, deployed, and debugged in production.
May 18, 2026
Use_cases