18 notes · co-occurs with 16 tags · last updated May 18, 2026

Notes tagged #nlp
01
Attention
The original paper Attention is a mechanism that lets neural networks focus on specific parts of an input sequence.
May 18, 2026
Deep Learning
02
Contrastive Language-Image Pre-training
CLIP (Contrastive Language-Image Pre-training) is a neural net model developed by OpenAI that efficiently learns visual concepts from natural language supervision.
May 18, 2026
Deep Learning
03
LoRA
LoRA (Low-Rank Adaptation) is a parameter-efficient fine-tuning technique that allows to adapt large pre-trained models to specific tasks while minimizing computational resources.
May 18, 2026
Deep Learning
04
NLP losses
Loss functions commonly used in NLP tasks. For general losses (Cross-Entropy, MSE, KL Divergence), see General losses.
May 18, 2026
Metrics and losses
05
NLP metrics
NLP metrics evaluate tasks from machine translation and summarization to question answering and sequence labeling.
May 18, 2026
Metrics and losses
06
BERT
Most of the information is available in the BERT paper. Key details: Multi-head attention. Transformer encoder.
May 18, 2026
NLP
07
BERTopic
BERTopic is a modular topic modeling pipeline.
May 18, 2026
NLP
08
GloVe
GloVe (Global Vectors for Word Representation) is a word embedding technique developed in 2014.
May 18, 2026
NLP
09
LDA
Latent Dirichlet Allocation (Blei, Ng, Jordan, 2003) is the canonical probabilistic topic model.
May 18, 2026
NLP
10
RAG
Retrieval-Augmented Generation (RAG) is a hybrid approach that combines LLM’s generative abilities with real-time information retrieval from external knowledge sources​.
May 18, 2026
NLP
11
RNN
Recurrent Neural Networks are a type of neural networks designed for processing sequential data.
May 18, 2026
NLP
12
Term Frequency-Inverse Document Frequency
Term Frequency-Inverse Document Frequency (TF-IDF) is a numerical statistic used in NLP to show how important a word (term) is to a document in a corpus.
May 18, 2026
NLP
13
Topic Modeling Methods
A survey of the main topic modeling methods, ordered roughly by historical development (matrix factorization → probabilistic generative models → neural → embedding-based).
May 18, 2026
NLP
14
Topic Modeling
Topic modeling is an unsupervised technique for discovering abstract themes in a document collection, where a document is whatever unit of text the project treats as one (article, review, tweet, paragraph, support ticket).
May 18, 2026
NLP
15
Transformer
The first Transformer was introduced in the Attention Is All You Need paper, soon after that BERT was published.
May 18, 2026
NLP
16
Word Embeddings
Word embedding is a representation of a word, usually with a vector of values.
May 18, 2026
NLP
17
Word2Vec
Word2Vec is a approach for learning word embeddings.
May 18, 2026
NLP
18
fastText
fastText is a library for efficient text classification and word representation learning by Facebook.
May 18, 2026
NLP