DSWoK — Data Science Well of Knowledge
#attention
2 notes · co-occurs with 3 tags · last updated May 18, 2026
Co-tags
#nlp (2)
#architecture (2)
#transformer (2)
Notes tagged #attention
01
Attention
Attention is a mechanism that lets a neural network focus on specific parts of an input sequence.
May 18, 2026
Deep Learning
02
Transformer
The Transformer architecture was introduced in the paper "Attention Is All You Need" (2017); BERT was published soon after, in 2018.
May 18, 2026
NLP