DSWoK — Data Science Well of Knowledge

#attention

2 notes · co-occurs with 3 tags · last updated May 18, 2026

Co-tags: #nlp (2) · #architecture (2) · #transformer (2)
Notes tagged #attention
01
Attention
Attention is a mechanism that lets neural networks focus on specific parts of an input sequence.
May 18, 2026
Deep Learning
02
Transformer
The Transformer was first introduced in the paper "Attention Is All You Need"; BERT was published soon after.
May 18, 2026
NLP

Created with Quartz v4.5.2 © 2026