返回时间轴
2017-12

"Attention Is All You Need" — the Transformer architecture is born

Capability Breakthrough

事件摘要

Eight Google researchers published 'Attention Is All You Need' at NeurIPS 2017, introducing the Transformer architecture. By dispensing with recurrence and convolution entirely and relying solely on self-attention mechanisms, the Transformer achieved state-of-the-art translation results while training significantly faster. The paper has been cited over 250,000 times, and the Transformer became the foundation of every major AI system since—from GPT and BERT to AlphaFold and modern speech models.

影响评估

  • Capability Leap +3 · Long-term

    Replaced RNNs and CNNs as the dominant sequence-processing architecture across NLP, vision, speech, and biology. The self-attention mechanism proved so general and scalable that it became the universal compute substrate for modern AI—every major AI system since 2018 (BERT, GPT series, Claude, Gemini, AlphaFold) uses Transformers at its core.

    Affected Groups: all AI researchers, NLP researchers, computer vision researchers, computational biologists

  • Economic Disruption +3 · Long-term

    Enabled the scaling laws that led to GPT-3, ChatGPT, and all subsequent large language models. The authors collectively founded or joined companies now worth tens of billions (Cohere, Character.AI, Essential AI). The architecture's parallelizability made GPU/TPU training efficient at unprecedented scale, directly shaping the modern AI hardware market.

    Affected Groups: tech industry, investors, hardware manufacturers, startups

  • Paradigm Shift +3 · Long-term

    'Attention Is All You Need' became a scientific meme. Its audacious title captured a truth that proved deeper than the authors knew: attention was sufficient not just for translation, but for vision, protein folding, reasoning, and generation. The paper marks the clearest 'before and after' line in modern AI research methodology.

    Affected Groups: entire AI field, researchers, engineers

共识度与来源

重要度 L3
分类 Capability Breakthrough
共识度 Broad Consensus
影响指数 10/10