Try   HackMD
  •  
    Ping Wang
    Last edited by Ping Wang on Apr 18, 2024
    Linked with GitHub
    Contributed by

C++ Data Structure and Algorithms

  • Introduction
  • Data Structure
  • Tree and Graph
  • Divide and Conquer
  • Dynamic Programming
  • Problem Solving
Last changed by 

 
Ping Wang
0
50

Read more

UW HYAK Notes

Request for GPU

Mar 14, 2025
Summary: Decoder-Only Transformers: The Workhorse

Introduction This article delves into the architecture and mechanics of decoder-only transformers, which are a crucial component of many large language models (LLMs). It highlights the structure, attention mechanisms, and embedding techniques that make these models effective for various natural language processing (NLP) tasks. Decoder-Only Transformer Architecture Overview Decoder-only transformers, unlike the traditional encoder-decoder structure, use only the decoder component to process and generate text. This architecture is particularly suited for tasks that involve sequential generation, such as text completion and language modeling. Structure The decoder-only transformer consists of multiple layers, each containing self-attention mechanisms and feed-forward neural networks.

Jun 26, 2024
Summary: The Illustrated Transformer

Introduction This article provides a visual and intuitive explanation of the transformer architecture, which has revolutionized natural language processing (NLP) by enabling efficient handling of sequential data through self-attention mechanisms. It covers the structure, mechanics, and key components such as the encoder, decoder, attention mechanisms, and embeddings. Transformer Architecture Overview The transformer architecture, introduced by Vaswani et al. in 2017, eliminates the need for recurrent layers by using self-attention mechanisms, allowing for parallel processing and better handling of long-range dependencies. Encoder-Decoder Structure The transformer model consists of an encoder-decoder architecture, each composed of multiple layers.

Jun 26, 2024
Summary: Transformers Explained

This article offers a detailed explanation of transformers, a revolutionary architecture in natural language processing (NLP) that has significantly advanced the capabilities of large language models (LLMs). It covers the structure, mechanics, and key components of transformers, including the encoder, decoder, attention mechanisms, and embeddings.

Jun 26, 2024
Read more from Ping Wang

Published on HackMD
    • C++ Data Structure and Algorithms
    Expand allBack to topGo to bottom
  • C++ Data Structure and Algorithms
Expand allBack to topGo to bottom

Sign in

Forgot password

or

By clicking below, you agree to our terms of service.

Sign in via Facebook Sign in via Twitter Sign in via GitHub Sign in via Dropbox Sign in with Wallet
Wallet ( )
Connect another wallet

New to HackMD? Sign up