Posts Tagged "ML"
A Mini BERT Cross-Section: How [PAD] Tokens Work in Attention Masking
February 20, 2026
A deep dive into how [PAD] tokens behave inside BERT, following the pipeline from embedding through matmul to attention masking.