Byte-Pair Encoding (BPE)

February 15, 2024
Supervised Learning

In this post we'll show how to train a custom Byte-Pair Encoding tokenizer on a dataset of domain names using the Hugging Face library