This is a naive implementation of Byte Pair Encoding (BPE) for tokenization of text.
Clone the repository and run example.py to train and tokenize a sample text.
git clone https://github.com/shahad-mahmud/bpe.git
python example.py| Name | Name | Last commit date | ||
|---|---|---|---|---|