tf-tpu/unigram-tokenizer-wikitext


sayakpaul HF Staff

Create README.md (#1)

2ea9793 over 3 years ago


This is a Unigram tokenizer trained on the WikiText dataset. See the `train_unigram.py` script in this repository for how it was trained.
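
For readers unfamiliar with the Unigram model, the following is a minimal sketch of how such a tokenizer typically picks a segmentation: each vocabulary piece carries a log-probability, and the tokenizer chooses the split with the highest total score, here found with a Viterbi-style dynamic program. The vocabulary `TOY_VOCAB`, its probabilities, and the `segment` helper are hypothetical and made up for illustration; they are not taken from this repository's trained tokenizer or from `train_unigram.py`.

```python
import math

# Hypothetical toy vocabulary: piece -> log-probability.
# These numbers are invented for illustration, not learned from WikiText.
TOY_VOCAB = {
    "uni": math.log(0.07),
    "un": math.log(0.10),
    "i": math.log(0.05),
    "gram": math.log(0.08),
    "g": math.log(0.02),
    "ram": math.log(0.04),
}


def segment(text, vocab):
    """Return the highest-scoring split of `text` into pieces from `vocab`."""
    n = len(text)
    best = [-math.inf] * (n + 1)  # best[i]: best log-prob of text[:i]
    back = [0] * (n + 1)          # back[i]: start index of the last piece
    best[0] = 0.0
    for end in range(1, n + 1):
        for start in range(end):
            piece = text[start:end]
            if piece in vocab and best[start] + vocab[piece] > best[end]:
                best[end] = best[start] + vocab[piece]
                back[end] = start
    if best[n] == -math.inf:
        raise ValueError(f"cannot segment {text!r} with this vocabulary")
    # Walk the back-pointers from the end to recover the pieces.
    pieces = []
    pos = n
    while pos > 0:
        pieces.append(text[back[pos]:pos])
        pos = back[pos]
    return pieces[::-1]


print(segment("unigram", TOY_VOCAB))
```

With these made-up scores, "uni" + "gram" outscores "un" + "i" + "gram", so the two-piece split is chosen. A real tokenizer also handles unknown characters and whitespace markers, which this sketch leaves out.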