↩ Back to danieldemmel.me

TokenTokenTokenOnline LLM TokenizerTokenTokenToken

A pure Javascript tokenizer running in your browser that can load tokenizer.json and tokenizer_config.json from any repository on Huggingface. You can use it to count tokens and compare how different large language model vocabularies work. It's also useful for debugging prompt templates. If you are wondering why are there so many models under Xenova, it's because they work for HuggingFace and re-upload just the tokenizers, so it's possible to load them without agreeing to model licences.