For your first question: https://platform.openai.com/tokenizer

calibas · 2024-06-06T20:20:28 1717705228

I saw that, but the language makes me think it's not quite the same as what's really being used?

"how a piece of text might be tokenized by a language model"

"It's important to note that the exact tokenization process varies between models."

yorwba · 2024-06-06T20:25:49 1717705549

That's why they have buttons to choose which model's tokenizer to use.

calibas · 2024-06-06T21:16:23 1717708583

Yes, thank you, I understand that part.

It's the might condition in the description that makes me think the results might not be the exact same as what's used in the live models.

baobabKoodaa · 2024-06-07T01:07:58 1717722478

The results are the same.