I actually wonder if he could just claim a win by calculating validation set BPB...

anonymoushn 78 days ago \| on: Tokenization for language modeling: BPE vs. Unigra...

I actually wonder if he could just claim a win by calculating validation-set BPB (bits per byte) for both equally sized vocabs, instead of targeting the same perplexity level as the speedrun finish line lol
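For anyone unfamiliar with the metric: BPB divides the model's total negative log-likelihood (converted to bits) by the number of bytes in the text, so it does not depend on how many tokens a tokenizer produces. Per-token perplexity does depend on that. A minimal sketch of the comparison follows; the function names and all of the numbers are made up for illustration and are not results from the linked post.

```python
import math

def bits_per_byte(total_nll_nats: float, total_bytes: int) -> float:
    """Summed token-level negative log-likelihood (in nats) -> bits per byte."""
    return total_nll_nats / (math.log(2) * total_bytes)

def per_token_perplexity(total_nll_nats: float, num_tokens: int) -> float:
    """exp of the mean per-token loss; depends on the tokenizer's token count."""
    return math.exp(total_nll_nats / num_tokens)

# Hypothetical validation text of 1000 bytes, encoded by two tokenizers
# with equally sized vocabularies (illustrative numbers only).
val_bytes = 1000
nll_a, tokens_a = 250 * 3.0, 250   # tokenizer A: 250 tokens, 3.0 nats/token
nll_b, tokens_b = 200 * 3.6, 200   # tokenizer B: 200 tokens, 3.6 nats/token

bpb_a = bits_per_byte(nll_a, val_bytes)
bpb_b = bits_per_byte(nll_b, val_bytes)
ppl_a = per_token_perplexity(nll_a, tokens_a)
ppl_b = per_token_perplexity(nll_b, tokens_b)

print(f"A: BPB={bpb_a:.4f}  per-token perplexity={ppl_a:.2f}")
print(f"B: BPB={bpb_b:.4f}  per-token perplexity={ppl_b:.2f}")
```

In this made-up case tokenizer B looks worse on per-token perplexity but better on BPB, which is why the two yardsticks can disagree about which tokenizer "wins".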