talking about the model spitting out tokens, not the app