I was thinking about LLM tokenization (as one does) and had a thought: We select the next output token for an LLM based on its likelihood, but (some) shorter tokens are more likely.
Why? Longer tokens can only complete one word, but some shorter tokens can complete many words …
exfatloss recently wrote about the difference between being satiated and being full, and not experiencing satiety until their 30's. Thinking about this made me realize that there's at least four axes of hunger (pangs, appetite, fullness and emotional state), and some interesting edge cases. These hunger feelings are correlated …
I've gone snowboarding about 30 times since I started learning a few years ago, but every time I'm on a lift, most of the other riders have been out 90 days just this season. In fact, almost everyone I see has been skiing or snowboarding for decades, and comes …
Note: I don't know if this is useful for any mouse except for mine (Anker Vertical Mouse). I'm posting this partially because it might be useful to someone else and partially because I'm trying to ~~spam the site~~ post something every day after being pre-inspired byInkhaven.
Claude has trouble playing Pokemon partially because it can't see the screen very well. This made me wonder if Claude would be better at an ASCII game like Dwarf Fortress, where it doesn't need to rely on image recognition.