Six bytes plus the running dictionary, right?
I need to study how Huffman encoding works. I read a little about it in the 90s, but never quite grasped it beyond the general idea: it builds a binary tree so that more frequent symbols get shorter codes. A little like the early versions of #MorseCode, before O got seriously messed up. ;)
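
For what it's worth, here's a minimal Python sketch of that general idea as I understand it: count the symbols, keep merging the two least frequent subtrees until one tree is left, then read each code off the path from the root. The name `huffman_codes` and the 0 = left / 1 = right convention are just my assumptions for illustration, not any particular library's API, and a real compressor would also have to store or transmit the tree.

```python
import heapq
from collections import Counter


def huffman_codes(text):
    """Return a dict mapping each symbol in text to a bit string (hypothetical helper)."""
    freq = Counter(text)
    if not freq:
        return {}
    if len(freq) == 1:
        # A lone symbol still needs at least one bit.
        return {next(iter(freq)): "0"}

    # Heap entries are (weight, tiebreak, tree). A tree is either a symbol
    # (a 1-char string) or a (left, right) tuple. The unique tiebreak keeps
    # Python from ever comparing two trees directly.
    heap = [(w, i, sym) for i, (sym, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    next_id = len(heap)

    # Repeatedly merge the two lightest subtrees.
    while len(heap) > 1:
        w1, _, t1 = heapq.heappop(heap)
        w2, _, t2 = heapq.heappop(heap)
        heapq.heappush(heap, (w1 + w2, next_id, (t1, t2)))
        next_id += 1

    codes = {}

    def walk(tree, prefix):
        if isinstance(tree, tuple):
            walk(tree[0], prefix + "0")  # assumed convention: left = 0
            walk(tree[1], prefix + "1")  # assumed convention: right = 1
        else:
            codes[tree] = prefix

    walk(heap[0][2], "")
    return codes


if __name__ == "__main__":
    for sym, code in sorted(huffman_codes("aaaabbc").items()):
        print(sym, code)
```

Frequent symbols end up nearer the root, so their codes are shorter, and since symbols only sit at the leaves, no code is a prefix of another.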