The Truth About the Family Emoji

One of the most misunderstood emojis in Unicode is the family: man, woman, girl, and boy standing together. Many websites and tools claim it uses eight code points and twenty UTF-16 units. The truth is simpler and more precise: the plain family emoji without any skin tones is exactly seven code points and eleven UTF-16 units. Understanding why reveals how modern emoji composition really works.

The family emoji is built using a special invisible character called a Zero Width Joiner, abbreviated ZWJ. This character tells the system to combine separate emojis into one meaningful unit. For the default family, the sequence is: man emoji, ZWJ, woman emoji, ZWJ, girl emoji, ZWJ, boy emoji. That’s four people plus three invisible joiners — exactly seven code points total.

Why the Confusion Exists

The myth of eight code points and twenty units comes from testing the skinned version of the family, where each person has a skin tone modifier. When skin tones are applied, each modifier adds one more code point per person, and the total jumps significantly. But the default yellow-tone family — the one most keyboards insert — has no skin modifiers at all.

How UTF-16 Affects the Count

Every emoji character except the ZWJ requires two UTF-16 code units because they live outside the Basic Multilingual Plane. The three ZWJ characters use only one unit each. This gives us four times two plus three times one, which equals eleven UTF-16 units total. This is the correct behavior, not a mistake.

Why This Matters

Understanding the real composition helps developers make better decisions about text storage, validation, and display. It also explains why accurate tools sometimes show different numbers than outdated websites — they’re following the actual Unicode standard, not repeating old myths.

The Bottom Line

The plain family emoji is one grapheme cluster, seven code points, eleven UTF-16 units, and twenty-five UTF-8 bytes. When skin tones are added, those numbers increase — but the default version remains elegantly simple. Tools that report different values for the plain emoji are simply using the correct, current Unicode rules.

Accuracy in text measurement starts with understanding how things are really built.