Arc Forum | It's reading the invalid sequence as � U+FFFD REPLACEMENT CHARACTER, which trans...

Arc Forum

5 points by rocketnia 3162 days ago | link | parent

It's reading the invalid sequence as � U+FFFD REPLACEMENT CHARACTER, which translates back to UTF-8 as EF BF BD (as we can see in the actual results above). The replacement character is what Unicode offers for use as a placeholder for corrupt sequences in encoded Unicode text, just like the way it's being used here.