Class ProcUTF8

java.lang.Object
org.apache.jena.atlas.io.ProcUTF8

public final class ProcUTF8 extends Object
Convert UTF-8 encoded data. This class implements the "Modified UTF8" encoding rules (null -> C0 80)
  • Constructor Details

    • ProcUTF8

      public ProcUTF8()
  • Method Details

    • convert

      public static void convert(int ch, IntConsumer action)
      Convert to UTF-8, calling an action on each byte. Unicode ends at 0x10FFFF (1,114,112 code points)
       Bits
       7    U+007F      1 to 127              0xxxxxxx
       11   U+07FF      128 to 2,047          110xxxxx 10xxxxxx
       16   U+FFFF      2,048 to 65,535       1110xxxx 10xxxxxx 10xxxxxx
       21   U+1FFFFF    65,536                11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
       26   U+3FFFFFF                         111110xx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx
       31   U+7FFFFFFF                        1111110x 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx