Advertisement
ivandrofly

c#: String normalization explanation

Dec 11th, 2017
324
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
C# 0.74 KB | None | 0 0
  1. U+1E09 = LATIN SMALL LETTER C WITH CEDILLA AND ACUTE
  2. U+0033 = DIGIT THREE
  3. U+2044 = FRACTION SLASH
  4. U+0034 = DIGIT FOUR
  5.  
  6. note:
  7. FormC: Compose the decomposed e.g: \u0033\u2044\u0034 => U+00BE or ¾
  8.  
  9. FormD: keeps the string without performning any normalizaation;
  10.  
  11. FormKC: U+1E09 is already nomalized so, when you use FormD it will decompose it and normalize any decomposed that follows it
  12.     e.g: U+1E09 will be decomposed to \u0063\u0327\u0301
  13.     and  \u0033\u2044\u0034 will be composed to U+00BE
  14.  
  15. FormKD: Decompose eveything U+1E09 => \u0063\u0327\u0301 and keeps \u0033\u2044\u0034 as it's (already decomposed form)
  16.  
  17.  
  18. NOTE: the character will still be displayed the same in most case... it may change though
  19. when ¾ is decompsed it changes to 3/4
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement