Explore Courses Blog Tutorials Interview Questions
0 votes
in Azure by (45.3k points)

I'm new to learning Unicode, and not sure how much I have to learn based on my ASCII background, but I'm reading the C# spec on rules for identifiers to determine what chars are permitted within Azure Table (which is directly based on the C# spec).

Where can I find a list of Unicode characters that fall into these categories:

  • letter-character: A Unicode character of classes Lu, Ll, Lt, Lm, Lo, or Nl
  • combining-character: A Unicode character of classes Mn or Mc
  • decimal-digit-character: A Unicode character of the class Nd
  • connecting-character: A Unicode character of the class Pc
  • formatting-character: A Unicode character of the class Cf

1 Answer

0 votes
by (16.8k points)
edited by

You can retrieve this information in an automated fashion from the official Unicode data file, UnicodeData.txt, which is published here:

This is a file with semicolon-separated values in each line. The third column tells you the character class of each character.

Want to learn Azure from scratch! Have a look at this video on Azure provided by Intellipaat:

The benefit of this is that you can get the character name for each character, so you have a better idea of what it is than by just looking at the character itself.

Browse Categories