I want to kown how many unicode make one hindi character

Question

1 Answer

supriya · Answer 1 · 2020-08-14T11:22:08+0000

I am not quite sure if this is correct, being Finnish and not well versed in Hindi, but this would merge characters with any subsequent Unicode Mark characters:

import unicodedata
def merge_compose(s: str):
current = []
for c in s:
if current and not unicodedata.category(c).startswith("M"):
yield current
current = []
current.append(c)
if current:
yield current
for group in merge_compose("कृपयाल्ली"):
print(group, len(group), "->", "".join(group))

Output is:

['क', 'ृ'] 2 -> कृ
['प'] 1 -> प
['य', 'ा'] 2 -> या
['ल', '्'] 2 -> ल्
['ल', 'ी'] 2 -> ली

If you are a beginner and want to know more about Python the do check out the python for data science

I want to kown how many unicode make one hindi character

1 Answer

Related questions

Browse Categories