I used the pyhton:
for m in regex.findall(r"\X", 'ल्लील्ली', regex.UNICODE):
for i in m:
print(i, i.encode('unicode-escape'))
print('--------')
the results show ल्ली has 2 Hindi characters:
ल b'\\u0932'
् b'\\u094d'
--------
ल b'\\u0932'
ी b'\\u0940'
--------
it is wrong, actually, ल्ली is one Hindi character. How to get the Hindi character(such as ल्ली) by how many Unicode composed.
In short, I want to split 'कृपयाल्ली' to 'कृ','प','या','ल्ली'