Skip to content

Utf8

Handling unicode characters

from tamilstring.utf8 import *
utf8 modules provide some common methods

extract letters from str

from tamilstring.utf8 import get_letters
letter = get_letters("பூ")
print(letter)
string = get_letters("பன்மொழி")
print(string)

Output:

['பூ']
['ப', 'ன்', 'மொ', 'ழி']
get_letters helps to extract letters from string

splitting letters

from tamilstring.utf8 import split_letter
letter1 = split_letter("பூ")
print(letter1)
letter2 = split_letter("க்ஷீ")
print(letter2)
letter3 = split_letter("வா")
print(letter3)
Output:
('ப்', 'ஊ')
('க்ஷ்', 'ஈ')
('வ்', 'ஆ')
split_letter method split compound letter and return as possible constant and voule

make compound

from tamilstring.utf8 import make_letter
letter1 = make_letter('ப்','ஊ')
print(letter1)
letter2 = make_letter('க்ஷ்', 'ஈ')
print(letter2)
letter3 = make_letter('வ்', 'ஆ')
print(letter3)
letter4 = make_letter('ஆ', 'வ்')
print(letter4)
Output:
'பூ'
'க்ஷீ'
'வா'
'வா'
make_letter method compain constant and voule letter into compound letter

splitting letters

from tamilstring.utf8 import split_letter
letter1 = split_letter("பூ")
print(letter1)
letter2 = split_letter("க்ஷீ")
print(letter2)
letter3 = split_letter("வா")
print(letter3)
Output:
('ப்', 'ஊ')
('க்ஷ்', 'ஈ')
('வ்', 'ஆ')
split_letter method split compound letter and return as possible constant and voule

check letters is compound, consonent, vowel

from tamilstring.utf8 import is_composite, is_consonent, is_vowel
print(is_composite("வா"))
print(is_composite("வ்"))
print(is_composite("ஆ"))

Output:

True
False
False
is_composite method helps to check given letter is composite

print(is_consonent("வா"))
print(is_consonent("வ்"))
print(is_consonent("ஆ"))

Output:

False
True
False
is_consonent method helps to check given letter is consonent

print(is_vowel("வா"))
print(is_vowel("வ்"))
print(is_vowel("ஆ"))

Output:

False
False
True
is_vowel method helps to check given letter is vowel