You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+6-7Lines changed: 6 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -27,13 +27,13 @@ PyThaiNLP is a Python package for text processing and linguistic analysis, simil
27
27
## Capabilities
28
28
29
29
- Convenient character and word classes, like Thai consonants (```pythainlp.thai_consonants```), vowels (```pythainlp.thai_vowels```), digits (```pythainlp.thai_digits```), and stop words (```pythainlp.corpus.thai_stopwords```) -- comparable to constants like ```string.letters```, ```string.digits```, and ```string.punctuation```
30
-
- Thai word segmentation (```word_tokenize```), including subword segmentation based on Thai Character Cluster (```tcc```) and ETCC (```etcc```)
31
-
- Thai transliteration (```transliterate```) and experimental RTGS romanization (```romanize```)
30
+
- Thai word segmentation (```word_tokenize```), including subword segmentation based on Thai Character Cluster (```subword_tokenize```)
31
+
- Thai transliteration (```transliterate```)
32
32
- Thai part-of-speech taggers (```pos_tag```)
33
33
- Read out number to Thai words (```bahttext```, ```num_to_thaiword```)
34
34
- Thai collation (sort by dictionoary order) (```collate```)
- and much more - see examples in [PyThaiNLP Get Started notebook](https://github.com/PyThaiNLP/pythainlp/blob/dev/notebooks/pythainlp-get-started.ipynb).
0 commit comments