README.md
42 additions & 42 deletions
@@ -26,15 +26,15 @@ PyThaiNLP is a Python package for text processing and linguistic analysis, simil
 
 ## Capabilities
 
-- Convenient character and word classes, like Thai consonants (```pythainlp.thai_consonants```), vowels (```pythainlp.thai_vowels```), digits (```pythainlp.thai_digits```), and stop words (```pythainlp.corpus.thai_stopwords```) -- comparable to constants like ```string.letters```, ```string.digits```, and ```string.punctuation```
-- Thai word segmentation (```word_tokenize```), including subword segmentation based on Thai Character Cluster (```subword_tokenize```)
-- Thai transliteration (```transliterate```)
-- Thai part-of-speech taggers (```pos_tag```)
-- Read out number to Thai words (```bahttext```, ```num_to_thaiword```)
-- Thai collation (sort by dictionoary order) (```collate```)
+- Convenient character and word classes, like Thai consonants (`pythainlp.thai_consonants`), vowels (`pythainlp.thai_vowels`), digits (`pythainlp.thai_digits`), and stop words (`pythainlp.corpus.thai_stopwords`) -- comparable to constants like `string.letters`, `string.digits`, and `string.punctuation`
+- Thai word segmentation (`word_tokenize`), including subword segmentation based on Thai Character Cluster (`subword_tokenize`)
+- Thai transliteration (`transliterate`)
+- Thai part-of-speech taggers (`pos_tag`)
+- Read out number to Thai words (`bahttext`, `num_to_thaiword`)
+- Thai collation (sort by dictionoary order) (`collate`)
+- Thai spelling suggestion and correction (`spell` and `correct`)
+- Thai soundex (`soundex`) with three engines (`lk82`, `udom83`, `metasound`)
 - Thai WordNet wrapper
 - and much more - see examples in [PyThaiNLP Get Started notebook](https://github.com/PyThaiNLP/pythainlp/blob/dev/notebooks/pythainlp-get-started.ipynb).
 
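Review note on the first bullet in this hunk: the comparison with `string.digits` can be made concrete with a small sketch. It assumes `pythainlp.thai_digits` is a plain string holding the ten Thai digit characters in order; to stay runnable without the package, the sketch rebuilds an equivalent string from the Unicode range U+0E50 to U+0E59. The helper names `thai_digits_to_arabic` and `is_thai_digit_string` are hypothetical and not part of PyThaiNLP.

```python
import string

# Assumption: pythainlp.thai_digits is a string of the ten Thai digit
# characters in ascending order. It is rebuilt here from the Unicode
# range U+0E50..U+0E59 so the sketch runs without the package installed.
THAI_DIGITS = "".join(chr(cp) for cp in range(0x0E50, 0x0E5A))

# Translation table pairing each Thai digit with the ASCII digit at the
# same position, in the same spirit as using string.digits as a constant.
THAI_TO_ARABIC = str.maketrans(THAI_DIGITS, string.digits)


def thai_digits_to_arabic(text: str) -> str:
    """Replace Thai digits with ASCII digits; other characters are kept."""
    return text.translate(THAI_TO_ARABIC)


def is_thai_digit_string(text: str) -> bool:
    """Return True if text is non-empty and made only of Thai digits."""
    return bool(text) and all(ch in THAI_DIGITS for ch in text)
```

With the package installed, `THAI_DIGITS` could presumably be swapped for `pythainlp.thai_digits`, provided that constant has the shape assumed here.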
@@ -62,20 +62,20 @@ For some advanced functionalities, like word vector, extra packages may be neede
 $ pip install pythainlp[extra1,extra2,...]
 ```
 
-where ```extras``` can be
--```artagger``` (to support artagger part-of-speech tagger)*
--```deepcut``` (to support deepcut machine-learnt tokenizer)
--```icu``` (for ICU, International Components for Unicode, support in transliteration and tokenization)
--```ipa``` (for IPA, International Phonetic Alphabet, support in transliteration)
--```ml``` (to support fastai 1.0.22 ULMFiT models)
--```ner``` (for named-entity recognizer)
--```thai2fit``` (for Thai word vector)
--```thai2rom``` (for machine-learnt romanization)
--```full``` (install everything)
+where `extras` can be
+-`artagger` (to support artagger part-of-speech tagger)*
+-`deepcut` (to support deepcut machine-learnt tokenizer)
+-`icu` (for ICU, International Components for Unicode, support in transliteration and tokenization)
+-`ipa` (for IPA, International Phonetic Alphabet, support in transliteration)
+-`ml` (to support fastai 1.0.22 ULMFiT models)
+-`ner` (for named-entity recognizer)
+-`thai2fit` (for Thai word vector)
+-`thai2rom` (for machine-learnt romanization)
+-`full` (install everything)
 
-* Note: standard ```artagger``` package from PyPI will not work on Windows, please ```pip install https://github.com/wannaphongcom/artagger/tarball/master#egg=artagger``` instead.
+* Note: standard `artagger` package from PyPI will not work on Windows, please ```pip install https://github.com/wannaphongcom/artagger/tarball/master#egg=artagger``` instead.
 
-** see ```extras``` and ```extras_require``` in [```setup.py```](https://github.com/PyThaiNLP/pythainlp/blob/dev/setup.py) for package details.
+** see `extras` and `extras_require` in [`setup.py`](https://github.com/PyThaiNLP/pythainlp/blob/dev/setup.py) for package details.
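
Review note on the extras list in this hunk: the `pythainlp[extra1,extra2,...]` form can be illustrated by a short sketch that assembles the requirement string from the names listed in the README. The helper `requirement_spec` and the `KNOWN_EXTRAS` tuple are hypothetical and written only for illustration; the authoritative list is whatever `extras_require` in `setup.py` defines, which this sketch does not read.

```python
# Extras named in the README hunk above. This is an illustrative copy;
# the real source of truth is extras_require in setup.py.
KNOWN_EXTRAS = (
    "artagger", "deepcut", "icu", "ipa", "ml",
    "ner", "thai2fit", "thai2rom", "full",
)


def requirement_spec(extras, package="pythainlp"):
    """Build a requirement string such as 'pythainlp[icu,ipa]'.

    Hypothetical helper: rejects names not listed in KNOWN_EXTRAS and
    returns the bare package name when no extras are requested.
    """
    extras = list(extras)
    unknown = [name for name in extras if name not in KNOWN_EXTRAS]
    if unknown:
        raise ValueError(f"unknown extras: {', '.join(unknown)}")
    if not extras:
        return package
    return f"{package}[{','.join(extras)}]"
```

The resulting string is what would be passed to `pip install`. Quoting it (for example `pip install "pythainlp[icu,ipa]"`) avoids the square brackets being treated as a glob pattern in shells such as zsh.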