Commit a67c6b3

Merge pull request #715 from PyThaiNLP/dev
Update 3.1 from dev
2 parents 1b2f39b + fc60cd3 commit a67c6b3

40 files changed: 804 additions, 38 deletions

.github/workflows/windows-test.yml

Lines changed: 59 additions & 0 deletions
@@ -0,0 +1,59 @@
+name: Windows Unit test and code coverage
+
+on:
+  push:
+    paths-ignore:
+      - '**.md'
+      - 'docs/**'
+  pull_request:
+    branches:
+      - dev
+    paths-ignore:
+      - '**.md'
+      - 'docs/**'
+
+jobs:
+  build:
+
+    runs-on: ${{ matrix.os }}
+    strategy:
+      fail-fast: false
+      matrix:
+        os: [windows-latest]
+        python-version: [3.8]
+
+    steps:
+    - uses: actions/checkout@v2
+    - uses: conda-incubator/setup-miniconda@v2
+      with:
+        python-version: ${{ matrix.python-version }}
+        auto-activate-base: true
+        auto-update-conda: true
+    - shell: powershell
+      run: |
+        conda info
+        conda list
+    - name: Install PyTorch
+      shell: powershell
+      run: |
+        pip install torch==1.8.1
+    - name: Install dependencies
+      shell: powershell
+      run: |
+        python -m pip install --disable-pip-version-check --user --upgrade pip setuptools
+        python -m pip --version
+        python -m pip install pytest coverage coveralls
+        conda install -y -c conda-forge fairseq
+        python -m pip install https://www.dropbox.com/s/o6p2sj5z50iim1e/PyICU-2.3.1-cp38-cp38-win_amd64.whl?dl=1
+        python -m pip install -r docker_requirements.txt
+        python -m pip install .[full]
+        python -m nltk.downloader omw-1.4
+        python -m pip install spacy deepcut
+    - name: Test
+      shell: powershell
+      env:
+        GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
+        COVERALLS_SERVICE_NAME: github
+      run: |
+        coverage run -m unittest discover
+        coveralls

Dockerfile

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-FROM python:3.7-slim-buster
+FROM python:3.8-slim-buster
 
 COPY . .
 

README.md

Lines changed: 3 additions & 3 deletions
@@ -20,12 +20,12 @@ PyThaiNLP เป็นไลบารีภาษาไพทอนสำหร
 
 **News**
 
-> Now, You can contact or ask any questions you encounter with the PyThaiNLP team. <a href="https://matrix.to/#/#thainlp:matrix.org" rel="noopener" target="_blank"><img src="https://matrix.to/img/matrix-badge.svg" alt="Chat on Matrix"></a>
+> Now, You can contact or ask any questions with the PyThaiNLP team. <a href="https://matrix.to/#/#thainlp:matrix.org" rel="noopener" target="_blank"><img src="https://matrix.to/img/matrix-badge.svg" alt="Chat on Matrix"></a>
 
 | Version | Description | Status |
 |:------:|:--:|:------:|
-| [3.0](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/545) |
-| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 3.1 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/643) |
+| [3.1](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/643) |
+| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 3.2 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/714) |
 
 
 ## Getting Started

README_TH.md

Lines changed: 3 additions & 3 deletions
@@ -16,12 +16,12 @@ PyThaiNLP เป็นไลบารีภาษาไพทอนสำหร
 
 **ข่าวสาร**
 
->นับตั้งแต่ PyThaiNLP 3.0 พวกเราจะยุติการสนับสนุน Python 3.6 แล้ว หากคุณจำเป็นต้องใช้ PyThaiNLP บน Python 3.6 คุณสามารถใช้ PyThaiNLP 2.3.1 ได้
+> คุณสามารถพูดคุยหรือแชทกับทีม PyThaiNLP หรือผู้สนับสนุนคนอื่น ๆ ได้ที่ <a href="https://matrix.to/#/#thainlp:matrix.org" rel="noopener" target="_blank"><img src="https://matrix.to/img/matrix-badge.svg" alt="Chat on Matrix"></a>
 
 | รุ่น | คำอธิบาย | สถานะ |
 |:------:|:--:|:------:|
-| [3.0](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/545 |
-| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 3.1 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/643) |
+| [3.1](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/643) |
+| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 3.2 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/714) |
 
 ติดตามพวกเราบน [PyThaiNLP Facebook page](https://www.facebook.com/pythainlp/) เพื่อรับข่าวสารเพิ่มเติม
 

SECURITY.md

Lines changed: 1 addition & 0 deletions
@@ -4,6 +4,7 @@
 
 | Version | Supported |
 | ------- | ------------------ |
+| 3.1.x | :white_check_mark: |
 | 3.0.x | :white_check_mark: |
 | 2.3.x | :white_check_mark: |
 | 2.2.x | :x: |

docker_requirements.txt

Lines changed: 4 additions & 1 deletion
@@ -12,7 +12,7 @@ sentencepiece==0.1.91
 ssg==0.0.8
 torch==1.8.1
 fastai==1.0.61
-transformers==4.8.2
+transformers==4.22.1
 phunspell==0.1.6
 spylls==0.1.5
 symspellpy==6.7.6
@@ -31,3 +31,6 @@ thai-nner==0.3
 spacy==2.3.*
 wunsen==0.0.3
 khanaa==0.0.6
+spacy_thai==0.7.1
+esupar==1.3.8
+ufal.chu-liu-edmonds==1.0.2

docs/api/parse.rst

Lines changed: 10 additions & 0 deletions
@@ -0,0 +1,10 @@
+.. currentmodule:: pythainlp.parse
+
+pythainlp.parse
+===============
+The :class:`pythainlp.parse` is dependency parsing for Thai.
+
+Modules
+-------
+
+.. autofunction:: dependency_parsing

docs/api/soundex.rst

Lines changed: 3 additions & 0 deletions
@@ -11,6 +11,7 @@ Modules
 .. autofunction:: lk82
 .. autofunction:: udom83
 .. autofunction:: metasound
+.. autofunction:: prayut_and_somchaip
 
 References
 ----------
@@ -23,3 +24,5 @@ References
 Master Thesis. Chulalongkorn University, Thailand.
 
 .. [#lk82] วิชิต หล่อจีระชุณห์กุล และ เจริญ คุวินทร์พันธุ์. `โปรแกรมการสืบค้นคำไทยตามเสียงอ่าน (Thai Soundex) <http://guru.sanook.com/1520/>`_.
+
+.. [#prayut_and_somchaip] Prayut Suwanvisat, Somchai Prasitjutrakul. Thai-English Cross-Language Transliterated Word Retrieval using Soundex Technique. In 1998 [cited 2022 Sep 8]. Available from: https://www.cp.eng.chula.ac.th/~somchai/spj/papers/ThaiText/ncsec98-clir.pdf

docs/api/tokenize.rst

Lines changed: 1 addition & 0 deletions
@@ -12,6 +12,7 @@ Modules
 .. autofunction:: sent_tokenize
 .. autofunction:: subword_tokenize
 .. autofunction:: word_tokenize
+.. autofunction:: word_detokenize
 .. autoclass:: Tokenizer
     :members:
 

docs/notes/installation.rst

Lines changed: 5 additions & 1 deletion
@@ -31,7 +31,11 @@ where ``extras`` can be
 - ``tltk`` (to support tltk)
 - ``textaugment`` (to support text augmentation)
 - ``oskut`` (to support OSKUT)
-- ``nlpo3`` (to support nlpo3 enging)
+- ``nlpo3`` (to support nlpo3 engine)
+- ``spacy_thai`` (to support spacy_thai engine)
+- ``esupar`` (to support esupar engine)
+- ``transformers_ud`` (to support transformers_ud engine)
+- ``dependency_parsing`` (to support dependency parsing with all engine)
 - ``full`` (install everything)
 
 For dependency details, look at `extras` variable in `setup.py <https://github.com/PyThaiNLP/pythainlp/blob/dev/setup.py>`_.
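
Reviewer sketch (not part of the commit): the extras added in this hunk are meant to be passed to pip in the ``pythainlp[extras]`` form that the surrounding documentation describes. The snippet below is a minimal, hedged illustration of how one might check which of the new dependency-parsing extras appear to be installed and build the matching requirement string. The mapping from each extra to a top-level module name is an assumption made for illustration, and `installed_extras` and `pip_requirement` are hypothetical helper names; the authoritative list is the `extras` variable in `setup.py`.

```python
from importlib.util import find_spec

# ASSUMPTION: each extra is represented by one top-level module it is expected
# to install. These names are illustrative; check `extras` in setup.py.
EXTRA_TO_MODULE = {
    "spacy_thai": "spacy_thai",
    "esupar": "esupar",
    "transformers_ud": "transformers",
}


def installed_extras(mapping=EXTRA_TO_MODULE):
    """Return the extras whose assumed module can be located by the import system."""
    return [extra for extra, module in mapping.items() if find_spec(module) is not None]


def pip_requirement(extras):
    """Build a pip requirement string for the given extras (hypothetical helper)."""
    if not extras:
        return "pythainlp"
    return "pythainlp[" + ",".join(sorted(extras)) + "]"


if __name__ == "__main__":
    present = installed_extras()
    missing = sorted(set(EXTRA_TO_MODULE) - set(present))
    print("extras that look installed:", present)
    # Pass this string to `python -m pip install` to add the missing engines.
    print("requirement for the rest:", pip_requirement(missing))
```

The check only looks for modules on the import path without importing them, so it stays cheap even when heavy packages are present. It does not confirm that a given version satisfies the pins in `docker_requirements.txt`.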
