Skip to content

Commit 3f7d027

Browse files
committed
medcat workflow
1 parent bc094cc commit 3f7d027

File tree

1 file changed

+41
-0
lines changed

1 file changed

+41
-0
lines changed

medcat/ReadMe.md

Lines changed: 41 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,41 @@
1+
# Medical <img src="../data/media/medcat_logo.png" width=45>oncept Annotation Tool
2+
3+
This directory contains information on retrieving data and creating models
4+
All details regarding creating, building and running the NLP model are stored here.
5+
6+
## Locations for storing data:
7+
8+
- The [data](data) directory stores textual content.
9+
Methods for retrieving data should be stored in the [retrieve_data](search) folder.
10+
11+
- The [MedCAT models](data/medcat_models) directory holds models.
12+
13+
## Order of processing steps
14+
15+
#### [__Step 1__](medcat/1_create_model): Create the model
16+
17+
Each of the model components are found [here.](medcat/1_create_model)
18+
This directory contains all the components required to initialise a model pack.
19+
20+
All models should be stored [here.](data/medcat_models)
21+
22+
23+
#### [__Step 2__](medcat/2_train_model): Perform training
24+
25+
- [__Step 2.1__](medcat/2_train_model/1_unsupervised_training): Unsupervised training
26+
27+
The unsupervised training steps can be found within unsupervised_training folder.
28+
29+
30+
- [__Step 2.2__](medcat/2_train_model/2_supervised_training): Supervised training
31+
32+
After providing supervised labels with MedCATtrainer.
33+
The supervised training steps can be found within supervised_training folder.
34+
35+
#### [__Step 3__](medcat/3_run_model): Run model
36+
37+
Run model on your corpus of documents and write to csv/sql db.
38+
Instructions on how to do this can be found within [run_model](medcat/3_run_model/run_model.ipynb)
39+
40+
41+

0 commit comments

Comments
 (0)