text2brain

This is the repository for the text2brain hackathon from the IMAGINE Lab. The aim of the hackathon was to train a model that takes a text description of a brain area as input and outputs a mask of that area in MNI space.

(Replace with nice overview picture)

Create the environment

Get the text2brain package:

git clone https://github.com/imaginelab/text2brain.git

From inside the cloned text2brain directory, open a terminal and create the main environment:

conda env create -f environment.yml
conda activate text2brain
pip install -e .

Data overview

Sources

All data is readily available in the dataset directory (see below). The following was used to create the dataset:

Neuroparc: https://github.com/neurodata/neuroparc
MIST: https://figshare.com/articles/dataset/MIST_A_multi-resolution_parcellation_of_functional_networks/5633638/1?file=9811081
Lobar atlas: Lobar labels created by merging Desikan–Killiany regions

Getting the data

data_generation/get_neurparc.sh

Generate language-friendly versions of atlas labels:

python data_generation/create_language_versions.py
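
As a rough illustration of what a language-friendly label could look like, the sketch below rewrites FreeSurfer-style label names into plain English. It is not the logic of create_language_versions.py: the function name humanize_label and the prefix rules are assumptions made for this example only.

```python
import re

# Hypothetical hemisphere prefixes; the real script may use different rules.
HEMISPHERES = {"lh": "left", "rh": "right", "left": "left", "right": "right"}


def humanize_label(label: str) -> str:
    """Turn a label such as 'ctx-lh-precentral' into 'left precentral cortex'."""
    tokens = [t for t in re.split(r"[-_\s]+", label.strip()) if t]
    is_cortex = False
    side = ""
    words = []
    for token in tokens:
        lower = token.lower()
        if lower == "ctx":
            is_cortex = True  # 'ctx' marks a cortical region
        elif lower in HEMISPHERES and not side:
            side = HEMISPHERES[lower]  # first hemisphere token wins
        else:
            words.append(lower)
    if is_cortex:
        words.append("cortex")
    return " ".join(([side] if side else []) + words)


print(humanize_label("ctx-lh-precentral"))
print(humanize_label("Left-Hippocampus"))
```

Labels that follow other naming conventions would need their own rules.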

Dataset directory layout

text2brain/
└── data/
    ├── atlases/
    │   ├── labels/
    │   │   ├── Anatomical-labels-csv/
    │   │   └── Anatomical-labels-csv-language/
    │   └── volumes/
    └── data_generation/
        └── datasets.csv

Create the training data for the decoder

Example on the atlas aparc+aseg_mni152.

To generate the training data, place your atlas in the folder atlases using the following naming format. You will need the atlas in MNI space (decoder/data/atlas_<my_atlas>.nii) as well as a lookup table (decoder/data/lut_<my_atlas>.txt).

First, create a CSV spreadsheet of brain regions and labels by running the notebook decoder/convert_lut_csv.ipynb. This creates the CSV in decoder/data/labels_<my_atlas>.csv.
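
For readers who prefer a script, the conversion can be sketched as follows. This is not the notebook's code. It assumes the lookup table follows the FreeSurfer color table layout (integer ID, label name, then R G B A, separated by whitespace, with # for comments), and the column names label_id and label_name are hypothetical.

```python
import csv
import io


def parse_lut(lines):
    """Yield (label_id, label_name) pairs from FreeSurfer-style LUT lines."""
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split()
        if len(fields) < 2 or not fields[0].isdigit():
            continue  # skip rows that do not start with an integer ID
        yield int(fields[0]), fields[1]


def lut_to_csv(lines, out_file):
    """Write the parsed pairs as CSV with a header row."""
    writer = csv.writer(out_file)
    writer.writerow(["label_id", "label_name"])  # hypothetical column names
    writer.writerows(parse_lut(lines))


# Example rows in the assumed layout.
example_lut = """\
# No. Label Name            R   G   B   A
0     Unknown               0   0   0   0
17    Left-Hippocampus      220 216 20  0
1024  ctx-lh-precentral     60  20  220 0
"""

buffer = io.StringIO()
lut_to_csv(example_lut.splitlines(), buffer)
print(buffer.getvalue())
```

To write to disk instead of an in-memory buffer, open the output file with newline="" as the csv module recommends.
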
Next, create the embedding vector for each label of the atlas:

python text_preprocessing/get_embeddings.py --input_file decoder/data/labels_<my_atlas>.csv --output_file decoder/data/embedding_<my_atlas>.pkl

Finally, create the file containing the embeddings and masks for each label of the atlas:

python decoder/create_data_model.py --atlas_nii decoder/data/atlas_<my_atlas>.nii --embedding_file decoder/data/embedding_<my_atlas>.pkl --output_file decoder/data/trainingdata_<my_atlas>.pkl
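
Conceptually, this step pairs each label's embedding with the binary mask of that label in the atlas volume. The sketch below shows one way this could work. It is not the code of create_data_model.py: it assumes the atlas is an integer array in which each voxel stores a region ID and that the embeddings are keyed by that ID. The record layout is hypothetical. A small synthetic array stands in for the NIfTI volume so the example runs without data; a real volume could be loaded with a library such as nibabel.

```python
import pickle

import numpy as np


def build_training_records(atlas, embeddings):
    """Pair each label's embedding with its binary mask in the atlas volume."""
    records = []
    for label_id, vector in embeddings.items():
        mask = atlas == label_id  # True where the voxel belongs to this region
        if not mask.any():
            continue  # label is listed but absent from the volume
        records.append(
            {
                "label_id": label_id,
                "embedding": np.asarray(vector, dtype=np.float32),
                "mask": mask.astype(np.uint8),
            }
        )
    return records


# Synthetic 4x4x4 "atlas" with two regions (IDs 17 and 1024) on background 0.
atlas = np.zeros((4, 4, 4), dtype=np.int32)
atlas[:2, :2, :2] = 17
atlas[2:, 2:, 2:] = 1024

# Hypothetical embeddings keyed by label ID; 99 does not occur in the volume.
embeddings = {17: [0.1, 0.2], 1024: [0.3, 0.4], 99: [0.5, 0.6]}

records = build_training_records(atlas, embeddings)
payload = pickle.dumps(records)  # same serialisation a .pkl file would hold
print(len(records), [int(r["mask"].sum()) for r in records])
```

Skipping labels with an empty mask avoids training targets that contain no voxels.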

TEST TRAINING model

Training baseline:

batch: 1
lr: 1e-3
model: v1
epochs: 45
atlas: DK
training loss at the end: 0.001212

With this configuration, the model was able to predict the location on the training set and the validation set.

Text preprocessing and data augmentation

To preprocess ROIs into a ready-to-train format, see the README.md file in /text_preprocessing. The scripts in this folder include functions to:

Extract ROIs from text, such as radiological reports.
Rename atlas labels into natural language.
Translate non-English text into English, since the text embeddings are optimized for English text.
Generate synonyms of ROI labels to augment the training set.
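
The synonym-based augmentation in the last item can be sketched as below. This is a minimal illustration, not the repository's implementation: the SYNONYMS table and the function augment_label are hypothetical, and a real table might be curated by hand or generated by a language model.

```python
# Hypothetical synonym table used only for this example.
SYNONYMS = {
    "hippocampus": ["hippocampal formation"],
    "precentral cortex": ["precentral gyrus", "primary motor cortex"],
}


def augment_label(label, synonyms=SYNONYMS):
    """Return the label plus variants with known terms swapped for synonyms."""
    variants = [label]
    for term, alternatives in synonyms.items():
        if term in label:
            variants.extend(label.replace(term, alt) for alt in alternatives)
    return variants


for text in augment_label("left precentral cortex"):
    print(text)
```

Every variant would be paired with the same mask as the original label, so the training set grows in text diversity without needing new volumes.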
