Skip to content

co-occurence matrix #4

@KrishnaPG

Description

@KrishnaPG

In the section 2.4 of the paper,

a CUI-CUI co-occurrence matrix is constructed, ... For nonclinical text data (e.g., journal articles), it is first preprocessed (see Section 3) and chunked into fixed length windows of 10 words, and a co-occurrence is counted as the appearance of
a CUI-CUI pair in the same window. For claims data, ICD-9 codes are mapped to UMLS CUIs and a co-occurrence is counted as the number of patients in which two CUIs appear in any 30-day period. Finally, for the clinical notes, we counted a co-occurrence as two CUIs appearing in the same 30-day ‘bin’

The co-occurance matrix created on these 3 separate sources - would you be able to kindly provide access to it? It is very powerful data-structure and can lead to further investigations (we already hold UMLS license, and if required can reach out to you privately to get the download access, if it cannot be publicly released).

Thank you

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions