We provide four social media knowledge graphs (KGs) capturing online epilepsy discourse from the following platforms:
- X (formerly Twitter) Download
- Instagram Download
- Reddit (r/Epilepsy) Download
- Epilepsy Foundation Forums (EFA) Download
The methodology and results are described in the following paper:
Selecting Focused Digital Cohorts from Social Media Using the Metric Backbone of Biomedical Knowledge Graphs
arXiv preprint arXiv:2405.07072
The image KGOverview.png provides an overview of the four social media KGs and their corresponding metric backbones.
- Left Side: The full knowledge graph for each data source, where all edges are considered.
- Right Side: The metric backbone subgraph, where only edges with the attribute
'is_metric' = Trueare retained.
You can generate these visualizations using the provided KGs.
Due to copyright restrictions, we redacted specific term names sourced from MedDRA (v15.0) and DrugBank (v.5.1.0) but provided the source ID to allow retrieval for users with appropriate access.
For a node with index 9169, the attributes are as follows:
{
"parent": "Redacted",
"type": "Drug",
"parent original id": "DB11266",
"original id source": "DrugBank"
}- The
parentattribute is a high-level term we used to build KG. - The
parent original idcorresponds to the DrugBank ID (DB11266). - Users with access to DrugBank can map this ID to the actual term name.
For sources that do not have copyright restrictions, we provide the full term names directly.
To facilitate reproducibility, we intentionally retained the following key nodes in the provided KGs:
EpilepsyMigraineDepressionAnxietyTopiramateDecreased Appetite
If you encounter any issues using the provided GraphML files to generate knowledge graphs, feel free to reach out: zguo29@binghamton.edu
