This repository provides code for evaluating text simplification using various metrics.
The dataset swipe_generator_data.json originates from Salesforce's Simplification repository and is licensed under the Apache License 2.0. This repository does not redistribute the original dataset but processes it into a different format for evaluation purposes.
data_collection.ipynb: Convertsswipe_generator_data.jsonintoinput.csv, which serves as the input for different evaluation metrics.- Evaluation Notebooks:
eval_with_len_salsa.ipynb: Evaluates using LEN-SALSA.eval_with_questeval.ipynb: Evaluates using QuestEval.eval_with_referee.ipynb: Evaluates using Referee.eval_with_summac.ipynb: Evaluates using SummaC.
merged_data.ipynb: Merges the evaluation results from different metrics into a single dataset and and generating the finalevaluation_results.csv.data/: Contains all generated evaluation inputs and outputs.
Each evaluation notebook includes the necessary package installations. Ensure that you install the required dependencies before running the notebooks.
- Run
data_collection.ipynbto generateinput.csv. - Execute any of the evaluation notebooks to obtain metric-based evaluations.
- Use
merged_data.ipynbto combine all evaluation results.
This project is licensed under the Apache License 2.0. See the LICENSE file for details.
The dataset used in this project is sourced from Salesforce’s Simplification repository and follows the Apache License 2.0.