SPA

Selective Preference Aggregation

├── data         # processed datasets                `data_dir`
├── spa          # source code                       `pkg_dir`
├── scripts      # scripts that run source code                            
├── results      # results                           `results_dir`
├── reporting    # source code for reporting
├── reports      # reports                           `reports_dir`

Installation

You can install the package using pip:

pip install selectiverank

Run without cloning (via `python -m`)

After pip install selectiverank, invoke the installed modules directly.

Quick commands

python -m spa.scripts.create_datasets
python -m spa.scripts.dev_spa
python -m spa.scripts.aggregate_base_results
python -m spa.scripts.combine_all_results

Examples

Create datasets

python -m spa.scripts.create_datasets --dataset movie --data-dir ./data

Run SPA experiments

python -m spa.scripts.dev_spa --dataset movie --data-dir ./data --results-dir ./results --seed 2338

Aggregate base results

python -m spa.scripts.aggregate_base_results --results-dir ./results

Combine all results

python -m spa.scripts.combine_all_results --results-dir ./results

1. Add Your Dataset

Location: Create a subfolder for your dataset within the data/ directory (e.g., data/movie/).
File Naming: Place your data file(s) in this subfolder, named according to the pattern: {dataset_name}_{type}.csv (e.g., movie_ratings.csv, movie_pairwise.csv).
Data Format: Ensure your CSV file matches one of the following structures based on the {type} in your filename:
- For ranking or rating types:
  - Headers (First Row): User ids.
  - First Column: Item ids.
  - Cell A1 (Top-Left): Must contain the exact text item_name.
  - Data Cells: Preference values (rating or rank) given by the user (column) for the item (row)
  Example (movie_ratings.csv):
```
item_name,user101,user102,user103
ItemA,5,4,3
ItemB,3,4,5
ItemC,2,1,4
ItemD,1,2,2
```
- For pairwise comparison types:
  - Headers (First Row): Must be exactly judge_id, item_id_1, item_id_2, pref.
  - Rows: Each row represents a single comparison made by a judge_id.
  - pref Column: Indicates the user preference:
    - 1: item_id_1 is preferred over item_id_2.
    - -1: item_id_2 is preferred over item_id_1.
    - 0: Represents a tie or indifference.
  Example (movie_pairwise.csv):
```
judge_id,item_id_1,item_id_2,pref
judgeA,itemX,itemY,1
judgeA,itemY,itemZ,-1
judgeB,itemX,itemY,0
judgeB,itemX,itemZ,1
judgeC,itemY,itemZ,1
```

2. Configure and Run Experiments

Update Dataset Creation Script:
- Open the file scripts/create_datasets.py.
- Find the settings dictionary.
- Add the {dataset_name} string (e.g., "movie") from Step 1 to the data_names list.
```
settings = {
    "data_names": ["movie"], 
    ...
}
```
- Execute the script
Update Main Experiment Script:
- Open the file scripts/dev_spa.py.
- Find the settings dictionary within this script.
- Add the same {dataset_name} string to the data_names list (or similar configuration entry) in this file as well.
```
settings = {
    "data_names": ["movie"], # Added "movie"
    "seed": 2338,
    ...
}
```
- Execute dev_spa.py

3. Generating and Viewing Results

Aggregate Base Results:
- Configure settings within scripts/aggregate_base_results.py.
- Execute the script .
Combine All Results:
- Configure settings within scripts/combine_all_results.py.
- Execute the script.
Locate Output CSV:
- The final, combined results are saved as a CSV file within the results/ directory (results_dir).
- The filename includes a timestamp for uniqueness.
(Optional) Generate LaTeX Table:
- Modify and run the R script located at scripts/create_big_table.R. Update the script to point to the correct input CSV file from the previous step.
- This will output a .tex file.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
data		data
reporting		reporting
scripts		scripts
spa		spa
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SPA

Installation

Run without cloning (via `python -m`)

Quick commands

Examples

1. Add Your Dataset

2. Configure and Run Experiments

3. Generating and Viewing Results

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Languages

License

ustunb/spa

Folders and files

Latest commit

History

Repository files navigation

SPA

Installation

Run without cloning (via python -m)

Quick commands

Examples

1. Add Your Dataset

2. Configure and Run Experiments

3. Generating and Viewing Results

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Languages

Run without cloning (via `python -m`)

Packages