feat(examples): Add top-k and per-class eval metrics to quickstart-pytorch #6638

SalimELMARDI wants to merge 6 commits into flwrlabs:main
Conversation
Pull request overview
This PR enhances the examples/quickstart-pytorch evaluation reporting to include top-3 accuracy and per-class (CIFAR-10) top-1 accuracies, exposing these metrics from both client-side evaluation and centralized server-side evaluation while keeping existing metric keys.
Changes:
- Extended `test(...)` to compute top-3 accuracy and per-class top-1 accuracies alongside the existing loss/top-1 accuracy.
- Updated the client and server apps to emit the additional metrics under new, backward-compatible metric keys.
- Documented the newly reported metrics in the example README.
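The actual `test(...)` diff is not shown in this conversation, but the metrics it describes can be sketched from the summary above. The following is a hypothetical implementation (the function name and signature are my own, not from the PR), assuming logits of shape `(N, num_classes)` as in a CIFAR-10 batch:

```python
import torch


def topk_and_per_class_accuracy(logits, labels, k=3, num_classes=10):
    """Top-k accuracy plus per-class top-1 accuracy for one batch of logits."""
    # Top-k: a sample counts as correct if the true label appears among
    # the k highest-scoring classes.
    topk_preds = logits.topk(k, dim=1).indices  # shape (N, k)
    topk_acc = (topk_preds == labels.unsqueeze(1)).any(dim=1).float().mean().item()

    # Per-class top-1: accuracy restricted to the samples of each class.
    top1_preds = logits.argmax(dim=1)
    per_class = {}
    for c in range(num_classes):
        mask = labels == c
        # NaN marks a class with no samples in this evaluation set.
        per_class[c] = (
            (top1_preds[mask] == c).float().mean().item()
            if mask.any()
            else float("nan")
        )
    return topk_acc, per_class
```

In a real `test()` loop these counts would be accumulated across batches rather than computed on a single one, but the per-sample logic is the same.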
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 5 comments.
| File | Description |
|---|---|
| examples/quickstart-pytorch/pytorchexample/task.py | Computes top-3 and per-class accuracies and returns a metrics dict from test(). |
| examples/quickstart-pytorch/pytorchexample/client_app.py | Adds client-side metric keys for top-3 and per-class accuracies. |
| examples/quickstart-pytorch/pytorchexample/server_app.py | Adds centralized metric keys for top-3 and per-class accuracies. |
| examples/quickstart-pytorch/README.md | Documents the expanded set of reported metrics (but front matter formatting changed). |
Hello @SalimELMARDI, thanks for opening this PR. I agree that it's a useful enhancement to our example PyTorch app, but I'm unsure we should merge it in this form because we intentionally keep our quickstart apps simple: just basic training and eval. Maybe it's better to implement this in our
@chongshenng Thanks for the feedback, that makes sense. I'll open a new PR with this enhancement in
Superseded by #6713
Issue
Description
The `quickstart-pytorch` example only reported basic evaluation metrics (loss and top-1 accuracy). That made it harder to inspect ranking quality and class-level performance during federated runs.
Related issues/PRs
N/A
Proposal
Explanation
This PR extends evaluation reporting in `examples/quickstart-pytorch` while keeping existing metrics backward-compatible.

Changes:
- `pytorchexample/task.py`: extends `test(...)` to compute top-3 accuracy and per-class top-1 accuracies (`class_accuracy_0` ... `class_accuracy_9`).
- `pytorchexample/client_app.py`: reports `eval_acc_top3` and `eval_acc_class_0` ... `eval_acc_class_9` alongside the existing `eval_loss` and `eval_acc`.
- `pytorchexample/server_app.py`: reports `accuracy_top3` and `accuracy_class_0` ... `accuracy_class_9` alongside the existing `loss` and `accuracy`.
- `examples/quickstart-pytorch/README.md`: documents the newly reported metrics.

Validation:
```shell
flwr run . --stream --run-config "num-server-rounds=1 batch-size=128 fraction-evaluate=0.1"
```

Checklist

(#contributions)

Any other comments?
No API-breaking changes. Existing metric keys were kept for compatibility.
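As a sketch of that backward-compatible key layout (the key names come from the PR description; the helper function itself is hypothetical), the client-side metrics dict might be assembled like this:

```python
def build_client_metrics(eval_loss, eval_acc, eval_acc_top3, per_class_acc):
    """Assemble the client metrics dict: existing keys unchanged, new keys added."""
    metrics = {
        "eval_loss": eval_loss,  # pre-existing key, kept as-is
        "eval_acc": eval_acc,    # pre-existing key, kept as-is
        "eval_acc_top3": eval_acc_top3,
    }
    # One flat scalar key per class, since Flower metrics are flat key-value pairs.
    for c, acc in sorted(per_class_acc.items()):
        metrics[f"eval_acc_class_{c}"] = acc
    return metrics
```

Because the original keys keep their names and meanings, any existing aggregation or logging that reads `eval_loss`/`eval_acc` continues to work unchanged.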