Optimize excel reading workflow by MarcoAnarmo · Pull Request #22 · IEE-TUGraz/InOutModule

MarcoAnarmo · 2025-10-30T13:42:36Z

Now the pandas ExcelFile function is using the engine calamine based on Rust which is faster than the default engine openpyxl.
Now the versioning checking is done over the already opened file, before every file was reading twice.
Now the reading of the excel files are being parallelized when building the CaseStudy object, except for the files Global_Parameters, Global_Scenarios and Power_Parameters that are read sequentially at the beginning of the workflow.

Copilot

Pull Request Overview

This PR improves Excel file reading performance by switching from openpyxl to the faster calamine engine and introducing optional parallel file reading capabilities. The changes refactor the version checking mechanism to work with open ExcelFile objects and reorganize the initialization flow to enable concurrent data loading.

Key changes:

Switched pandas ExcelFile reading from openpyxl to calamine engine for better performance
Added optional parallel reading of Excel files using ThreadPoolExecutor with configurable worker count
Refactored check_LEGOExcel_version() to operate on open ExcelFile objects instead of file paths

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

File	Description
environment.yml	Added `python-calamine=0.5.3` dependency to support the new Excel reading engine
ExcelReader.py	Refactored version checking to work with open ExcelFile objects; switched to calamine engine for reading
CaseStudy.py	Reorganized initialization to support parallel file reading with ThreadPoolExecutor; added `parallel_read` and `n_jobs` parameters

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

CaseStudy.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

MarcoAnarmo added 5 commits October 30, 2025 14:27

Add calamine engine as a package dependency

8b96ff5

Use calamine engine in pandas ExcelFile reader

ed151b7

Update version checking to use the opened files

18eca9c

Fix capacity factors bug when validating positive capacity factors

438503a

Implement parallel reading in CaseStudy

12c520c

MarcoAnarmo requested a review from FelixCAAuer October 30, 2025 13:42

MarcoAnarmo self-assigned this Oct 30, 2025

MarcoAnarmo added the enhancement New feature or request label Oct 30, 2025

MarcoAnarmo changed the title ~~Optimize excel reader~~ Optimize excel reading workflow Oct 30, 2025

FelixCAAuer changed the base branch from main to feature/markovTransition October 30, 2025 13:46

Merge branch 'feature/markovTransition' into feature/optimizeExcelReader

4641222

FelixCAAuer requested a review from Copilot October 30, 2025 13:56

Copilot AI reviewed Oct 30, 2025

View reviewed changes

CaseStudy.py Show resolved Hide resolved

CaseStudy.py Show resolved Hide resolved

CaseStudy.py Outdated Show resolved Hide resolved

CaseStudy.py Outdated Show resolved Hide resolved

FelixCAAuer and others added 2 commits October 30, 2025 15:01

Update comment in CaseStudy

11ebf1a

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Add suggestion from Copilot for futures in CaseStudy

a2ab151

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

FelixCAAuer merged commit 9967832 into feature/markovTransition Oct 30, 2025
1 check passed

FelixCAAuer deleted the feature/optimizeExcelReader branch October 30, 2025 14:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize excel reading workflow#22

Optimize excel reading workflow#22
FelixCAAuer merged 8 commits intofeature/markovTransitionfrom
feature/optimizeExcelReader

MarcoAnarmo commented Oct 30, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

MarcoAnarmo commented Oct 30, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants