High-performance electromagnetic simulations for nanophotonics using the Coupled Dipole Approximation
Lumina is a Rust framework for computing the linear and nonlinear optical response of nanostructured materials. It implements the Coupled Dipole Approximation (CDA) with state-of-the-art numerical methods and GPU acceleration, providing accurate cross-sections, near-field maps, and SHG/THG spectra for metallic and dielectric nanoparticles — from simple spheres to complex periodic metasurfaces on substrates.
Interactive GUI dashboard with real-time spectra, near-field heatmaps, and dipole lattice preview
- O(N) memory GMRES — on-the-fly matvec evaluates G(rᵢ,rⱼ)·xⱼ per iteration without ever assembling the 3N×3N matrix; peak memory drops from ~1.3 GB to ~4 MB at N = 3 000
- GPU budget gate — the `matrix_memory_budget` check now fires before GPU assembly, so large-N systems go directly to on-the-fly rather than silently doing an O(N²) CPU allocation
- 1D chain Ewald fix — the previous implementation zeroed all inter-cell interactions (erfc ≈ 10⁻¹⁰⁹ at the nearest image); 1D chains now use a correct direct lattice sum with ≥ 20 shells
- Extended Palik library — Al₂O₃ (sapphire), Si (crystalline, 400–1100 nm), GaAs (300–1000 nm) added alongside TiO₂ and SiO₂
- 136 tests passing
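The 1D-chain fix above comes down to how fast erfc decays. A quick check (the value of x below is illustrative — the actual Ewald splitting parameter and image distance in Lumina are assumptions) shows why an erfc-damped real-space term at the nearest image contributes essentially nothing in double precision:

```python
import math

# erfc(x) ~ exp(-x^2) / (x * sqrt(pi)) for large x. At x ≈ 15.85 it is
# already below 1e-100, so an erfc-damped inter-cell interaction term is
# numerically negligible — exactly the symptom seen in the old 1D path.
x = 15.85          # illustrative: splitting parameter × nearest-image distance
val = math.erfc(x)
assert 0.0 < val < 1e-100   # tiny but still representable in f64
```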
v0.4.0 highlights
- SHG & THG — self-consistent second- and third-harmonic generation via χ^(2) and χ^(3) tensors
- Full Ewald acceleration — erfc-damped real-space sum + 2D spectral reciprocal-space sum for periodic arrays
- Retarded substrate — retarded normal-incidence Fresnel amplitude replaces the quasi-static Δε
- Full dyadic Green's function with Filtered Coupled Dipole (FCD) near-field correction
- Clausius–Mossotti polarisability with radiative reaction correction (RRCM)
- Anisotropic 3×3 polarisability tensors for ellipsoidal particles (depolarisation factors via 16-point Gauss–Legendre quadrature)
- CoreShell geometry with arbitrary concentric layers, each with independent material
- Substrate image-dipole — retarded normal-incidence Fresnel amplitude (default) or quasi-static Δε
- Periodic lattice sums — full Ewald split: erfc-damped real-space + 2D spectral reciprocal-space sum; Bloch wavevector support
- SHG — χ^(2) tensors (isotropic surface C_∞v or custom); self-consistent solve at 2ω; intensity ∝ |E|⁴
- THG — χ^(3) tensors (isotropic bulk or custom); self-consistent solve at 3ω; intensity ∝ |E|⁶
- Far-field radiation patterns at ω, 2ω, 3ω (E-plane / H-plane polar plots)
- Circular dichroism ΔC_ext for chiral structures
- Near-field |E|² maps on arbitrary observation planes
- Validated against Mie theory: < 15% for dielectrics, < 30% for Au interband region
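As a concrete example of the polarisability prescription listed above, here is a minimal Python sketch of the Clausius–Mossotti polarisability with the radiative-reaction correction in Draine's form, α = α_CM / (1 − (2/3) i k³ α_CM). The permittivity, spacing, and wavelength values are illustrative, not Lumina's internals:

```python
# Clausius–Mossotti polarisability per dipole with radiative-reaction (RR)
# correction. Units: lengths in nm, alpha in nm^3. Illustrative sketch only.
import math

def alpha_cm(eps, d):
    """Clausius–Mossotti polarisability for a cubic lattice of spacing d."""
    return (3.0 * d**3 / (4.0 * math.pi)) * (eps - 1.0) / (eps + 2.0)

def alpha_rr(eps, d, wavelength):
    """Apply the RR correction: 1/alpha = 1/alpha_CM - (2/3) i k^3."""
    k = 2.0 * math.pi / wavelength      # wavenumber, nm^-1
    a0 = alpha_cm(eps, d)
    return a0 / (1.0 - (2.0 / 3.0) * 1j * k**3 * a0)

a = alpha_rr(eps=4.0 + 0.0j, d=2.0, wavelength=600.0)
# For lossless eps, Im(1/alpha) must equal exactly -(2/3) k^3 — the optical
# theorem for a point dipole — which is what the RR term enforces.
k = 2.0 * math.pi / 600.0
assert abs((1.0 / a).imag + (2.0 / 3.0) * k**3) < 1e-12
```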
- Direct LU decomposition (`faer`) for N ≤ 1000 dipoles
- GMRES(m=30) iterative solver for N > 1000 (agrees with direct to 10⁻¹³ relative error)
- On-the-fly matvec — computes G(rᵢ,rⱼ)·xⱼ per iteration with no matrix assembly; O(N) peak memory, runs on any laptop at N = 10 000
- Configurable `matrix_memory_budget` (default 2 GiB) — automatically selects the cached or on-the-fly path based on matrix size
- GPU-accelerated GMRES matvec via wgpu compute shaders (optional, `--features gpu`)
- Persistent GPU session buffers — matrix uploaded once per wavelength solve
- Rayon-parallel wavelength sweep across all CPU cores (GUI and CLI)
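The cached vs on-the-fly trade-off above is easy to see in miniature. The sketch below (Python, with scalar dipoles and a toy scalar kernel rather than Lumina's full 3×3 dyadic Green's function) computes the same matrix–vector product two ways — once from an assembled matrix, once on the fly — and shows they agree while the on-the-fly path never stores the N×N matrix:

```python
# Matrix-free matvec: evaluate y_i = sum_j G(r_i, r_j) x_j per GMRES
# iteration instead of assembling the full interaction matrix.
# Toy scalar kernel G = exp(ikr)/r; the memory argument is identical
# for the real dyadic kernel.
import cmath, random

def kernel(ri, rj, k=0.01):
    r = sum((a - b) ** 2 for a, b in zip(ri, rj)) ** 0.5
    return cmath.exp(1j * k * r) / r

def matvec_cached(pos, x, k=0.01):
    n = len(pos)
    G = [[kernel(pos[i], pos[j], k) if i != j else 0.0 for j in range(n)]
         for i in range(n)]                          # O(N^2) storage
    return [sum(G[i][j] * x[j] for j in range(n)) for i in range(n)]

def matvec_otf(pos, x, k=0.01):
    n = len(pos)
    return [sum(kernel(pos[i], pos[j], k) * x[j]     # O(N) storage: no matrix
                for j in range(n) if j != i) for i in range(n)]

random.seed(0)
pos = [(random.random() * 100, random.random() * 100, 0.0) for _ in range(50)]
x = [complex(random.random(), random.random()) for _ in pos]
y1, y2 = matvec_cached(pos, x), matvec_otf(pos, x)
assert all(abs(a - b) < 1e-10 for a, b in zip(y1, y2))
```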
- Johnson & Christy (1972): Au, Ag, Cu — 43 data points, 188–892 nm
- Palik handbook: TiO₂, SiO₂, Al₂O₃, Si (400–1100 nm), GaAs (300–1000 nm)
- Cubic spline interpolation for smooth ε(λ)
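The spline step can be sketched as follows — splining the real and imaginary parts of a tabulated ε(λ) separately and recombining them. The sample points below are synthetic, for illustration only (not the actual Johnson & Christy or Palik tables), and the natural-boundary-condition choice is an assumption:

```python
# Natural cubic spline through tabulated permittivity points.
import numpy as np

def cubic_spline(xk, yk):
    """Natural cubic spline through (xk, yk); returns a callable f(x)."""
    n = len(xk)
    h = np.diff(xk)
    A = np.zeros((n, n))
    rhs = np.zeros(n)
    A[0, 0] = A[-1, -1] = 1.0          # natural BCs: M[0] = M[-1] = 0
    for i in range(1, n - 1):
        A[i, i - 1], A[i, i], A[i, i + 1] = h[i - 1], 2 * (h[i - 1] + h[i]), h[i]
        rhs[i] = 6 * ((yk[i + 1] - yk[i]) / h[i] - (yk[i] - yk[i - 1]) / h[i - 1])
    M = np.linalg.solve(A, rhs)        # second derivatives at the knots
    def f(x):
        i = int(np.clip(np.searchsorted(xk, x) - 1, 0, n - 2))
        t = x - xk[i]
        b = (yk[i + 1] - yk[i]) / h[i] - h[i] * (2 * M[i] + M[i + 1]) / 6
        return yk[i] + b * t + M[i] * t**2 / 2 + (M[i + 1] - M[i]) * t**3 / (6 * h[i])
    return f

lam = np.array([400.0, 500.0, 600.0, 700.0, 800.0])    # nm (synthetic)
eps = np.array([-1.3 + 5.7j, -2.5 + 3.4j, -8.9 + 1.2j,
                -16.0 + 1.1j, -24.1 + 1.5j])           # synthetic values

f_re, f_im = cubic_spline(lam, eps.real), cubic_spline(lam, eps.imag)
eps_of = lambda l: complex(f_re(l), f_im(l))
assert abs(eps_of(600.0) - (-8.9 + 1.2j)) < 1e-9       # passes through knots
```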
- Primitives: sphere, cylinder, cuboid, ellipsoid, helix
- CoreShell: sphere/cylinder/cuboid/ellipsoid core with any number of shell layers
- `.xyz` import — treat each atom as a dipole (Å→nm auto-conversion, nearest-neighbour auto-detection)
- `.obj` mesh import — volume-filling dipole lattice via ray-casting
- Multi-object scenes: independent position, rotation (Euler XYZ), and scale per object
- Supplied example: Au Mackay icosahedron (k=6, 923 atoms, `examples/au_icosahedron_k6.xyz`)
- Rust 1.75 or later (rustup.rs)
- (GPU only) A Vulkan/Metal/DX12-capable GPU driver — no additional SDK required
```
git clone https://github.com/JonesRobM/lumina.git
cd lumina
cargo build --release                  # CPU only
cargo build --release --features gpu   # with GPU support
```

The `gpu` feature enables wgpu compute shaders for GMRES matrix-vector products. The build is otherwise identical — no CUDA toolkit and no separate driver installation. The GPU is used automatically when available; otherwise Lumina falls back to the CPU.
```
cargo run --release -p lumina-gui
# with GPU:
cargo run --release -p lumina-gui --features gpu
```

The GUI has three panels:
| Panel | Purpose |
|---|---|
| Scene | Define geometry objects, materials, file imports, and transforms |
| Simulation | Set wavelength range, environment, polarisation, substrate, and compute backend |
| Results | View spectra (C_ext/C_abs/C_sca), near-field heatmap, far-field polar plot, export CSV/JSON |
Typical workflow:
- Scene panel — add an object (`+`), choose shape type, set radius/semi-axes/etc., choose material, set dipole spacing
- Simulation panel — set wavelength range (e.g. 400–900 nm, 100 points) and environment refractive index
- Optionally enable Substrate (material + interface z-position) or GPU
- Click Run Simulation — progress bar tracks the parallel wavelength sweep
- Results panel appears automatically on completion
Create a TOML configuration file and pass it to lumina-cli:
```
cargo run --release -p lumina-cli -- run job.toml
# with GPU:
cargo run --release -p lumina-cli --features gpu -- run job.toml
```

Save as `gold_sphere.toml`:
```toml
[simulation]
wavelengths = { range = [400.0, 900.0], points = 100 }
environment_n = 1.0

[[geometry.object]]
name = "Au_sphere"
type = "sphere"
radius = 20.0
material = "Au_JC"
dipole_spacing = 2.0

[output]
directory = "./output"
save_spectra = true
```

Run:

```
cargo run --release -p lumina-cli -- run gold_sphere.toml
```

Output in `./output/spectra.csv`:
```
# Lumina v0.4.1 | object: Au_sphere | material: Au_JC | spacing: 2.0 nm
wavelength_nm,extinction_nm2,absorption_nm2,scattering_nm2
400.0,3.21e2,2.87e2,3.40e1
...
520.0,1.19e2,1.02e2,1.73e1
...
```
This example demonstrates CoreShell geometry, the retarded substrate correction, and the parallel wavelength sweep.
Save as `coreshell_on_substrate.toml`:

```toml
[simulation]
wavelengths = { range = [400.0, 900.0], points = 150 }
environment_n = 1.33   # water

[simulation.substrate]
material = "SiO2_Palik"
z_interface = -22.0    # nm — structure sits just above the glass surface

[[geometry.object]]
name = "Au-SiO2_core-shell"
type = "coreshell"
dipole_spacing = 2.0

[geometry.object.core_shape]
type = "sphere"
radius = 15.0          # nm Au core
core_material = "Au_JC"

[[geometry.object.shells]]
thickness = 5.0        # nm SiO2 shell
material = "SiO2_Palik"

[geometry.object.transform]
position = [0.0, 0.0, 0.0]

[output]
directory = "./output/coreshell"
save_spectra = true
save_near_field = true
```

Run:

```
cargo run --release -p lumina-cli -- run coreshell_on_substrate.toml
```

The simulation will:
- Discretise a 15 nm Au sphere surrounded by a 5 nm SiO₂ shell (dipole spacing 2 nm → ~2800 dipoles)
- Add the SiO₂ substrate image-dipole correction at z = −22 nm
- Solve 150 wavelengths in parallel across all CPU cores
- Write `spectra.csv` and a near-field heatmap at peak extinction
Ellipsoidal objects automatically use a per-dipole 3×3 polarisability tensor. Set semi-axes and an Euler rotation to orient the particle in the lab frame:
```toml
[[geometry.object]]
name = "Au_nanorod"
type = "ellipsoid"
semi_axes = [30.0, 8.0, 8.0]   # prolate: long axis along x
material = "Au_JC"
dipole_spacing = 2.0

[geometry.object.transform]
rotation_deg = [0.0, 45.0, 0.0]   # tilt 45° around y
```

Depolarisation factors L_x, L_y, L_z are computed via 16-point Gauss–Legendre quadrature (exact for spheres, ~1 ppm accuracy for aspect ratios up to 20:1).
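A minimal numerical sketch of those depolarisation factors (Python; using Gauss–Legendre quadrature after a variable change to map the semi-infinite integral onto [0, 1] — the exact node count and substitution used inside Lumina are assumptions):

```python
# Depolarisation factors of an ellipsoid with semi-axes (a, b, c):
#   L_i = (a*b*c/2) * Int_0^inf ds / ((s + a_i^2) * sqrt((s+a^2)(s+b^2)(s+c^2)))
# Evaluated with Gauss-Legendre quadrature after substituting s = (t/(1-t))^2,
# which maps [0, inf) to [0, 1) and removes the endpoint singularity.
import numpy as np

def depolarisation(a, b, c, order=32):
    t, w = np.polynomial.legendre.leggauss(order)
    t = 0.5 * (t + 1.0)            # shift nodes from [-1, 1] to [0, 1]
    w = 0.5 * w
    s = (t / (1.0 - t)) ** 2
    ds = 2.0 * t / (1.0 - t) ** 3  # Jacobian of the substitution
    root = np.sqrt((s + a * a) * (s + b * b) * (s + c * c))
    return [0.5 * a * b * c * np.sum(w * ds / ((s + q * q) * root))
            for q in (a, b, c)]

# Sphere: L_x = L_y = L_z = 1/3 exactly, and the factors always sum to 1.
Ls = depolarisation(10.0, 10.0, 10.0)
assert all(abs(L - 1 / 3) < 1e-6 for L in Ls)
# Prolate 30x8x8 nm rod: small L along the long axis, degenerate short axes.
Lx, Ly, Lz = depolarisation(30.0, 8.0, 8.0)
assert Lx < Ly and abs(Ly - Lz) < 1e-9 and abs(Lx + Ly + Lz - 1.0) < 1e-6
```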
Model a 1D grating or 2D metasurface using lattice sums with Bloch boundary conditions:
```toml
[simulation]
wavelengths = { range = [400.0, 900.0], points = 100 }
k_bloch = [0.005, 0.0, 0.0]   # nm⁻¹, oblique incidence

[lattice]
type = "planar"
a1 = [50.0, 0.0, 0.0]   # nm
a2 = [0.0, 50.0, 0.0]

[[geometry.object]]
name = "Au_disk"
type = "cylinder"
length = 20.0
radius = 15.0
material = "Au_JC"
dipole_spacing = 2.0
```

Assemble a dimer with precise inter-particle separation:
```toml
[[geometry.object]]
name = "particle_A"
type = "sphere"
radius = 15.0
material = "Au_JC"
dipole_spacing = 2.0

[geometry.object.transform]
position = [-20.0, 0.0, 0.0]

[[geometry.object]]
name = "particle_B"
type = "sphere"
radius = 15.0
material = "Au_JC"
dipole_spacing = 2.0

[geometry.object.transform]
position = [20.0, 0.0, 0.0]
```

Compute SHG and THG spectra from a structure with broken inversion symmetry:
```toml
[simulation]
wavelengths = { range = [400.0, 900.0], points = 100 }
environment_n = 1.0

[[geometry.object]]
name = "Au_sphere"
type = "sphere"
radius = 20.0
material = "Au_JC"
dipole_spacing = 2.0

[nonlinear]
enable_shg = true
symmetry = "isotropic_surface"
chi_zzz = [1.0, 0.0]   # nm³
chi_zxx = [0.3, 0.0]
enable_thg = true
chi3_symmetry = "isotropic_bulk"
chi3_xxxx = [0.5, 0.0]   # nm⁶
chi3_xxyy = [0.1, 0.0]

[output]
directory = "./output"
save_spectra = true
```

Outputs: `spectra.csv`, `shg_spectrum.csv` (intensity in nm⁶), `thg_spectrum.csv` (intensity in nm⁹).
```toml
[[geometry.object]]
name = "Au_icosahedron"
type = "file"
path = "examples/au_icosahedron_k6.xyz"
material = "Au_JC"
dipole_spacing = 0.0   # 0 = auto-detect nearest-neighbour distance

[species_map]
Au = "Au_JC"
```

Build and run with GPU support:

```
# GUI
cargo run --release -p lumina-gui --features gpu
# CLI
cargo run --release -p lumina-cli --features gpu -- run job.toml
```

In the GUI, toggle GPU in the Simulation panel. In the CLI TOML:
```toml
[simulation]
backend = "gpu"   # "auto" (default) | "cpu" | "gpu"
```

Implementation details:
- wgpu compute shaders (Vulkan/Metal/DX12 — no CUDA required)
- Matrix assembled on CPU (Rayon), matvec offloaded to GPU for all GMRES iterations
- Persistent session buffers: matrix written to GPU once per wavelength, reused for all ~300 GMRES iterations
- GPU uses f32 arithmetic; Arnoldi vectors and Givens rotations stay f64 on CPU
- GMRES residual floor on GPU: ~1e-7 (use `solver_tolerance = 1e-6`)
- At N ≤ 3000, GPU is slower than CPU due to transfer overhead; speedup is expected at N > 5000
The solver automatically selects the lowest-memory strategy that fits the configured budget:
For large N, the interaction matrix grows as (3N)² × 16 bytes — 14.4 GB at N = 10 000. The `matrix_memory_budget` (default 2 GiB) gates both the GPU and CPU-cached paths, so large problems automatically fall through to the on-the-fly path without ever allocating the matrix.
| N | Cached matrix | On-the-fly |
|---|---|---|
| 1 000 | 144 MB | ~1.5 MB |
| 3 000 | 1.3 GB | ~4.6 MB |
| 10 000 | 14.4 GB | ~15 MB |
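The table rows and the quoted crossover both follow directly from the (3N)² × 16-byte scaling; a quick sanity check in Python:

```python
# Complex-f64 interaction matrix: (3N)^2 entries x 16 bytes = 144 N^2 bytes.
def cached_matrix_bytes(n_dipoles):
    return (3 * n_dipoles) ** 2 * 16

def crossover_n(budget_bytes=2 * 1024**3):
    """Largest N whose cached matrix still fits the memory budget."""
    return int((budget_bytes / 144) ** 0.5)

assert cached_matrix_bytes(10_000) == 14_400_000_000   # 14.4 GB, as tabled
assert cached_matrix_bytes(3_000) == 1_296_000_000     # ~1.3 GB, as tabled
assert 3800 < crossover_n() < 3900                     # N ~ 3 860 at 2 GiB
```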
The crossover from cached to on-the-fly occurs at N ≈ 3 860 (2 GiB default budget). Override in TOML:
```toml
[simulation]
matrix_memory_gib = 8.0   # increase to cache larger matrices
# matrix_memory_gib = 0.0 # always use on-the-fly (lowest memory)
```

(The two settings are alternatives — set at most one.)

Lumina is benchmarked against analytical Mie theory:
| Material | Wavelength range | CDA error |
|---|---|---|
| Dielectric (TiO₂-like, ε = 4 + 0i) | 500–700 nm | < 15% |
| Lossy dielectric (ε = 4 + 0.5i) | 500–700 nm | < 16% |
| Au interband (FCD, d = 2 nm) | 420–510 nm | < 25% |
| Au Drude | > 550 nm | > 80%* |
*The Drude region error is a known limitation of the point-dipole staircase approximation on coarse grids. Surface averaging is planned for a future release.
Run the full test suite:
```
cargo test --workspace
```

| Version | Highlights |
|---|---|
| v0.1.0 | Linear solver, GUI, CLI, Mie validation |
| v0.1.1 | GMRES, FCD Green's function, cylinder/helix geometry |
| v0.1.2 | Spectra plots, near-field heatmaps, dipole scatter preview |
| v0.1.3 | Far-field patterns, circular dichroism, CSV/JSON export, Palik data, OBJ/XYZ import |
| v0.2.0 | GPU compute engine (wgpu), matrix-free GMRES, Rayon parallel assembly |
| v0.2.1 | Parallel wavelength sweep, stack-allocated Green's tensors, XYZ workflow, debug output |
| v0.2.2 | Persistent GPU session buffers, SceneSpec multi-object assembly |
| v0.3.0 | CoreShell geometry, anisotropic ellipsoid dipoles, substrate image-dipole, periodic lattice sums |
| v0.4.0 | Full Ewald acceleration, retarded substrate, SHG (χ^(2)), THG (χ^(3)) |
| v0.4.1 | O(N) on-the-fly matvec, GPU budget gate, 1D chain Ewald fix, extended Palik (Al₂O₃/Si/GaAs) |
Planned releases:

| Version | Highlights |
|---|---|
| v0.5.0 | DFT tensor import (VASP/Gaussian), full Sommerfeld substrate, MPI |
| v1.0.0 | BEM solver, T-matrix solver, FDTD, CUDA/Metal backends |
Lumina is a Rust workspace with six crates:
```
lumina/
├── lumina-core      # CDA solver, Green's functions, cross-sections, substrate, Ewald
├── lumina-compute   # ComputeBackend trait: CPU (Rayon) and GPU (wgpu)
├── lumina-geometry  # Scene assembly, primitives, discretisation, OBJ/XYZ parsers
├── lumina-materials # MaterialProvider trait: J&C, Palik, spline interpolation
├── lumina-gui       # egui/eframe interactive dashboard
└── lumina-cli       # TOML-based batch processing
```
Key abstractions:
- `OpticalSolver` — CDA today; BEM/FDTD/T-matrix plug in via the same trait
- `ComputeBackend` — CPU (Rayon) default; GPU (wgpu) feature-gated
- `MaterialProvider` — returns ε(λ) for any material source
- `SceneSpec` — unified GUI/CLI scene description, serialisable to/from TOML
Full documentation is built with mdBook:
```
cd docs && mdbook serve --open
```

Key sections:
- Theory — CDA formulation, polarisability prescriptions, anisotropic dipoles
- Periodic Structures — Ewald summation, Bloch conditions
- Geometry & Shapes — all shape types, CoreShell, transforms
- Configuration — full TOML reference
- Validation — Mie benchmarks and accuracy analysis
- Roadmap — version history and future plans
Contributions are welcome. See CONTRIBUTING.md for guidelines.
Priority areas:
- Surface-averaging methods for improved metallic convergence
- DFT polarisability import (VASP WAVEDER, Gaussian)
- Full Sommerfeld substrate (k∥-dependent Fresnel coefficients)
- GPU matrix assembly shader (currently CPU only)
- 1D Bessel spectral sum (K₀/K₁) for < 1% on-axis chain accuracy
If you use Lumina in your research, please cite:
```bibtex
@software{lumina2026,
  author  = {Jones, Robert M.},
  title   = {Lumina: Coupled Dipole Approximation for Nanophotonics},
  year    = {2026},
  url     = {https://github.com/jonesrobm/lumina},
  version = {0.4.1}
}
```

Key references:
- Purcell, E. M. & Pennypacker, C. R., Astrophys. J. 186, 705 (1973) — original DDA
- Draine, B. T. & Flatau, P. J., J. Opt. Soc. Am. A 11, 1491 (1994) — DDSCAT
- Johnson, P. B. & Christy, R. W., Phys. Rev. B 6, 4370 (1972) — Au/Ag/Cu optical data
Apache License 2.0. See LICENSE for details.
- The King's College London Photonics & Nanotechnology Group
- DDSCAT (Draine & Flatau) and ADDA (Yurkin & Hoekstra) for pioneering the CDA/DDA method
- The Rust community for exceptional tooling and the `egui`, `wgpu`, `rayon`, and `faer` crates
- Johnson & Christy for the gold standard in optical constants
Maintained by Robert M. Jones
