Releases: GoreLab/waves
Releases · GoreLab/waves
waves 0.2.6
waves 0.2.6
This release focuses on performance improvements, bug fixes, and enhanced compatibility.
Bug fixes
- Fixed
plot_spectra()no longer returns an error whendetect.outliersis set toFALSEand no alternative title is provided via thealternate.titleparameter (#29). - Fixed compatibility issue with updated
spectaclespackage (v0.5.5) that was causing data frame construction errors in model performance calculations. - Resolved temporary CRAN archive issue with the dependency
spectacles(#31).
Performance improvements
- Optimized cross-validation loops in
train_spectra()using vectorized indexing and preallocated result structures. - Optimized matrix operations in
train_spectra()for faster column selection and reduced memory overhead. - Added shared utility functions to eliminate code duplication between
train_spectra()andtest_spectra().
Internal code improvements:
- Significantly refactored
train_spectra()andtest_spectra()to reduce cyclomatic complexity (#26).train_spectra()complexity reduced from 32+ to 25test_spectra()complexity reduced to 11
- Enhanced error handling for spectacles package compatibility
- Improved code maintainability through function decomposition
Other changes
- When
return.distances = TRUE, the h.distance column is now located between metadata and spectra in the returneddata.frame.
This version is now available on CRAN!
waves 0.2.5
- Bug fix:
predict_spectra()no longer returns error when running example code (#25).
Bug fix for format_cv
- Bug fix: Different CV schemes no longer return the same results (#20).
- When
cv.schemeis set to "CV2" and "CV0" and there are no overlapping genotypes between "trial1" and "trial2",format_cv()now returnsNULL. Previously, results would be returned even if no overlap was present, resulting in incorrect CV scheme specification. format_cv()parametercv.methodis now the boolean parameterstratified.samplingfor consistency with other waves functions.plot_spectra()no longer requires a column named "unique.id".
Bug fixes for setting seed and predicting from saved models
- Bug fix:
save_model()output now works correctly withpredict_spectra(). - Bug fix:
train_spectra()no longer returns an error whenstratified.sampling = F. - In
train_spectra(), stratified random sampling of training and test sets now allows the user to provide a seed value forset.seed(). For random (non-stratified) sampling of training and test sets, seed is set to the current iteration number. - Minor documentation updates added.
Bug fix for SVM algorithm
- Bug fix:
model.method = "svmLinearandmodel.method = "svmRadialno longer return an error when used intrain_spectra()ortest_spectra().
New vignette and bug fix
- Bug fix:
test_spectra()now returns trained model correctly when only one pretreatment is specified. - Change the gap-segment derivative pretreatment to retain compatibility with prospectr. In the upcoming version of prospectr, the gapDer function only accepts odd values for the segment argument in order to properly compute the convolution filter.
- Default plot title for
plot_spectra()is nowNULL(no title) ifdetect.outliersis set toFALSE. - Column names in output list item
$summary.model.performancefromtest_spectra()now include underscores rather than periods for easier parsing. - Update website
- New vignette:
vignette("waves")
Rename functions
- Update all files to conform to the tidyverse style guide (#6).
- All functions renamed to match tidystyle. Old functions names work will be dropped after this version:
AggregateSpectra->aggregate_spectra()DoPreprocessing->pretreat_spectra()FilterSpectra->filter_spectra()FormatCV->format_cv()PlotSpectra()->plot_spectra()SaveModel()->save_model()TestModelPerformance()->test_spectra()TrainSpectralModel()->train_spectra()
- "Preprocessing" has been renamed "Pretreatment" to minimize confusion with physical preprocessing of samples prior to scanning. Arguments have been renamed to reflect these changes (
preprocessingis nowpretreatment). - Added more informative error message and documentation for random forest tune length (
tune.lengthmust be set to 5 whenmodel.algorithm == "rf"). - Additional flexibility for
plot_spectra()including color and title customization and the option to forgo filtering (#5). - Named list output for all functions to enable easier access to individual elements.
- Always return model and variable importance results with
train_spectra()andtest_spectra(). - Add variable importance for PLSR (#9).
- Enable selection of k for k-fold cross-validation within the training set. Previously, k was fixed at 5 (#10).
save_model()now automatically selects the best model if provided with multiple pretreatments.- Code simplified and streamlined to facilitate future updates.
- Export predicted values as well as performance statistics for each training iteration (#11).
wavelengthsis no longer a required argument for any of the waves functions.- The proportion of samples to include in the training set can now be selected with the argument
proportion.train. Previously, this proportion was fixed at 0.7 (#13). - Bug fix:
aggregate_spectra()now allows for aggregation by a single grouping column (#14). - The parameter
save.modelin the functionsave_model()has been renamed towrite.modelfor clarity.
Bug fixes for RF and SVM algorithms
- Bug fix: SVM Linear and SVM Radial algorithms no longer return errors in
TrainSpectralModel(). - Bug fix: Random Forest variable importance no longer returns error in
TrainSpectralModel()or whenpreprocessing = TRUEinTestModelPerformance()(#7). - Output for random Forest variable importance now includes "Pretreatment" and "Iteration" columns.
PlotSpectra()now allows for missing data in non-spectral columns of the input data frame.- waves now has an associated paper in the Plant Phenome Journal! The citation for this paper should be used if waves is used in a paper - see citation("waves") for details.
waves is now on CRAN! 🎉
v0.1.0 Change examples from dontrun to donttest