Migrate some simple ddply() calls to dplyr equivalents by MichaelChirico · Pull Request #1388 · topepo/caret

MichaelChirico · 2025-06-03T23:39:24Z

Closes #1387.

Similar to #1382, #1383, #1385, #1386 -- the end goal is moving away from {plyr}. Like #1386, this bumps {dplyr} from Suggests to Imports.

There are a variety of stylistic choices around using tidyverse code in a package (whether to use %>% / |>, whether to use the .data mask to avoid the need for globalVariables(), etc). Let me know your preferences and I can make further edits.

Also like #1386, the key driver of potential differences in these between {plyr} and {dplyr} comes down to the differences between plyr::rbind.fill() and dplyr::bind_rows(). Since the structure across groups in summarize() expressions is typically consistent, I don't have much concern about that here.

The second major difference is that plyr::ddply() always sorts the output by the grouping key, whereas dplyr::summarize() sorts according to the input row order (this in turn matches the {data.table} behavior). I tried to have a look at nearby code to see if the output order matters -- if it's not clear we can ignore the row ordering, I default to assuming it's important, thus we might be able to drop more arrange() calls.

There are still a large number of ddply() calls -- this reduces the number from roughly 74 to roughly 48. The remaining ones either do more complicated things around column selection, or use a much more complicated .fun which requires more careful examination. More PR(s) to follow.

Migrate some simple ddply() calls to dplyr equivalents

4c08289

MichaelChirico mentioned this pull request Jun 3, 2025

Make ddply() summarise() in simple cases ggobi/ggally#524

Merged

MichaelChirico added 4 commits June 3, 2025 17:01

specific, limited plyr imports

ea5f6aa

also in roxygen2

d71e0aa

dlply still needed here

cb7c4e7

also rbind.fill

fdafe33

MichaelChirico mentioned this pull request Jun 5, 2025

Write a dplyr replacement for ddply() + MeanSD() #1389

Open

MichaelChirico added 2 commits June 5, 2025 11:11

A few more simple ones from tests

7b5c0e8

Use {{cols}} for robustness

b505b34

MichaelChirico mentioned this pull request Jun 5, 2025

Drop ddply from summarize.bag #1391

Open

also migrate in upSample

70afbe4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Migrate some simple ddply() calls to dplyr equivalents#1388

Migrate some simple ddply() calls to dplyr equivalents#1388
MichaelChirico wants to merge 8 commits intotopepo:masterfrom
MichaelChirico:ddply-1

MichaelChirico commented Jun 3, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

MichaelChirico commented Jun 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

MichaelChirico commented Jun 3, 2025 •

edited

Loading