bake
was changed from all_predictors()
to everything()
.verbose
option for prep
is now defaulted to FALSE
step_dummy
was fixed that makes sure that the correct binary variables are generated despite the levels or values of the incoming factor. Also, step_dummy
now requires factor inputs.step_dummy
also has a new default naming function that works better for factors. However, there is an extra argument (ordinal
) now to the functions that can be passed to step_dummy
.step_interact
now allows for selectors (e.g. all_predictors()
or starts_with("prefix")
to be used in the interaction formula.step_YeoJohnson
gained an na.rm
option.dplyr::one_of
was added to the list of selectors.step_bs
adds B-spline basis functions.step_unorder
converts ordered factors to unordered factors.step_count
counts the number of instances that a pattern exists in a string.step_string2factor
and step_factor2string
can be used to move between encodings.step_lowerimpute
is for numeric data where the values cannot be measured below a specific value. For these cases, random uniform values are used for the truncated values.step_zv
).tidy
methods were added for recipes and many (but not all) steps.bake.recipe
, the argument newdata
is now without a default.bake
and juice
can now save the final processed data set in sparse format. Note that, as the steps are processed, a non-sparse data frame is used to store the results.First CRAN release.
prepare
to prep
per issue #59learn
has become prepare
and process
has become bake
New steps:
step_lincomb
removes variables involved in linear combinations to resolve them.step_bin2factor
)step_regex
applies a regular expression to a character or factor vector to create dummy variables.Other changes:
step_dummy
and step_interact
do a better job of respecting missing values in the data set.recipe
objects was changed so that pipes can be used to create the recipe with a formula.process.recipe
lost the role
argument in factor of a general set of selectors. If no selector is used, all the predictors are returned.