Produces a data table with the same distributions and correlations as an existing data table Only returns numeric columns and simulates all numeric variables from a continuous normal distribution (for now).
sim_df( data, n = 100, within = c(), between = c(), id = "id", dv = "value", empirical = FALSE, long = FALSE, seed = NULL, missing = FALSE )
data | the existing tbl |
---|---|
n | the number of samples to return per group |
within | a list of the within-subject factor columns (if long format) |
between | a list of the between-subject factor columns |
id | the names of the column(s) for grouping observations |
dv | the name of the DV (value) column |
empirical | Should the returned data have these exact parameters? (versus be sampled from a population with these parameters) |
long | whether to return the data table in long format |
seed | DEPRECATED use set.seed() instead before running this function |
missing | simulate missing data? |
a tbl
See vignette("sim_df", package = "faux")
for details.
iris100 <- sim_df(iris, 100) iris_species <- sim_df(iris, 100, between = "Species")