Synthesis from a saturated model based on all combinations of the predictor variables.
syn.satcat.Rd
Synthesises one variable (y
) from all possible
combinations of its precitors (x
). A bootstrap sample is created
from the original values of y
within each unique combinations of
of xp
(the syntheisied values of the grouping variable).
Arguments
- y
an original data vector of length
n
for the satcat variable.- x
a matrix (
n
xp
) with the original predictor variables fory
.- xp
a matrix (
k
xp
) with synthetic values ofx
.- proper
if
proper = TRUE
x
andy
are replaced with a bootstrap sample before synthesis, thus effectively sampling from the posterior distribution of the model, given the data.- ...
additional parameters.
Details
It is intended that the variables in x
are categorical (factor)
variables. If y
is also a categorical variable syn.satcat
will
give the same results as fitting a saturated polychotomous regression model but
will usually be much faster. syn.satcat
will fail with an error message
if previous syntheses have generated a combination of variables in xp
that was not present in x
. Use of the syn.catall
method for
grouped variables can overcome this.
Value
A list with two components:
- res
a data frame of dimension
k x p
containing the synthesised data.- fit
the cross-tabulation of the original predictor variables.
Examples
ods <- SD2011[, c("region", "sex", "agegr", "placesize")]
s1 <- syn(ods, method = c("sample", "cart", "satcat", "cart"))
#>
#> Synthesis
#> -----------
#> region sex agegr placesize
if (FALSE) {
### mostly fails because too many small categories
s2 <- syn(ods, method = c("sample", "cart", "cart", "satcat"))}