Synthesis from a saturated model based on all combinations of the predictor variables.
syn.satcat.RdSynthesises one variable (y) from all possible
combinations of its precitors (x). A bootstrap sample is created
from the original values of y within each unique combinations of
of xp (the syntheisied values of the grouping variable).
Arguments
- y
an original data vector of length
nfor the satcat variable.- x
a matrix (
nxp) with the original predictor variables fory.- xp
a matrix (
kxp) with synthetic values ofx.- proper
if
proper = TRUExandyare replaced with a bootstrap sample before synthesis, thus effectively sampling from the posterior distribution of the model, given the data.- ...
additional parameters.
Details
It is intended that the variables in x are categorical (factor)
variables. If y is also a categorical variable syn.satcat will
give the same results as fitting a saturated polychotomous regression model but
will usually be much faster. syn.satcat will fail with an error message
if previous syntheses have generated a combination of variables in xp
that was not present in x. Use of the syn.catall method for
grouped variables can overcome this.
Value
A list with two components:
- res
a data frame of dimension
k x pcontaining the synthesised data.- fit
the cross-tabulation of the original predictor variables.
Examples
ods <- SD2011[, c("region", "sex", "agegr", "placesize")]
s1 <- syn(ods, method = c("sample", "cart", "satcat", "cart"))
#>
#> Synthesis
#> -----------
#> region sex agegr placesize
if (FALSE) {
### mostly fails because too many small categories
s2 <- syn(ods, method = c("sample", "cart", "cart", "satcat"))}