| NEWS | R Documentation |
More efficient code was borrowed from the C50 package to write the data file to a string.
A bug related to column names found by David Clifford was fixed.
A bug was fixed that occurred when splits were determined in models with a categorical variable involving more than two categories (eg. "if X4 in a, b")
Also, a bug was found in the variable importances calculations when a variable was used only in the conditions and not the model.
The 'doc' directory was moved into the "vignettes" directory
cubist uses a string buffer to capture the output files that
would normally get saved to external files. The package was failing
R CMD check only on windows because the output files were getting
garbled. s*printf functions work differently on Windows. On
Linux, they return the size that is needed to correctly print the
string. On Windows, if there isn't enough space, they return -1.
Also, work-arounds were added to the C sources because of R CMD
check issues. printf(), assert() and exist()
statements were removed or redefined.
A bug was fixed for the coef method.
The GPL version was revised to be version 3 (thanks to Ross Quinlan for the note).
The variable usage statistics were saved to a data frame in
called object$usage. The package vignette and help files were
updated accordingly.
When using cubist, if x was a vector or a single
variable, an error occurred. This was fixed.
The vignette was also expanded to show how caret can be used to tune the model and a technical error was corrected about the boosting algorithm (thanks again to Ross Quinlan).