Mercurial > hg > sfx-subgrouping
changeset 5:7848d183c7ab
fixing comments
author | DaveM |
---|---|
date | Thu, 09 Feb 2017 18:14:44 +0000 |
parents | 7ec9bd8df111 |
children | 54446ca7e6cb |
files | phase2/rfFeatureSelection.m |
diffstat | 1 files changed, 17 insertions(+), 9 deletions(-) [+] |
line wrap: on
line diff
--- a/phase2/rfFeatureSelection.m Thu Feb 09 18:11:51 2017 +0000 +++ b/phase2/rfFeatureSelection.m Thu Feb 09 18:14:44 2017 +0000 @@ -7,16 +7,23 @@ % labels is the set of labels for the data % numFeatures is the dimension of the output vector (default 5) % iterMethod is the method for which the features are cut down -% 'onePass' will simply select the top (numFeatures) features and -% report them -% 'cutX' will iteratively cut the bottom X percent of features out, -% and perform random forest feature selection on the new set, until -% the desired number of features has been returned -% 'oobErr' will do something with the out-of-bag error, and return +% * 'onePass' will simply select the top (numFeatures) features and +% report them +% * 'cutX' will iteratively cut the bottom X percent of +% features out, and perform random forest feature selection on the +% new set, until the desired number of features has been returned +% * 'oobErr' will do something with the out-of-bag error, and return % that in some way, but this has not been implemented yet. -% 'featureDeltaErr' will do something with the feature importance -% prediction error, and return that in some way, but this has not -% been implemented yet. +% * 'featureDeltaErr' will do something with the feature importance +% prediction error, and return that in some way, but this has not +% been implemented yet. The OOBPermutedVarDeltaError property is a +% numeric array of size 1-by-Nvars containing a measure of importance +% for each predictor variable (feature). For any variable, the +% measure is the increase in prediction error if the values of that +% variable are permuted across the out-of-bag observations. This +% measure is computed for every tree, then averaged over the entire +% ensemble and divided by the standard deviation over the entire +% ensemble. if(length(labels) ~= size(data,1)) error('labels and data do not match up'); @@ -57,6 +64,7 @@ features = features(1:numFeatures); elseif(strcmp(iterMethod,'featureDeltaErr')) warning('This method has not been implemented yet, using onePass to return results') + % this will use variable FI features = features(1:numFeatures); end end \ No newline at end of file