ReaderBench Model 3exp Variable Importance - shmercer/writeAlizer GitHub Wiki

Ensemble Weightings and Metric Importance

ReaderBench Model 3exp

This model used ReaderBench scores from 15 min expository writing samples from 200 students in Grades 2-5 to predict holistic writing quality on the samples (theta scores calculated from paired comparisons).

Highly correlated ReaderBench metrics (r > |.90|) were excluded during pre-processing (see section on Scoring Model Development for more details).

Algorithm Weightings in Ensemble

Abbreviations:

  • overall = ensemble model
  • pls = partial least squares regression
  • gbm = stochastic gradient boosted trees
  • svm = support vector machines
  • enet = elastic net regression
  • rf = random forest regression
  • mars = bagged multivariate adaptive regression splines
  • cube = cubist regression

The table below presents the linear weightings of each algorithm for the ensemble model.

Intercept rf mars gbm svm enet cube
-0.0156 0.0826 0.3112 0.0319 0.1360 0.3306 0.1259

Metric Importance in Each Algorithm and Ensemble

Each column sums to 100 (so values can be interpreted as % contribution to the model).

Metric overall rf mars gbm svm enet cube
Content.words 20.83 5.13 35.84 48.17 3.43 17.49 14.66
RB.AvgWdLen 4.5 1.29 10.64 1.06 0.42 1.6 4.32
RB.AvgDepsBl_compound 4.11 0.71 8.06 0.36 0.06 3.36 3.92
RB.AvgConnBl_order 3.63 0.6 6.17 0.15 0.21 4.43 1.81
RB.SenStdDevWd 3.56 1.02 5.2 0.29 1.18 4.38 2.41
RB.LangRhythmId 3.46 0.96 10.64 0.29 0.26 0 0.7
RB.TCorefChainDoc 3.34 1.93 0 4.43 2.48 7.01 3.51
RB.WdEnt 3.21 2.23 0 1.79 2.35 6.08 5.52
RB.AggPronSen_first_person 2.93 0.99 8.76 0.77 0.12 0 1.1
Sentences 2.9 1.37 0 2.38 1.47 6.99 2.01
RB.AvgSenAdjCoh_Path 2.68 1.05 0 1.28 1.09 5.93 3.92
RB.CAF 2.51 1.33 0 1.24 1.75 5.87 1.81
RB.AvgPronBl_third_person 2.49 0.76 7.52 0.04 0.79 0 0.2
RB.AvgBlScore 2.27 2.1 4.33 1.67 2.38 0 3.31
RB.AvgPronBl_second_person 2.03 1.17 0 0.64 0.95 4.73 2.01
RB.LangRhythmDiameter 1.92 0.73 2.84 0.28 0.15 2.24 1.91
RB.TActCorefChainWd 1.4 0.76 0 0.47 1.04 2.97 1.81
RB.TCorefChainBigSpan 1.31 0.44 0 1.17 1.63 2.75 1
RB.AvgUnqAdjectiveBl 1.06 1.01 0 0.23 1.7 1.95 0.9
RB.WdDiffWdStem 0.99 1.06 0 1.8 0.52 2.17 0.6
RB.AvgDepsSen_nmod 0.94 0.95 0 0.52 1.14 1.67 1.2
RB.AvgDepsBl_expl 0.89 0.79 0 0.42 0.42 1.86 1.2
RB.RdbltyDaleChall 0.86 1.25 0 0.91 0.78 1.01 2.41
RB.AvgAOEBl_InflectionPointPolynomial 0.77 0.72 0 0.27 0.7 1.85 0.1
RB.AvgConnBl_temporal_connectors 0.76 0.91 0 0.04 0.5 1.37 1.41
RB.AvgPronBl_indefinite 0.75 2.03 0 5.56 1.56 0 1.61
RB.SynDiv 0.71 0.61 0 0.28 1.04 1.39 0.5
RB.LxcDiv 0.69 1.51 0 1.57 2.14 0 1.91
RB.AvgAOASen_Bristol 0.66 0.56 0 0.14 0.31 1.56 0.5
RB.AvgDepsBl_root 0.65 0.09 0 0.04 0.04 1.95 0
RB.AvgDepsBl_nsubj 0.62 1.9 0 0.88 2.19 0 1.2
RB.AvgPronounBl 0.59 1.54 0 0.25 1.94 0 1.61
RB.AvgPrepositionBl 0.59 1.37 0 1.41 2.07 0 1.31
RB.AvgUnqNoundBl 0.49 0.83 0 0.41 1.02 0 2.21
RB.AvgDepsBl_parataxis 0.47 0.53 0 0.01 0.15 1.26 0
RB.LangRhythmCoeff 0.44 0.58 0 0.33 0.4 0.93 0.2
RB.AvgUnqPrepositionBl 0.43 0.94 0 0.2 2.05 0 0.6
RB.AvgAOASen_Bird 0.43 0.63 0 0.4 0.79 0.63 0.5
RB.WdSylCnt 0.42 0.96 0 0.54 0.18 0.5 1.1
RB.AvgDepsBl_nmod 0.42 1.01 0 0.52 1.59 0 0.9
RB.AvgChainSpan 0.41 1.04 0 0.4 1.58 0 0.8
RB.AvgDepsBl_nummod 0.41 0.7 0 0.01 0.21 1 0
RB.AvgDepsSen_expl 0.4 0.41 0 0.23 0.06 1.1 0
RB.AvgPronBl_first_person 0.39 0.71 0 0.51 0.5 0.25 1.41
RB.AvgUnqVerbBl 0.38 0.91 0 0.06 1.71 0 0.6
RB.AvgDepsBl_aux 0.37 0.59 0 0.19 0.93 0.4 0.5
RB.AvgAdverbBl 0.33 0.6 0 0.11 1.31 0 0.8
RB.AvgDepsBl_punct 0.33 1.26 0 0.3 1.17 0 0.5
RB.AvgNounSen 0.33 0.99 0 0.05 0.22 0 1.81
RB.LxcSoph 0.32 0.79 0 0.3 0.75 0 1.2
RB.CharEnt 0.31 0.49 0 1.05 1.09 0.13 0.4
RB.AvgDepsSen_cop 0.31 0.86 0 0.55 0.55 0 1.2
RB.AvgDepsBl_mark 0.31 1.04 0 0.56 1.59 0 0
RB.AvgSenBlCoh_LDA 0.3 0.82 0 0.16 1.15 0 0.6
RB.RdbltyFlesch 0.29 0.47 0 0.19 0.17 0 1.81
RB.AvgCorefChain 0.28 0.76 0 0.2 1.05 0 0.6
RB.AvgDepsBl_dobj 0.28 0.92 0 0.09 1.36 0 0.2
RB.AvgDepsBl_cop 0.27 0.59 0 0.07 0.97 0 0.7
RB.AvgDepsBl_det 0.27 0.92 0 0.09 1.36 0 0.1
RB.AvgDepsSen_mark 0.27 0.68 0 0.19 1.12 0 0.5
RB.AvgDepsBl_amod 0.26 0.58 0 0.27 1.23 0 0.3
RB.AvgDepsBl_mwe 0.25 0.8 0 0.09 0.61 0.3 0
RB.AvgUnqAdverbBl 0.25 0.6 0 0.03 1.39 0 0.1
RB.AvgPrepositionSen 0.24 0.44 0 0.16 0.91 0 0.6
RB.AvgConnBl_simple_subordinators 0.23 0.76 0 0.05 1.22 0 0
RB.AvgAOASen_Kuperman 0.23 0.53 0 0.51 0.39 0.2 0.4
RB.AvgDepsSen_compound 0.23 1.22 0 0.33 0.33 0 0.6
RB.AvgDepsBl_ccomp 0.22 0.51 0 0.05 0.54 0.2 0.3
RB.AvgUnqPronounBl 0.22 0.46 0 0 1.33 0 0
RB.FrqRhythmId 0.22 0.94 0 0.3 0.68 0.06 0.2
RB.AggPronSen_indefinite 0.22 0.76 0 0.37 0.93 0 0.2
RB.AvgDepsSen_dobj 0.21 0.98 0 0.1 0.49 0 0.5
RB.AggPronSen_second_person 0.2 0.81 0 0.23 0.64 0 0.3
RB.AvgAOADoc_Shock 0.2 0.98 0 0.42 0.82 0 0
RB.AvgConnSen_semi_coordinators 0.19 0.59 0 0.29 0 0.38 0.1
RB.AvgConnBl_addition 0.18 0.7 0 0.23 0.65 0 0.2
RB.AvgRhythmUnitStreesSyll 0.18 0.89 0 0.17 0.47 0 0.3
RB.AvgDepsSen_ccomp 0.18 0.31 0 0.22 0.94 0 0.2
RB.AvgAdverbSen 0.17 0.38 0 0.06 0.99 0 0
RB.AvgCommaSen 0.17 0.62 0 0.25 0.8 0 0
RB.AvgAOEDoc_IndexAboveThreshold.0.3. 0.17 0.72 0 0.12 0.36 0 0.5
RB.AvgConnBl_contrasts 0.17 0.46 0 0.08 0.82 0 0.2
RB.AvgConnSen_simple_subordinators 0.16 0.44 0 0.13 0.88 0 0
RB.AvgConnBl_reason_and_purpose 0.16 0.73 0 0.14 0.62 0 0.1
RB.AvgAOADoc_Bird 0.16 0.79 0 0.14 0.68 0 0
RB.AvgDepsSen_amod 0.16 0.29 0 0.25 0.5 0 0.5
RB.AvgConnBl_oppositions 0.16 0.65 0 0.05 0.6 0.02 0.2
RB.AvgAOABl_Kuperman 0.15 0.11 0 0.18 0.45 0 0.6
RB.AvgDepsSen_xcomp 0.15 0.63 0 0.06 0.73 0 0
RB.AvgPronounSen 0.14 0.62 0 0.03 0.26 0 0.4
RB.AvgDepsBl_advcl 0.14 0.21 0 0.02 0.89 0 0
RB.AvgInferenceDistChain 0.14 0.56 0 0.2 0.45 0 0.2
RB.AvgNounNmdEntBl 0.14 0.49 0 0.87 0.55 0 0
RB.AggPronSen_third_person 0.14 0.65 0 0.14 0.64 0 0
RB.WdLettStdDev 0.14 0.65 0 0.18 0.63 0 0
RB.AvgConnSen_addition 0.13 0.47 0 0.23 0.63 0 0
RB.AvgNmdEntSen 0.13 0.18 0 0.36 0.81 0 0
RB.WdDiffLemmaStem 0.12 0.71 0 0.26 0.29 0 0.1
RB.AvgDepsSen_aux 0.12 0.4 0 0.03 0.64 0 0
RB.AvgCommaBl 0.12 0.66 0 0.04 0.4 0 0.1
RB.AvgAOASen_Shock 0.12 0.28 0 0.05 0.73 0 0
RB.AvgDepsBl_acl 0.12 0.47 0 0.13 0.6 0 0
RB.AvgAOABl_Cortese 0.12 0.28 0 0.1 0.64 0 0.1
RB.AvgDepsSen_advcl 0.12 0.46 0 0.25 0.59 0 0
RB.AvgDepsBl_xcomp 0.12 0.23 0 0.09 0.78 0 0
RB.AvgConnSen_temporal_connectors 0.11 0.73 0 0.06 0.09 0.06 0.1
RB.AvgAOESen_InflectionPointPolynomial 0.11 0.28 0 0.11 0.52 0 0.1
RB.AvgDepsSen_dep 0.11 0.49 0 0.17 0.38 0 0.1
RB.AvgAOASen_Cortese 0.11 0.22 0 0.17 0.66 0 0
RB.AvgDepsSen_det 0.11 0.14 0 0.12 0.54 0 0.2
RB.AvgConnSen_reason_and_purpose 0.11 0.39 0 0.12 0.58 0 0
RB.AvgAOABl_Bristol 0.1 0.45 0 0.15 0.37 0 0.1
RB.AvgDepsBl_iobj 0.09 0.74 0 0.21 0.17 0 0
RB.AvgDepsSen_mwe 0.09 0.64 0 0.44 0.21 0 0
RB.AvgConnSen_order 0.08 0.69 0 0.67 0.01 0 0
RB.AvgConnSen_oppositions 0.08 0.63 0 0.11 0.09 0 0.1
RB.AvgConnBl_disjunctions 0.08 0.5 0 0 0.32 0 0
RB.AvgConnSen_contrasts 0.07 0.6 0 0.11 0.11 0 0
RB.AvgDepsBl_auxpass 0.07 0.56 0 0.01 0.17 0 0
RB.AvgDepsSen_neg 0.07 0.47 0 0.31 0 0 0.2
RB.AvgConnBl_conditions 0.07 0.49 0 0.11 0.23 0 0
RB.AvgDepsBl_neg 0.06 0.19 0 0.03 0.23 0 0.1
RB.AvgPronBl_interrogative 0.06 0.54 0 0.04 0.14 0 0
RB.SenAsson 0.05 0.25 0 0 0.23 0 0
RB.AvgConnSen_disjunctions 0.05 0.55 0 0.03 0.05 0 0
RB.AvgConnBl_semi_coordinators 0.04 0.16 0 0.1 0.16 0 0
RB.AvgDepsSen_nummod 0.04 0.46 0 0.03 0 0 0
RB.AvgDepsSen_acl 0.04 0.46 0 0.01 0.01 0 0
RB.AvgDepsBl_csubj 0.04 0.4 0 0.01 0.05 0 0
RB.AvgDepsBl_nsubjpass 0.04 0.25 0 0 0.16 0 0
RB.AvgDepsBl_appos 0.04 0.51 0 0 0.02 0 0
RB.AvgDepsBl_dep 0.02 0 0 0.08 0.15 0 0
RB.SenAllit 0.02 0.3 0 0 0 0 0