ReaderBench Model 1c Variable Importance - shmercer/writeAlizer GitHub Wiki

Ensemble Weightings and Metric Importance

ReaderBench Model 1c

This model used ReaderBench scores from 7-min narrative writing samples (prompt: "I once had a magic pencil and ...") from 124 students in the spring of Grades 2-5 (Mercer et al., 2019) to predict holistic writing quality on the samples (Elo ratings calculated from paired comparisons).

Highly correlated ReaderBench metrics (|r| > .90) were excluded during pre-processing (see the section on Scoring Model Development for more details).
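As an illustration only, the exclusion of metrics with an absolute correlation above .90 could look something like the sketch below. The page does not say which member of a correlated pair was dropped, so the greedy rule here (keep the first metric encountered, drop later ones that correlate too strongly with anything already kept) is an assumption. The function names and the toy data are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a constant column has no defined correlation; treat as 0
    return cov / (sd_x * sd_y)


def drop_highly_correlated(columns, cutoff=0.90):
    """Return metric names to keep (assumed greedy rule, not from the source).

    columns: dict mapping metric name -> list of values, one per writing sample.
    A metric is kept only if |r| <= cutoff with every metric kept so far.
    """
    kept = []
    for name, values in columns.items():
        if all(abs(pearson_r(values, columns[k])) <= cutoff for k in kept):
            kept.append(name)
    return kept


# Hypothetical toy data: 'sentences' tracks 'words' almost perfectly.
toy = {
    "words": [10, 20, 30, 40, 50],
    "sentences": [1, 2, 3, 4, 6],
    "word_entropy": [3, 1, 4, 1, 5],
}
kept_metrics = drop_highly_correlated(toy)
print(kept_metrics)
```

In this toy case the second column is dropped because it correlates with the first at roughly .99, while the third is retained.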

Mercer, S. H., Keller-Margulis, M. A., Faith, E. L., Reid, E. K., & Ochs, S. (2019). The potential for automated text evaluation to improve the technical adequacy of written expression curriculum-based measurement. Learning Disability Quarterly, 42, 117-128. https://doi.org/10.1177/0731948718803296

Algorithm Weightings in Ensemble

Abbreviations:

  • all = ensemble model
  • gbm = stochastic gradient boosted trees
  • pls = partial least squares regression
  • svm = support vector machines
  • enet = elastic net regression
  • rf = random forest regression
  • mars = bagged multivariate adaptive regression splines
  • cube = cubist regression

The table below presents the linear weightings of each algorithm for the ensemble model.

Intercept gbm pls svm enet rf mars cube
-5.6692 0.1651 0.2625 0.1043 -0.0146 0.4555 0.1632 -0.0348
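To make the weightings concrete, the sketch below combines per-algorithm predictions into one ensemble score. It assumes the ensemble is a plain linear combination (intercept plus the weighted sum of the seven base predictions), which is how the "linear weightings" above are read here. The function name and the input values are hypothetical.

```python
# Weights and intercept copied from the table above.
INTERCEPT = -5.6692
WEIGHTS = {
    "gbm": 0.1651,
    "pls": 0.2625,
    "svm": 0.1043,
    "enet": -0.0146,
    "rf": 0.4555,
    "mars": 0.1632,
    "cube": -0.0348,
}


def ensemble_prediction(base_predictions):
    """Combine base-model predictions under the assumed linear form.

    base_predictions: dict mapping algorithm abbreviation -> predicted score.
    """
    missing = set(WEIGHTS) - set(base_predictions)
    if missing:
        raise KeyError(f"missing predictions for: {sorted(missing)}")
    return INTERCEPT + sum(
        WEIGHTS[name] * base_predictions[name] for name in WEIGHTS
    )


# Hypothetical check: if every base model predicted 1.0, the result is the
# intercept plus the sum of the weights.
all_ones = {name: 1.0 for name in WEIGHTS}
print(round(ensemble_prediction(all_ones), 4))
```

Under this reading, rf and pls carry the most weight, while the small negative weights for enet and cube slightly offset the other algorithms.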

Metric Importance in Each Algorithm and Ensemble

Each column sums to 100 (so values can be interpreted as % contribution to the model).
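The scaling described above can be sketched as follows. The page does not state how the raw importance values were computed for each algorithm, so this example only shows the rescaling step, assuming non-negative raw values. The function name and the metric names are hypothetical.

```python
def to_percent_contribution(raw_importance):
    """Rescale raw importance values so that they sum to 100.

    raw_importance: dict mapping metric name -> non-negative raw importance.
    """
    total = sum(raw_importance.values())
    if total <= 0:
        raise ValueError("raw importance values must sum to a positive number")
    return {name: 100.0 * value / total for name, value in raw_importance.items()}


# Hypothetical raw values for three metrics in one algorithm.
raw = {"metric_a": 3.0, "metric_b": 1.0, "metric_c": 0.0}
percent = to_percent_contribution(raw)
print(percent)
```

A metric with a value of 0 in a column contributes nothing to that algorithm under this scaling, which matches how the zeros in the table can be read.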

ReaderBench metric names can be found here. Some metric names are further abbreviated to improve the display of the table.

Metric all gbm pls svm enet rf mars cube
WdEnt 7.76 15.63 2.27 1.53 2 3.41 25.87 9.83
AvgUnqVerbBl 5.89 24.09 2.26 1.36 2.3 3.86 0 12.86
AvgBlScore 2.6 7.12 1.53 1.13 0.58 2.67 0 4.69
AvgNounSen 2.57 0.61 0.6 0.39 1.06 1.04 14.54 2.12
LxcDiv 2.18 3.58 2.18 1.33 0.72 2.54 0 3.78
AvgDepsSen_compound 2.17 1.45 1.12 0.56 0.27 1.54 7.94 2.12
WdDiffLemmaStem 2.07 0.57 0.93 0.49 0.22 1 10.51 0
AvgDepsBl_dobj 2.06 0.09 1.63 0.74 0.92 1.13 9.09 0
AvgUnqPrepositionBl 1.96 1.98 2.25 1.27 1.28 2.47 0 5.6
AvgDepsBl_punct 1.85 2.48 1.93 0.93 1.96 2.21 0 6.2
AvgDepsBl_nsubj 1.84 3.1 1.97 1.26 0.88 2.06 0 1.36
RdbltyFlesch 1.75 0.27 0.22 0.16 1.12 0.23 12.02 0.15
AvgWdLen 1.74 3.11 1.54 0.76 0.21 2.1 0 3.03
AvgDepsBl_nmod 1.65 2.75 1.94 0.92 0.42 1.77 0 2.42
AvgUnqNoundBl 1.28 0.06 0.99 0.68 1.79 0.1 6.94 2.72
AvgPronounBl 1.28 0.4 1.85 1.14 0.98 1.75 0 1.21
AvgUnqPronounBl 1.23 0.36 1.69 0.74 0.27 0.86 3 0.76
WdSylCnt 1.22 0.89 1.39 0.58 1.52 1.62 0 5.45
AvgUnqAdjectiveBl 1.19 0.24 1.37 0.54 0.34 0.6 4.3 0.76
AvgDepsBl_ccomp 1.18 0.35 0.61 0.05 0.35 0.61 5.78 0.76
AvgChainSpan 1.17 0.51 1.66 0.99 0.49 1.6 0 0.91
LexChainMaxSp 1.14 0.38 1.67 0.75 1.99 1.62 0 0
WdDiffWdStem 1.11 0.76 1.38 0.78 0.06 1.6 0 0
AvgDepsBl_det 1.1 0.29 1.71 0.72 0.82 1.52 0 0.76
WdLettStdDev 1.02 1.03 1.66 0.85 1.56 1.04 0 0.3
FrqRhythmId 1 0.65 1.44 0.59 0.95 1.32 0 0.76
AvgAOABl_Shock 0.97 0.91 1.13 0.67 0.02 1.34 0 0
AvgSenBlCoh_LDA 0.97 1.01 1.13 0.82 1.53 1.24 0 0
AvgDepsBl_mark 0.96 0.04 1.52 0.65 1.46 1.4 0 0
AvgSenAdjCoh_word2vec 0.9 0.39 1.37 0.67 0.29 1.18 0 1.06
TCorefChainDoc 0.87 0.32 1.67 0.71 2.43 0.96 0 0
AvgDepsSen_punct 0.81 0.4 1 0.68 0.26 1.2 0 0
LangRhythmCoeff 0.8 0.47 0.95 0.46 0.2 1.23 0 0
AvgPronBl_first_person 0.78 0.58 1.31 0.44 0.15 0.86 0 1.66
RdbltyDaleChall 0.78 1.44 0.96 0.38 1.08 0.76 0 1.06
AvgConnBl_sentence_link 0.74 0.09 1.31 0.45 3.5 0.92 0 0.3
AvgDepsBl_compound 0.74 1.08 0.77 0.12 1.07 0.86 0 3.48
AvgConnSen_logical_conns 0.71 0.31 0.54 0.51 0.72 1.24 0 0.76
LexChainAvgSpan 0.7 0.67 1.02 0.62 0.5 0.81 0 0
AvgUnqAdverbBl 0.69 0.01 1.46 0.57 0.78 0.75 0 0.76
CharEnt 0.66 0.48 1.35 0.78 1.51 0.49 0 0.76
AvgDepsBl_amod 0.66 0.23 1.31 0.48 1.2 0.73 0 0
AvgRhythmUnitStreesSyll 0.65 0.39 0.43 0.21 0.86 1.22 0 0
AvgAdjectiveSen 0.64 0.7 0.68 0.45 0.03 0.78 0 2.72
AvgDepsBl_advcl 0.62 0.07 1.21 0.42 1.83 0.74 0 0
AvgConnBl_simp_subords 0.61 0.04 1.25 0.45 0.28 0.69 0 0.76
AvgConnBl_reas_purp 0.61 0.57 0.98 0.27 0.17 0.74 0 0
AvgCorefChain 0.6 0.54 1.17 0.48 1.94 0.52 0 0
AvgPronBl_indefinite 0.59 0.2 1.33 0.5 0.05 0.57 0 0
AvgDepsBl_xcomp 0.58 0.36 1.14 0.37 0.09 0.63 0 0
AvgBlVoiceCoOcc 0.57 0 1.44 0.56 0.17 0.51 0 0
AvgDepsBl_aux 0.56 0.17 1.12 0.32 1.5 0.64 0 0
AvgDepsBl_mwe 0.55 0 0.93 0.23 1.62 0.79 0 0
AvgPronBl_third_person 0.55 0.13 1.23 0.39 0.24 0.53 0 1.06
AggPronSen_third_person 0.55 0.37 0.47 0.21 2.96 0.87 0 0.76
AvgAOABl_Cortese 0.55 0.56 0.64 0.7 0.31 0.68 0 0
AvgDepsBl_neg 0.54 0.08 0.71 0.15 0.29 0.91 0 0
AvgConnSen_order 0.54 0.06 0.26 0.52 0.98 1.07 0 0
AvgDepsSen_amod 0.53 0.43 0.6 0.49 0.07 0.72 0 0.61
AvgInferenceDistChain 0.53 0.52 0.46 0.31 1.67 0.81 0 0
TCorefChainBigSpan 0.52 0.04 1.24 0.35 0.13 0.53 0 0
SenAsson 0.51 0.33 0.85 0.21 1.05 0.63 0 0.15
AvgConnBl_contrasts 0.5 0.06 0.85 0.21 0 0.72 0 0
AggPronSen_indefinite 0.49 0.31 0.15 0.39 0.83 0.93 0 0.76
AvgDepsSen_xcomp 0.48 0.76 0.22 0.37 0.33 0.72 0 0
AvgConnBl_temp_conns 0.47 0.2 1.04 0.26 0.13 0.46 0 0.61
AvgDepsBl_cop 0.46 0.25 0.92 0.23 0.13 0.5 0 0
AvgAOASen_Kuperman 0.46 0.19 0.45 0.37 1.7 0.71 0 0.61
AvgDepsSen_nmod 0.46 0.2 0.11 0.46 1.37 0.74 0 4.39
AvgDepsSen_dobj 0.45 0.17 0.44 0.42 0.55 0.74 0 0
AvgDepsBl_nummod 0.44 0.07 0.35 0.02 0.46 0.88 0 0
AvgAOEBl_IndexPolyFAT.3 0.44 0.19 0.42 0.4 1.29 0.71 0 0
AvgConnBl_order 0.42 0.05 0.62 0.11 0.53 0.67 0 0
LxcSoph 0.42 0.28 0.06 0.07 0.93 0.88 0 0.76
AvgDepsSen_neg 0.42 0.06 0.25 0.02 0.84 0.91 0 0
AvgConnSen_simp_subords 0.41 0.36 0.07 0.56 0.46 0.74 0 0
SenStdDevWd 0.4 0.08 0.88 0.66 0.4 0.32 0 0
AvgConnBl_oppositions 0.4 0.02 0.95 0.24 1.8 0.37 0 0
AvgDepsSen_ccomp 0.4 0.1 0.57 0.44 0.58 0.47 0 2.12
AvgAOABl_Kuperman 0.39 0.38 0.37 0.53 0.22 0.51 0 0
AvgRhythmUnits 0.39 0.28 0.16 0.48 0.22 0.68 0 0
AvgAOEBl_InvLinRegSlo 0.37 0.13 0.67 0.33 0.23 0.41 0 0.76
TActCorefChainWd 0.37 0.29 0.51 0.26 0.8 0.49 0 0
AvgPronounSen 0.36 0.32 0.6 0.37 0.49 0.33 0 0.76
AvgDepsSen_mwe 0.36 0.01 0.33 0.03 0.68 0.73 0 0
AvgAOASen_Bristol 0.35 0.6 0.24 0.13 0.88 0.42 0 1.36
LangRhythmDiameter 0.34 0.13 0.23 0.01 0.31 0.68 0 0
AvgCommaBl 0.34 0.07 0.66 0.12 0.49 0.43 0 0
AvgAOASen_Cortese 0.34 0.33 0.69 0.45 0.27 0.24 0 0
AvgDepsSen_mark 0.34 0.11 0.22 0.54 0.34 0.58 0 0
AvgDepsSen_acl 0.33 0.04 0.71 0.13 0.93 0.38 0 0
AvgConnBl_logical_conns 0.33 0.05 0.53 0.06 0.26 0.51 0 0
WdPathCntHypernymTree 0.32 0.52 0.45 0.13 0.56 0.32 0 0
AvgConnBl_semi_coords 0.32 0.14 0.78 0.16 0.43 0.27 0 0
AvgDepsSen_cop 0.31 0.2 0.47 0.47 0.17 0.34 0 0
AvgAOESen_InfPointPoly 0.3 0.34 0.31 0.16 1.12 0.4 0 0
AvgConnSen_semi_coords 0.3 0.06 0.01 0.34 0.68 0.64 0 0
AvgAOASen_Shock 0.3 0.28 0.21 0.49 0.06 0.4 0 0.61
AvgConnSen_oppositions 0.3 0.08 0.02 0 0.39 0.73 0 0
AvgDepsSen_advmod 0.29 0.32 0.18 0.47 0.28 0.42 0 0
AvgConnBl_addition 0.28 0.18 0.64 0.1 2.79 0.2 0 0
WdPolysemyCnt 0.28 0.33 0.11 0.6 0.09 0.39 0 0
AvgAOABl_Bristol 0.28 0.19 0.22 0.38 0.49 0.43 0 0
AvgNmdEntSen 0.27 0.26 0.54 0.39 0.36 0.19 0 0
AvgAOEBl_InfPointPoly 0.27 0.2 0.35 0.23 2.11 0.3 0 0.76
AvgConnSen_temp_conns 0.27 0.35 0.12 0 0.93 0.48 0 0
WdAvgDpthHypernymTree 0.27 0.43 0.4 0.2 0.15 0.25 0 0
AvgDepsSen_dep 0.27 0.06 0.38 0.28 2.35 0.34 0 0.3
AvgAOESen_InvLinRegSlo 0.26 0.27 0.52 0.25 0.02 0.19 0 0.3
AvgAOASen_Bird 0.26 0.53 0.04 0.16 0.75 0.39 0 0.15
AvgNmdEntBl 0.26 0.17 0.48 0.04 1.53 0.3 0 0
AvgConnSen_reas_purp 0.25 0.12 0.08 0.48 1.14 0.4 0 0
AvgDepsSen_det 0.25 0.11 0.1 0.25 0.26 0.44 0 0.76
AvgAOABl_Bird 0.23 0.7 0.21 0.33 0.44 0.13 0 0
AvgAOESen_IndexPolyFAT.3 0.23 0.11 0.28 0.31 0.64 0.31 0 0
AvgDepsBl_dep 0.22 0.1 0.44 0.05 1.31 0.22 0 0.61
AvgDepsBl_nsubjpass 0.21 0.02 0.84 0.2 0.07 0 0 0
AvgAOESen_IndexAbThr.0.3. 0.19 0.27 0.12 0.39 0.05 0.22 0 0
AvgDepsSen_aux 0.18 0.39 0 0.49 2.05 0.17 0 0
AvgUnqWdBl 0.15 0 0 1.66 0 0 0 0
AvgBlLen 0.15 0 0 1.68 0 0 0 0
AvgDepsBl_acl 0.14 0.01 0.42 0.04 0.2 0.1 0 0
AvgVerbBl 0.14 0 0 1.57 0 0 0 0
Content.words 0.14 0 0 1.57 0 0 0 0
AvgWdBl 0.14 0 0 1.57 0 0 0 0
AvgNounNmdEntBl 0.14 0.29 0.05 0 1.08 0.21 0 0
Words 0.13 0 0 1.47 0 0 0 0
AvgPrepositionBl 0.11 0 0 1.22 0 0 0 0
AvgDepsSen_advcl 0.1 0.14 0.05 0.52 0.09 0.05 0 0
AvgDepsBl_case 0.09 0 0 0.96 0 0 0 0
AvgIntraBlCoh_LDA 0.09 0 0 0.99 0 0 0 0
LangRhythmId 0.09 0.01 0.19 0 0.98 0.1 0 0
AvgIntraBlCoh_Path 0.08 0 0 0.84 0 0 0 0
AvgSenAdjCoh_LDA 0.08 0 0 0.87 0 0 0 0
AvgIntraBlCoh_LSA 0.08 0 0 0.91 0 0 0 0
AvgSenBlCoh_LSA 0.07 0 0 0.74 0 0 0 0
SenScoreStDev 0.07 0 0 0.75 0 0 0 0
AvgSenAdjCoh_Path 0.07 0 0 0.76 0 0 0 0
AvgSenBlCoh_word2vec 0.07 0 0 0.79 0 0 0 0
AvgSenBlCoh_Path 0.07 0 0 0.81 0 0 0 0
AvgNounBl 0.07 0 0 0.83 0 0 0 0
Sentences 0.07 0 0 0.83 0 0 0 0
AvgSenBl 0.07 0 0 0.83 0 0 0 0
AvgSenAdjCoh_LSA 0.07 0 0 0.84 0 0 0 0
AvgSenAdjCoh_WuPalmer 0.06 0 0 0.64 0 0 0 0
SenStDevUnqWd 0.06 0 0 0.66 0 0 0 0
AvgSenBlCoh_LeackChod 0.06 0 0 0.66 0 0 0 0
AvgSenAdjCoh_LeackChod 0.06 0 0 0.66 0 0 0 0
AvgAOADoc_Shock 0.06 0 0 0.67 0 0 0 0
AvgIntraBlCoh_WuPalmer 0.06 0 0 0.68 0 0 0 0
AvgSenBlCoh_WuPalmer 0.06 0 0 0.69 0 0 0 0
AvgIntraBlCoh_LeackChod 0.06 0 0 0.69 0 0 0 0
AvgIntraBlCoh_word2vec 0.06 0 0 0.7 0 0 0 0
AvgAOADoc_Cortese 0.06 0 0 0.7 0 0 0 0
AvgVoice 0.05 0 0 0.5 0 0 0 0
AvgConnSen_sentence_link 0.05 0 0 0.51 0 0 0 0
AvgVerbSen 0.05 0 0 0.51 0 0 0 0
AvgDepsSen_nsubj 0.05 0 0 0.53 0 0 0 0
AvgAOADoc_Kuperman 0.05 0 0 0.53 0 0 0 0
AvgConnSen_conjunctions 0.05 0 0 0.54 0 0 0 0
AvgConnSen_coord_connects 0.05 0 0 0.54 0 0 0 0
AvgAdjectiveBl 0.05 0 0 0.56 0 0 0 0
AvgDepsSen_case 0.05 0 0 0.56 0 0 0 0
AvgAOEDoc_IndexPolyFAT.3 0.04 0 0 0.4 0 0 0 0
AvgPrepositionSen 0.04 0 0 0.41 0 0 0 0
AvgAdverbSen 0.04 0 0 0.43 0 0 0 0
AvgSenSyll 0.04 0 0 0.45 0 0 0 0
AvgAOEDoc_IndexAbThr.0.3. 0.04 0 0 0.45 0 0 0 0
AvgAOEBl_IndexAbThr.0.3. 0.04 0 0 0.45 0 0 0 0
AvgConnSen_addition 0.04 0 0 0.45 0 0 0 0
AvgSemDep 0.04 0 0 0.45 0 0 0 0
AvgDepsSen_cc 0.04 0 0 0.45 0 0 0 0
AvgDepsBl_advmod 0.04 0 0 0.46 0 0 0 0
AvgWdSen 0.03 0 0 0.29 0 0 0 0
AvgSenStressedSyll 0.03 0 0 0.3 0 0 0 0
AvgConnSen_contrasts 0.03 0 0 0.31 0 0 0 0
AvgSenScore 0.03 0 0 0.31 0 0 0 0
AvgAOEDoc_InvLinRegSlo 0.03 0 0 0.33 0 0 0 0
AvgAOADoc_Bird 0.03 0 0 0.33 0 0 0 0
AvgConnSen_coord_conjs 0.03 0 0 0.34 0 0 0 0
AvgDepsSen_conj 0.03 0 0 0.35 0 0 0 0
SynSoph 0.03 0 0 0.38 0 0 0 0
AvgAOADoc_Bristol 0.03 0 0 0.38 0 0 0 0
AvgAdverbBl 0.03 0 0 0.38 0 0 0 0
AvgAOESen_InvAverage 0.02 0 0 0.17 0 0 0 0
AvgConnBl_coord_connects 0.02 0 0 0.2 0 0 0 0
AvgDepsBl_auxpass 0.02 0 0 0.22 0 0 0 0
AvgAOEDoc_InfPointPoly 0.02 0 0 0.23 0 0 0 0
WdMaxDpthHypernymTree 0.02 0 0 0.23 0 0 0 0
AvgAOEDoc_InvAverage 0.02 0 0 0.23 0 0 0 0
AvgAOEBl_InvAverage 0.02 0 0 0.23 0 0 0 0
RdbltyKincaid 0.02 0 0 0.24 0 0 0 0
AvgUnqWdSen 0.02 0 0 0.25 0 0 0 0
RdbltyFog 0.02 0 0 0.27 0 0 0 0
AvgRhythmUnitSyll 0.02 0 0 0.27 0 0 0 0
AvgSenLen 0.02 0 0 0.28 0 0 0 0
AvgDepsBl_conj 0.01 0 0 0.11 0 0 0 0
AvgConnBl_conjunctions 0.01 0 0 0.13 0 0 0 0
AvgDepsBl_cc 0.01 0 0 0.14 0 0 0 0
AvgConnBl_coord_conjs 0.01 0 0 0.16 0 0 0 0
AvgUnqNmdEntBl 0 0 0 0.05 0 0 0 0