CohMetrix Model 1c Variable Importance - shmercer/writeAlizer GitHub Wiki
Ensemble Weightings and Metric Importance
Coh-Metrix Model 1c
This model used Coh-Metrix scores from 7 min narrative writing samples ("I once had a magic pencil and ...") from 124 students in the spring of Grades 2-5 (Mercer et al., 2019) to predict holistic writing quality on the samples (elo ratings calculated from paired comparisons).
Highly correlated Coh-Metrix metrics (r > |.90|) were excluded during pre-processing (see section on Scoring Model Development for more details).
Mercer, S. H., Keller-Margulis, M. A., Faith, E. L., Reid, E. K., & Ochs, S. (2019). The potential for automated text evaluation to improve the technical adequacy of written expression curriculum-based measurement. Learning Disability Quarterly, 42, 117-128. https://doi.org/10.1177/0731948718803296
Algorithm Weightings in Ensemble
Abbreviations:
- all = ensemble model
- gbm = stochastic gradient boosted trees
- pls = partial least squares regression
- svm = support vector machines
- enet = elastic net regression
- rf = random forest regression
- mars = bagged multivariate adaptive regression splines
- cube = cubist regression
The table below presents the linear weightings of each algorithm for the ensemble model.
Intercept | gbm | pls | svm | enet | rf | mars | cube |
---|---|---|---|---|---|---|---|
-4.8423 | 0.5169 | 0.1348 | 0.6009 | -0.2375 | -0.4134 | 0.4001 | -0.0098 |
Metric Importance in Each Algorithm and Ensemble
Each column sums to 100 (so values can be interpreted as % contribution to the model).
Detailed information on Coh-Metrix abbreviations and indices is available here.
Metric | all | gbm | pls | svm | enet | rf | mars | cube |
---|---|---|---|---|---|---|---|---|
DESWC | 20.66 | 36.45 | 5.76 | 2.78 | 21.32 | 6.39 | 47.19 | 16.14 |
WRDVERB | 5.51 | 3.32 | 1.88 | 1.07 | 0.58 | 2.06 | 22.99 | 2.79 |
WRDHYPn | 4.27 | 2.98 | 2.38 | 1.13 | 3.13 | 1.75 | 14.74 | 1.28 |
DESSLd | 3.23 | 1.17 | 0.87 | 1.74 | 3.05 | 2.13 | 10.33 | 0 |
PCNARp | 2.65 | 2.58 | 2.72 | 1.77 | 10.29 | 2.17 | 0 | 4.99 |
DESPL | 2.51 | 2.16 | 3.29 | 1.6 | 2.76 | 2.2 | 4.27 | 2.56 |
DESWLltd | 1.8 | 4.36 | 1.75 | 1.09 | 1.39 | 1.51 | 0 | 6.97 |
WRDNOUN | 1.59 | 1.69 | 2.13 | 0.8 | 5.92 | 1.42 | 0 | 5.81 |
WRDFRQmc | 1.59 | 2.27 | 2.04 | 1.46 | 2.86 | 1.51 | 0 | 5.34 |
DESWLlt | 1.55 | 0.7 | 2.05 | 0.77 | 6.89 | 1.97 | 0 | 3.72 |
CRFANPa | 1.45 | 1.93 | 1.23 | 1.64 | 0.05 | 2.84 | 0 | 0 |
LSASS1d | 1.43 | 2.3 | 1.5 | 1.82 | 0.08 | 1.92 | 0 | 0 |
LDMTLD | 1.43 | 2.6 | 1.79 | 1.39 | 0 | 2.14 | 0 | 0 |
CRFCWOa | 1.4 | 2.06 | 1.9 | 1.74 | 0 | 2.09 | 0 | 0 |
WRDHYPv | 1.31 | 1.51 | 2.37 | 0.97 | 2.75 | 1.62 | 0 | 1.63 |
PCDCz | 1.31 | 2.21 | 1.05 | 1.47 | 0 | 2 | 0 | 1.97 |
WRDPRP3s | 1.28 | 1.96 | 1.36 | 0.66 | 3.27 | 1.43 | 0 | 0.81 |
SMCAUSwn | 1.26 | 1.16 | 2.26 | 1.16 | 3.45 | 1.13 | 0 | 2.9 |
SMCAUSvp | 1.25 | 2.1 | 0.97 | 1.55 | 0 | 1.8 | 0 | 0 |
SYNSTRUTa | 1.17 | 1.47 | 1.74 | 1.81 | 0 | 1.38 | 0 | 3.72 |
PCDCp | 1.16 | 0 | 1.81 | 1.28 | 3.29 | 2.05 | 0 | 4.53 |
LSAGN | 1.12 | 0.71 | 1.72 | 1.81 | 0 | 2.08 | 0 | 2.09 |
RDL2 | 1.04 | 0.86 | 2.17 | 0.93 | 2.07 | 1.45 | 0 | 1.28 |
SMCAUSlsa | 1.01 | 0.63 | 1.59 | 1.07 | 3.13 | 0.94 | 0 | 3.72 |
DRPP | 0.98 | 1.61 | 1.78 | 0.67 | 0.69 | 1.47 | 0 | 1.28 |
LSAGNd | 0.97 | 0.08 | 2.29 | 2 | 0 | 1.63 | 0 | 0 |
SYNMEDpos | 0.94 | 0.3 | 1.78 | 1.48 | 0 | 2.09 | 0 | 0.93 |
CNCTemp | 0.9 | 1.06 | 0.85 | 0.91 | 1.61 | 1.18 | 0 | 0 |
WRDADV | 0.89 | 1.56 | 1.9 | 0.67 | 0 | 1.4 | 0 | 0 |
CNCPos | 0.87 | 0.61 | 0.82 | 0.74 | 1.97 | 1.58 | 0 | 2.44 |
SMCAUSv | 0.86 | 1.16 | 0.91 | 1.27 | 0 | 1.22 | 0 | 0 |
PCTEMPp | 0.85 | 0.19 | 1.68 | 1.09 | 2.36 | 0.96 | 0 | 2.09 |
PCVERBz | 0.85 | 0.24 | 1.73 | 1.52 | 0 | 1.6 | 0 | 3.37 |
LDTTRc | 0.84 | 1.39 | 1.26 | 0.82 | 0 | 1.37 | 0 | 0 |
PCREFz | 0.78 | 0.25 | 1.2 | 0.56 | 2.5 | 1.41 | 0 | 0 |
RDFKGL | 0.77 | 0.36 | 2.06 | 0.91 | 0 | 1.86 | 0 | 0 |
LSASSp | 0.76 | 0.04 | 1.97 | 1.62 | 0 | 1.16 | 0 | 0.93 |
PCVERBp | 0.75 | 0.3 | 1.09 | 1.28 | 0 | 1.55 | 0 | 0 |
CRFCWO1d | 0.73 | 0.13 | 1.54 | 1.52 | 0 | 1.18 | 0 | 0 |
DRNP | 0.7 | 0.54 | 1.62 | 0.83 | 0 | 1.51 | 0 | 0 |
WRDPRO | 0.69 | 0.48 | 0.73 | 0.6 | 1.85 | 0.96 | 0 | 4.18 |
SMCAUSr | 0.69 | 0.9 | 0.05 | 0.72 | 0.63 | 1.33 | 0 | 0 |
WRDAOAc | 0.69 | 0.92 | 0.69 | 0.84 | 0.51 | 0.98 | 0 | 0 |
LDTTRa | 0.68 | 0.05 | 2.31 | 0.95 | 0 | 1.57 | 0 | 0 |
WRDCNCc | 0.67 | 0.53 | 1.3 | 0.38 | 3.63 | 0 | 0 | 1.28 |
PCSYNz | 0.62 | 0.24 | 1.76 | 0.77 | 0 | 1.45 | 0 | 1.28 |
CNCCaus | 0.61 | 0.72 | 0.73 | 0.73 | 0.12 | 1.1 | 0 | 0 |
WRDMEAc | 0.57 | 0.51 | 0.68 | 0.45 | 2.11 | 0.46 | 0 | 0 |
CNCTempx | 0.57 | 0.45 | 0.26 | 1.36 | 0.06 | 0.53 | 0 | 0 |
DESWLsy | 0.56 | 0.34 | 0.85 | 0.55 | 0.35 | 0.89 | 0.49 | 1.28 |
WRDPOLc | 0.56 | 0.48 | 0.89 | 0.63 | 0 | 1.31 | 0 | 0 |
SYNLE | 0.55 | 0.35 | 0.09 | 0.62 | 0 | 1.67 | 0 | 0 |
PCCNCz | 0.54 | 0.14 | 1.87 | 0.96 | 0 | 0.72 | 0 | 3.25 |
DRNEG | 0.54 | 0.05 | 0.73 | 0.81 | 1.44 | 0.74 | 0 | 0 |
WRDFRQa | 0.52 | 0.4 | 0.36 | 0.71 | 0 | 1.2 | 0 | 1.28 |
WRDFRQc | 0.52 | 0.97 | 0.3 | 0.33 | 0 | 1.09 | 0 | 1.28 |
DRVP | 0.5 | 0.11 | 0.79 | 0.77 | 0.61 | 0.88 | 0 | 0.81 |
SYNNP | 0.5 | 0.21 | 0.65 | 0.88 | 0 | 1.04 | 0 | 0 |
LSASSpd | 0.5 | 0 | 0 | 1.94 | 0 | 0 | 0 | 0 |
CNCLogic | 0.49 | 0.55 | 0.41 | 0.68 | 0 | 0.9 | 0 | 0 |
CNCADC | 0.49 | 0.38 | 1.61 | 0.46 | 0.24 | 0.93 | 0 | 0 |
SYNSTRUTt | 0.48 | 0 | 0 | 1.83 | 0 | 0 | 0 | 0 |
PCNARz | 0.46 | 0 | 0 | 1.77 | 0 | 0 | 0 | 0 |
PCCNCp | 0.46 | 0 | 1.15 | 0.87 | 0 | 0.93 | 0 | 0 |
WRDADJ | 0.45 | 0.29 | 1.12 | 0.72 | 0 | 0.76 | 0 | 0 |
CRFCWOad | 0.44 | 0 | 0 | 1.69 | 0 | 0 | 0 | 0 |
CRFAOa | 0.43 | 0 | 0 | 1.64 | 0 | 0 | 0 | 0 |
PCSYNp | 0.43 | 0 | 1.45 | 0.62 | 0 | 1.01 | 0 | 0 |
DESSC | 0.42 | 0 | 0 | 1.6 | 0 | 0 | 0 | 0 |
CRFCWO1 | 0.42 | 0 | 0 | 1.6 | 0 | 0 | 0 | 0 |
SYNMEDwrd | 0.42 | 0 | 0 | 1.61 | 0 | 0 | 0 | 0 |
CRFAO1 | 0.41 | 0 | 0 | 1.55 | 0 | 0 | 0 | 0 |
LSASS1 | 0.41 | 0 | 0 | 1.59 | 0 | 0 | 0 | 0 |
SYNMEDlem | 0.41 | 0 | 0 | 1.59 | 0 | 0 | 0 | 0 |
DRAP | 0.4 | 0.23 | 1.23 | 0.64 | 0 | 0.58 | 0 | 0 |
SMINTEp | 0.38 | 0.36 | 1.06 | 0.68 | 0 | 0.29 | 0 | 0.81 |
PCTEMPz | 0.38 | 0 | 0 | 1.45 | 0 | 0 | 0 | 0 |
WRDPRP3p | 0.38 | 0.08 | 0.44 | 0.01 | 1.83 | 0.83 | 0 | 0 |
SMTEMP | 0.37 | 0 | 0 | 1.44 | 0 | 0 | 0 | 0 |
WRDHYPnv | 0.36 | 0.3 | 0.02 | 0.78 | 0 | 0.5 | 0 | 0 |
CRFANP1 | 0.35 | 0 | 0 | 1.35 | 0 | 0 | 0 | 0 |
CRFNOa | 0.34 | 0 | 0 | 1.32 | 0 | 0 | 0 | 0 |
CRFSOa | 0.3 | 0 | 0 | 1.17 | 0 | 0 | 0 | 0 |
PCREFp | 0.28 | 0 | 0.32 | 0.32 | 0 | 0.97 | 0 | 0 |
DESWLsyd | 0.27 | 0.32 | 0.32 | 0.45 | 0 | 0.34 | 0 | 1.28 |
PCCONNp | 0.26 | 0.07 | 1.18 | 0.05 | 0 | 0.87 | 0 | 0 |
PCCONNz | 0.26 | 0.18 | 0.53 | 0.23 | 0 | 0.69 | 0 | 0 |
DESSL | 0.24 | 0 | 0 | 0.91 | 0 | 0 | 0 | 0 |
CRFNO1 | 0.22 | 0 | 0.98 | 0.06 | 0 | 0.81 | 0 | 0 |
RDFRE | 0.21 | 0 | 0 | 0.79 | 0 | 0 | 0 | 0 |
WRDFAMc | 0.21 | 0.29 | 0 | 0.07 | 1.19 | 0.02 | 0 | 0 |
SMINTEr | 0.21 | 0.07 | 0.37 | 0.32 | 0 | 0.5 | 0 | 0 |
CNCNeg | 0.15 | 0 | 0 | 0.58 | 0 | 0 | 0 | 0 |
CNCAll | 0.12 | 0 | 0 | 0.46 | 0 | 0 | 0 | 0 |
CNCAdd | 0.09 | 0 | 0 | 0.36 | 0 | 0 | 0 | 0 |
WRDIMGc | 0.09 | 0 | 0 | 0.36 | 0 | 0 | 0 | 0 |
CRFSO1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |