Results of Proposed Rules - 2021
I. Results
11 non-duplicated SD-Rules are proposed to be added to the SD-Rule set for evaluation. The results from the optimal set are as follows:
| SD-Rule | Rank | Precision | Instances | Source | Decompose | Results |
|---|---|---|---|---|---|---|
| Good Rules | ||||||
| able$|adj|eability$|noun | 13 | 100.00% | 42 | NOM_D | Root-Parent | Good SD-Rule |
| ster$|verb|stration$|noun | 14 | 100.00% | 29 | NOM_D | Decompose-Child | Good SD-Rule |
| lter$|verb|ltration$|noun | 16 | 100.00% | 15 | NOM_D | Decompose-Child | Good SD-Rule |
| ability$|noun|eable$|adj | 34 | 97.92% | 48 | NOM_D | Root-Parent | Good SD-Rule |
| d$|verb|sion$|noun | 56 | 93.18% | 44 | NOM_D | Root-Parent | Good SD-Rule |
| narity$|noun|nary$|adj | 68 | 90.48% | 21 | NOM_D | Decompose-Child | Good SD-Rule |
| e$|verb|ition$|noun | 71 | 89.58% | 48 | NOM_D | Root-Parent | Good SD-Rule |
| ge$|verb|gence$|noun | 74 | 88.89% | 18 | NOM_D | Decompose-Child | Good SD-Rule |
| $|noun|cide$|noun | 78 | 86.67% | 15 | EXP_SUG | Root-Parent | Good SD-Rule |
| t$|verb|tted$|adj | 91 | 77.78% | 9 | EXP_SUG | Root-Parent | Good SD-Rule |
| $|verb|ed$|adj | 101 | 70.10% | 311 | EXP_SUG | Root-Parent | Good SD-Rule |
| ctic$|adj|xis$|noun | 104 | 65.85% | 27 | ORG_FACT | Root-Parent | Good SD-Rule |
| Bad Rules | ||||||
| e$|noun|ous$|adj | 110 | 57.22% | 187 | ORG_FACT | Root-Parent | Bad SD-Rule |
| ed$|adj|ment$|noun | 115 | 51.76% | 85 | NOM_D | Root-Parent | Bad SD-Rule |
| $|adj|y$|noun | 127 | 42.07% | 145 | NOM_D | Root-Parent | Bad SD-Rule |
| er$|noun|y$|noun | 128 | 39.26% | 163 | ORG_FACT | Root-Parent | Bad SD-Rule |
| c$|adj|sm$|noun | 134 | 20.63% | 504 | NOM_D | Root-Parent | Bad SD-Rule |
| er$|noun|ing$|noun | 145 | 0.00% | 534 | ORG_FACT | Root-Parent | Bad SD-Rule |
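
To make the rule notation in the table concrete, the following is a minimal sketch of how one SD-Rule string might be parsed and applied to a word. It assumes the four pipe-separated fields are source suffix, source category, target suffix, and target category, with `$` marking the end of the word. The names `SdRule`, `parse_sd_rule`, and `apply_sd_rule` are hypothetical and are not part of any released tool.

```python
from typing import NamedTuple, Optional


class SdRule(NamedTuple):
    """One SD-Rule (assumed layout: source suffix, source category,
    target suffix, target category)."""
    src_suffix: str
    src_cat: str
    tgt_suffix: str
    tgt_cat: str


def parse_sd_rule(text: str) -> SdRule:
    """Split a rule such as 'ster$|verb|stration$|noun' into its four fields."""
    fields = text.strip().split("|")
    if len(fields) != 4:
        raise ValueError(f"expected 4 fields, got {len(fields)}: {text!r}")
    src, src_cat, tgt, tgt_cat = fields
    if not (src.endswith("$") and tgt.endswith("$")):
        raise ValueError(f"suffix fields must end with '$': {text!r}")
    # Drop the trailing '$' end-of-word anchor from both suffixes.
    return SdRule(src[:-1], src_cat, tgt[:-1], tgt_cat)


def apply_sd_rule(rule: SdRule, word: str, cat: str) -> Optional[str]:
    """Return the candidate derivation, or None when the rule does not match."""
    if cat != rule.src_cat or not word.endswith(rule.src_suffix):
        return None
    # Slice by length so that an empty source suffix keeps the whole word.
    stem = word[: len(word) - len(rule.src_suffix)]
    return stem + rule.tgt_suffix


rule = parse_sd_rule("ster$|verb|stration$|noun")
print(apply_sd_rule(rule, "administer", "verb"))  # administration
```

The output of such a function is only a candidate pair. Whether the generated word is a valid derivation still has to be checked, which is what the Precision column reports for each rule.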

| Proposed parent rule | Child Rule used |
|---|---|
| er$|verb|ration$|noun | ster$|verb|stration$|noun, lter$|verb|ltration$|noun |
| ity$|noun|y$|adj | narity$|noun|nary$|adj |
| $|verb|nce$|noun | ge$|verb|gence$|noun |
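
The relationship between a proposed parent rule and the child rule used in its place can be checked mechanically. The sketch below assumes that a child rule is the parent rule with the same non-empty stem prepended to both suffixes (for example, `st` in `ster$|verb|stration$|noun`). This is an interpretation of the table, and the helper names `split_rule` and `shared_stem` are hypothetical.

```python
from typing import Optional, Tuple


def split_rule(text: str) -> Tuple[str, str, str, str]:
    """Split 'suffix$|cat|suffix$|cat' and drop the '$' end-of-word anchors."""
    src, src_cat, tgt, tgt_cat = text.strip().split("|")
    return src.rstrip("$"), src_cat, tgt.rstrip("$"), tgt_cat


def shared_stem(child: str, parent: str) -> Optional[str]:
    """Return the stem the child adds to both parent suffixes, or None."""
    c_src, c_src_cat, c_tgt, c_tgt_cat = split_rule(child)
    p_src, p_src_cat, p_tgt, p_tgt_cat = split_rule(parent)
    # Categories must agree on both sides of the rule.
    if (c_src_cat, c_tgt_cat) != (p_src_cat, p_tgt_cat):
        return None
    if not (c_src.endswith(p_src) and c_tgt.endswith(p_tgt)):
        return None
    src_stem = c_src[: len(c_src) - len(p_src)]
    tgt_stem = c_tgt[: len(c_tgt) - len(p_tgt)]
    # The child specializes the parent only if both sides add the same stem.
    if src_stem and src_stem == tgt_stem:
        return src_stem
    return None


print(shared_stem("ster$|verb|stration$|noun", "er$|verb|ration$|noun"))  # st
print(shared_stem("ge$|verb|gence$|noun", "$|verb|nce$|noun"))            # ge
```

Under this reading, narrowing a parent rule to a child rule trades coverage for precision: the child matches fewer words, but the extra stem filters out many invalid pairs.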
II. Further Observation on NOM_D
The top SD-Rules generated from NOM_D are added and evaluated (${SUFFIX_D}/data/${YEAR}/dataR/SdRulesFromSdPairs/nomD/sdRulesFromSdPairs.rpt.${YEAR}).
| ID | SD-Rule | Rank | Notes |
|---|---|---|---|
| Added in 2015: Freq. > 200, Coverage > 1.00% , Accum. Coverage > 80.0% | |||
| 1 | $|adj|ness$|noun | 1 | Good |
| 2 | bility$|noun|ble$|adj | 2 | Good (ility$|noun|le$|adj) |
| 3 | se$|verb|zation$|noun | 3 | Good |
| 4 | sation$|noun|ze$|verb | 4 | Good |
| 5 | iness$|noun|y$|adj | 16 | Good |
| 6 | ation$|noun|e$|verb | 21 | Good |
| 7 | nce$|noun|nt$|adj | 25 | Good (ce$|noun|t$|adj) |
| 8 | e$|verb|ion$|noun | 26 | Good |
| 9 | cy$|noun|t$|adj | 27 | Good |
| 10 | $|verb|ment$|noun | 28 | Good |
| 11 | ication$|noun|y$|verb | 29 | Good |
| 12 | ed$|adj|ion$|noun | 30 | Good |
| 13 | $|adj|ity$|noun | 32 | Good |
| 14 | e$|adj|ity$|noun | 35 | Good |
| 15 | $|verb|ion$|noun | 49 | Good |
| 16 | $|verb|ing$|noun | 53 | Good |
| 17 | $|verb|ation$|noun | 61 | Good |
| Added in 2016: Freq. > 100, Coverage > 0.40% , Accum. Coverage > 83.36% | |||
| 18 | e$|verb|is$|noun | 43 | Good |
| 19 | ation$|noun|ed$|adj | 50 | Good |
| 20 | e$|verb|ing$|noun | 60 | Good |
| 21 | $|adj|ism$|noun | 62 | Good |
| 22 | e$|adj|ion$|noun | 100 | Bad |
| Added in 2017: Freq. > 70, Coverage > 0.30% , Accum. Coverage > 85.00% | |||
| 23 | sation$|noun|zed$|adj | 7 | Good |
| 24 | sed$|adj|zation$|noun | 8 | Good |
| 25 | sity$|noun|us$|adj | 65 | Good (osity$|noun|ous$|adj) |
| 26 | e$|verb|tion$|noun | 63 | Good |
| 27 | ous$|adj|y$|noun | 116 | Bad (exit in 2013) |
| Added in 2020: Freq. > 50, Coverage > 0.20% , Accum. Coverage > 87.41% | |||
| 28 | ability$|noun|ible$|adj | 10 | Good |
| 29 | sable$|adj|zability$|noun | 12 | Good |
| 30 | sability$|noun|zable$|adj | 13 | Good |
| 31 | sis$|noun|ze$|verb | 41 | Good |
| 32 | al$|noun|e$|verb | 92 | Good |
| Added in 2021: Freq. > 40, Coverage > 0.17% , Accum. Coverage > 89.27% | |||
| 33 | ability$|noun|eable$|adj | 34 | Good |
| 34 | c$|adj|sm$|noun | 134 | Bad |
| 35 | er$|verb|ration$|noun | 14, 16 | Good |
| 36 | $|verb|nce$|noun | 74 | Good |
| 37 | ed$|adj|ment$|noun | 115 | Bad |
| 38 | ity$|noun|y$|adj | 68 | Good |
| 39 | $|adj|y$|noun | 127 | Bad |
| 40 | able$|adj|eability$|noun | 13 | Good |
| 41 | e$|verb|ition$|noun | 71 | Good |
| 42 | d$|verb|sion$|noun | 56 | Good |
The results show that 88.09% (37/42) are good SD-Rules; more SD-Rules from NOM_D should be added and evaluated in future releases.
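
The yearly cut-offs in the table (Freq., Coverage, Accum. Coverage) can be illustrated with a short sketch. It assumes that coverage is a rule's frequency divided by the total frequency of all candidate rules, and that accumulated coverage is the running sum down the frequency-sorted list; the report does not spell out these definitions. The function names and the toy counts are hypothetical and are not taken from the report.

```python
from typing import Dict, List, Tuple

Row = Tuple[str, int, float, float]  # (rule, freq, coverage %, accum. coverage %)


def coverage_table(freqs: Dict[str, int]) -> List[Row]:
    """Sort rules by descending frequency and attach coverage figures."""
    total = sum(freqs.values())
    if total == 0:
        return []
    rows: List[Row] = []
    accum = 0
    for rule, freq in sorted(freqs.items(), key=lambda kv: kv[1], reverse=True):
        accum += freq
        rows.append((rule, freq, 100.0 * freq / total, 100.0 * accum / total))
    return rows


def select_rules(freqs: Dict[str, int], min_freq: int, min_coverage: float) -> List[Row]:
    """Keep rules whose frequency and coverage both exceed the cut-offs."""
    return [row for row in coverage_table(freqs)
            if row[1] > min_freq and row[2] > min_coverage]


# Hypothetical counts for illustration only.
toy = {"rule_a": 60, "rule_b": 25, "rule_c": 10, "rule_d": 5}
for rule, freq, cov, acc in select_rules(toy, min_freq=8, min_coverage=9.0):
    print(f"{rule}: freq={freq} coverage={cov:.2f}% accum={acc:.2f}%")
```

Lowering the cut-offs each year, as the table headings do, admits rules further down the list, so the accumulated coverage of the evaluated set grows with each release.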
III. Further Observation on ORG_FACT
The top SD-Rules generated from ORG_FACT are added and evaluated (${SUFFIX_D}/data/${YEAR}/dataR/SdRulesFromSdPairs/orgFacts/sdRulesFromSdPairs.rpt.${YEAR}).
| ID | SD-Rule | Rank | Notes |
|---|---|---|---|
| Added in 2015: Freq. > 40, Coverage > 1.00% , Accum. Coverage > 11.50% | |||
| 1 | $|noun|less$|adj | 17 | Good |
| 2 | $|adj|ally$|adv | 23 | Good |
| 3 | ist$|noun|y$|noun | 45 | Good |
| 4 | $|verb|ion$|noun | 49 | Good, also in NOM_D |
| 5 | c$|adj|s$|noun | 57 | Good (ic$|adj|is$|noun) |
| 6 | $|noun|ful$|adj | 64 | Good |
| Added in 2016: Freq. >= 35, Accum. Coverage > 16.00%, Ind. Coverage > 0.80% | |||
| 7 | sia$|noun|tic$|adj | | |
The results show that 60.87% (14/23) are good SD-Rules; more SD-Rules from ORG_FACT should be added in future releases.
V. Future Work
Evaluate more SD-Rules from NOM_D and ORG_FACT further down the list.
ORG_FACT is close to its limit; it may be worth reviewing for one more year, until no more good rules can be found.