| Step | Description | IO | Notes - Examples
|
|---|
| 15 | Get ruleType on multiwords from Lexicon GetLetRtForTermsInLexicon.java- Assign invalid Lead-End-Unit ruleTypes on Lexicon multiwords:
- RT_INV_LEAD_TERM
- RT_INV_END_TERM
- RT_INV_END_ABB
- RT_INV_LEAD_END_TERM
| Inputs:- ./outData/3.InvalidLeadEndTerm/lexMultiwords.data
Outputs: - ./outData/3.InvalidLeadEndTerm/lexMultiwords.data.ruleType
- ./outData/3.InvalidLeadEndTerm/lexMultiwords.data.ruleType.ilet (10)
| - 1 min.
- Only 10 exceptions, all of them are RT_INV_END_ABB
=> Algorithm of endWord with abbreviation pattern can be improved
|
| 16 | Analyze ruleType on multiwords from Lexicon AnalyzeLetRtForTermsInLexicon.java- Analyze results from above step (10)
- Get the precision of exclusive fitler on Lexicon
| Inputs:- ./3.InvalidLeadEndTerm/lexMultiwords.data.ruleType
- ./3.InvalidLeadEndTerm/lexMultiwords.data.ruleType.exceptions
outputs: - ./3.InvalidLeadEndTerm/lexMultiwords.data.ruleType.rpt
| - 5 sec.
- precision: 99.9981%
- 1 invalid ruleType: RT_INV_END_ABB
|
| 17 | Get multiwords in Lexicon by lead/end unitsGetLexiconMultiwordsByLeadEndTerm.java- Find all multiwords in Lexicon by specifying lead/end word
| Inputs:- ./outData/3.InvalidLeadEndTerm/lexMultiwords.data.ruleType
Outputs:- ./outData/3.InvalidLeadEndTerm/LexiconMw/lexMultiwords.data.ruleType.${LEAD_END_WORD}
| - 5 sec.
- Used for case study
|