Computer-Aided Revision
A set of computer-aided program is developed to validate and revise the reconciled Brat annotation data. They are described follows:
${C_SPELL}/PostProcess
${C_SPELL}/PostProcess/bin
${C_SPELL}/PostProcess/data/Brat/NewTestRevised/
${C_SPELL}/PostProcess/data/Brat/NewTestRevised/brat
${C_SPELL}/PostProcess/bin/PostBratNewTest
2
1
2
| Tag | Check Items |
|---|---|
| ToSplit |
|
| ToSplitOnPunct |
|
| ToMerge |
|
| Misspelling |
|
| Informal |
|
| RealWord |
|
| OutOfVocabulary |
|
| WordExists |
|
| Punctuation |
|
| Garbage |
|
| Unknown |
|
From our experience, there are two types of errors that commonly seen in spelling annotation.
3
Check Brat Tags spans - the purpose of this check is to ensure generate gold standard correctly for the cases of contain, multi-tag and overlap for both non-word and real-word