Crash Course - Annotation NA KdK
When users of the National Archive stumble upon the image of a line
of text (a line strip), either through browsing or by means of the
clustering algorithms of the Scratch search engine, they might be
willing to annotate that line, given a sufficient level of "Wiki-motivation".
Upon finishing the annotation of that line, at least three
goals are met:
- from this time on, other users may find that line through keyword-based search;
- the resulting new text annotation(s) can be used to retrain
the pattern-recognition and clustering algorithms, thereby
improving the accuracy of the results of future searches;
- thirdly, the line-strip annotations are useful because they allow for
a performance evaluation of the pattern-recognition algorithms.
Thus, even if the annotations are not used for training the system, they
are very useful!
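As an illustration of such a performance evaluation, the sketch below compares a human annotation with a recognizer's output using the character error rate (edit distance divided by the length of the reference). This metric is an assumption made for the example; these guidelines do not say which measure the system actually uses, and the function names are hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance between strings a and b (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def character_error_rate(reference, hypothesis):
    """Edit distance normalized by the length of the human annotation."""
    if not reference:
        raise ValueError("reference annotation must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Human annotation versus a (made-up) recognizer output for the same line strip.
rate = character_error_rate("Besluit fiat", "Besluit flat")
```

A lower rate means the recognizer's reading is closer to the annotation, which is why carefully typed lines are valuable even when they are never used for training.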
- The text is labeled in reading order, from upper left to lower right.
- The ink trace of the target text is blackened. The irrelevant fragments
on the margins of the line strip are displayed somewhat lighter and do not
need to be labeled.
- For the first line of a paragraph, the following elements may be present:
- [month] Jan,Feb,Maart,April,Mei,Juni,Juli,Aug,Sept,Octb,Novb,Decb
- [department] FD, MD, BD, MarD or BlD
- no (numero) ==> no
- The content of the line. Please seek advice if you need help with the Dutch spelling.
|Jan 14 19 Rappt FD 12 Jan no 49 Op adres van het Be
- Type what is written, not what you think you read.
- The two horizontal lines in front of "Besluit fiat" are coded as: _-_
- People did not use a hyphen at the end of the line: please do not add it in the annotation.
- Please annotate (label) only the lines on the scan number that
was allotted to you. This demonstrator does not yet have facilities for
record locking or multiple annotators.
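The conventions above can be sketched as a small self-check that an annotator might run on a typed line before submitting it. This is not part of the annotation site: the function names are hypothetical, the month and department lists are copied from the guidelines above, and treating any all-digit token as a number is an assumption made for the example.

```python
# Abbreviations as listed in the guidelines.
MONTHS = {"Jan", "Feb", "Maart", "April", "Mei", "Juni",
          "Juli", "Aug", "Sept", "Octb", "Novb", "Decb"}
DEPARTMENTS = {"FD", "MD", "BD", "MarD", "BlD"}


def tag_tokens(line):
    """Label each whitespace-separated token of a typed annotation."""
    tags = []
    for token in line.split():
        if token in MONTHS:
            kind = "month"
        elif token in DEPARTMENTS:
            kind = "department"
        elif token == "no":
            kind = "numero"
        elif token == "_-_":
            kind = "double-line"   # the two horizontal lines before "Besluit fiat"
        elif token.isdigit():
            kind = "number"        # assumption: plain digits are dates or record numbers
        else:
            kind = "text"
        tags.append((token, kind))
    return tags


def ends_with_added_hyphen(line):
    """True if the typed line ends in a hyphen, which the writers did not use."""
    return line.rstrip().endswith("-")


# The sample line from the guidelines, typed without the leading bar.
sample = "Jan 14 19 Rappt FD 12 Jan no 49 Op adres van het Be"
tags = tag_tokens(sample)
```

Here `tags` starts with `("Jan", "month")` and labels `FD` as a department and `no` as the numero marker, while `ends_with_added_hyphen(sample)` is false because the broken word `Be` was typed as written.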
The interface is rather rudimentary and intended for use by technical
people rather than by end users. Should you get lost on this site,
please go to Monk.