|

- Source archives around the world take original paper documents and
digitise them, turning them into digital images of
the original documents.
- Muninn WWI collects very large numbers of these images, shipping them
to the SHARCNET computer array
in Waterloo, Ontario.
- The Sharcnet computer turns the images into electronic files through
a process of data extraction. This digital information
then get organised into a database (DB).
- The database computers recognise patterns in the data which get sent
back to the Sharcnet computer to improve the accuracy of the data extraction
process.
- At the same time, our team of advisers look at the databases and make
suggestions which improve both the organisation of information
in the databases and the data extraction process in Sharcnet.
- When the data in the databases is good enough, it gets shipped to
Muninn's research projects, as well as to academic researchers all around
the world.
- Meanwhile, the techniques we develop while extracting and organising
our data are made available to other researchers interested in the same
kinds of problems.
|