Description: The DIVA project is investigating academic lineages through publicly available thesis data, in this cast the DiVA2 dataset from http://www.diva-portal.org. This was available as a MODS XML database export. The desired format for the project was in a standard relational database format.
Who: David Zeitlyn (School of Anthropology and Museum Ethnography) and Daniel Hook
My Work: Given a relational database schema, I undertook legacy data migration of existing thesis records for to a relational database format. MODS to TEI to CSV.
Commitment: 4 days