MetaArchive Cooperative
From Martin Halbert
Answering the Digital Preservation Challenge: The MetaArchive Cooperative
The advent of digital technologies has presented society with a new challenge—how will we preserve the digital files that already comprise the bulk of the 21st century’s historical record, from government documents to scientific experiment results, from family videos to newscasts, and from email to blogs? How can we ensure that future generations will have access to the cultural record we are creating today in digital formats?
The MetaArchive Cooperative, led by Emory University’s Digital Programs and Systems division of the University Libraries, is helping to answer that question. With $1,125,000 in funds from the Library of Congress’s National Digital Information Infrastructure and Preservation Program (NDIIPP) and from its six partner institutions (Emory University, Auburn University, Florida State University, Georgia Tech, Virginia Tech, and the University of Louisville), the MetaArchive Cooperative will build a distributed digital preservation community to foster and promote the long-term survival of the digital assets of cultural memory organizations, including libraries, archives, and museums.
This project builds on the success of a previous project (2004-2007), also led by Emory University in partnership with these institutions, in which the MetaArchive Cooperative developed and implemented a technical network, the MetaArchive of Southern Digital Culture, that digitally preserves Southern-themed content from its six partner institutions.
“We are doing for digital materials what libraries and archives have done for paper collections for millennia,” said Dr. Martin Halbert, the project’s principal investigator and director of Digital Programs and Systems at Emory University. “The next generation will have little information about the early digital years unless we act now to preserve what we produce as a culture.”
Digital materials are terribly fragile, and their development is occurring at warp speeds. Websites, for example, exist for less than 100 days on average, and millions of new pages are created each day. Anyone with a TRS-80 remembers the days of recording files to tape, and anyone with a floppy disc knows that it takes no more than a few years for a file storage device to become outmoded and nearly impossible to use. Capturing the files stored on computers, the internet, and various types of storage devices in a timely manner—especially those produced by such key groups as governments, scholars, journalists, artists, and scientists—is essential if we want to have a record of our cultural output that is consistent with the records preserved in past centuries. The MetaArchive Cooperative began to capture, describe, and store such files for cultural posterity in 2004.
In its new project phase, which begins in September 2007, the MetaArchive Cooperative will formalize a sustainable business model for cooperative distributed digital preservation and will establish an outreach effort to U.S. cultural memory organizations that possess digital content. The Cooperative will also host a series of workshops that provide information and training for institutions and individuals seeking to build or join distributed digital preservation networks based on the LOCKSS software. “Our goal is to encourage the adoption of distributed digital preservation,” said Dr. Katherine Skinner, the project’s co-director and Digital Projects Librarian at Emory University. “In addition to welcoming new members into the Cooperative and our existing networks, we also want other cultural memory institutions to freely adopt our technical and administrative frameworks to form new networks of their own.”
The MetaArchive approach to digital preservation relies upon a distributed preservation network infrastructure that is based on the LOCKSS software developed at Stanford University. “Distributed” means that copies of digital materials are stored on servers in different geographical spaces (e.g., in different states or countries). Those servers are networked together so that they are constantly in contact with one another. If something happens to a file—perhaps because it degrades naturally or because the geographical region in which it is located suffers a catastrophe (e.g., Hurricane Katrina)—the network will check all other copies of the file. After determining that all other copies are intact and identical, the network can rebuild that degraded or lost file as needed to replace the lost material, thus ensuring its stability over time. The LOCKSS software is available freely as open source software.
The MetaArchive Cooperative The MetaArchive Cooperative is an independent, multi-state membership association whose purpose is to support, promote, and extend the MetaArchive approach to distributed digital preservation practices. The MetaArchive Cooperative currently hosts a preservation network, the MetaArchive of Southern Digital Culture. This preservation network houses a critical body of southern digital content, including such items as government documents, interviews with major American musicians, photographs of the Civil Rights Movement, video footage of agricultural practices, scholarly writings about the South, and maps of the U.S. South. The Cooperative is currently building a second network with an international scope and range that includes university members in England, Canada, New Zealand, and Brazil.
The project is part of the MetaScholar Initiative in Woodruff Library’s Digital Programs and Systems division—a group that has earned its national reputation as a leader in the fields of Digital Library research and Internet-based scholarly communication. In the past five years, the division has received more than $4 million in grant support for projects and programs that promote new ways of conducting research and scholarship in the digital age.


