According to the Web site for the NetarchiveSuite software, it was "developed by the two national deposit libraries in Denmark, The Royal Library and The State and University Library, and has been running in production, harvesting the Danish world wide web for three years. The Danish netarchive currently contains over 120 TB of data that are mirrored on two different geographical locations." It's open source software based on the Heritrix web crawler from the Internet Archive. You can read more information about it on the netarchive.dk project site. I first took note of it on my Ten Thousand Year Blog on July 20, 02004.
David Mattison is an archivist (retired from active duty), historian and digital culture observer from British Columbia, Canada. His Ten Thousand Year Blog was hosted by WordPress.com between October 02008 and August 7, 02010. The photograph in the header was taken on May 22, 02009 at the Kew Gardens Tube station following a visit to the National Archives, England.
No comments:
Post a Comment