Results 1 to 3 of 3

Thread: Some National Website Archives

  1. #1
    Join Date
    Apr 2017
    Location
    Canberra
    Posts
    204
    Thanks
    16
    Thanked 32 Times in 29 Posts

    Default Some National Website Archives

    Website archiving
    The life-span of personal, non-commercial websites can be quite short, often only a handful of years. Good aviation and military history sites, with rich and unique content, may pass into oblivion when the site owner can no longer maintain them or in the event of host or other technical changes beyond the owner's control.

    Anyone who has gone to the effort of creating and maintaining a well-researched presentation of unique, sound content could assist future researchers by contacting their own national web archiving programme to propose their site for preservation. The International Internet Preservation Consortium offers a comprehensive summary of national programmes.
    This selective discussion, sourced from my 211 Squadron website, may perhaps be useful for reference.

    Australian national web archive

    PANDORA, part of the Australian Web Archive of the National Library of Australia and partners, is currently a selective archive of Australian websites. Webmasters and other interested parties might wish to consider the selection guidelines and make an enquiry directly.
    Preservation of my own website and future access are both assured, for the short term and the long term, with independent archive and back up sets:

    • On-line, PANDORA takes a copy of the whole 211 Squadron site once a year, on 6 August
    • The archived set of site copies, starting in May 2002, is easily found by clicking the PANDORA link on the Home page, through the National Library's Trove and on-line catalogue, directly from the PANDORA menu, or by the persistent URL for the complete archive set: http://nla.gov.au/nla.arc-24825.
    • While the PANDORA Persistent URL reference explains how to extract related URLs for any one set, or any one page, via the Citation Service, the Trove presentation provides a choice of citation styles directly.
    • Off-line, the site is backed up monthly with multiple copies on several devices, separately stored.


    Other web archives

    New Zealand
    The National Library of NZ runs their national web archiving programme. Content is searchable through the NZNL on-line catalogue. While the archive is selective, nominations are welcomed:
    http://natlib.govt.nz/collections/a-...nd-web-archive

    United Kingdom

    In the UK, the British Library’s UK Web Archive caters for personal sites, sites of private organisations, and the like. From April 2013, the UKWA began automatically gathering all sites under the .uk domain or published in the UK. The Archive particularly invites nominations from Webmasters (or interested parties) for UK sites with addresses that don’t use .uk in their name. Details: http://www.webarchive.org.uk/ukwa/info/nominate.
    Website search: https://www.webarchive.org.uk/en/ukwa/index

    UKWA archived website copies can be viewed only in the UK's deposit libraries
    List: https://www.webarchive.org.uk/en/ukwa/noresults
    UK Government sites are covered by The National Archives at http://www.nationalarchives.gov.uk/webarchive

    United States

    The Internet Archive offers a hands-off if imperfect alternative. Although erratically frequent and notionally global, copies may be quite incomplete (often lacking images and/or pages in whole or in part). Sites notionally protected by robots.txt files from certain crawler searches may not be copied, although not at all consistently.

    For over 20 years, any search for archived copies of a closed site was seriously hobbled by restricting search to URL address/es of the site while it did exist. By May 2018, however, at long last the search field allowed both keyword and URL search. At about this time, a number of earlier site instances simply disappeared, while for more recent instances, robots.txt protections were to all appearances wilfully ignored. The archive still does not respond well to Google search.

    Fly-by “archives”

    In recent years, private instances of so-called “archives” have begun to arise. Consisting of selective copies of other’s sites or site pages, these unauthorised content duplicators operate without the knowledge, permission or consent of original site owners and are unresponsive to enquiry or complaint.

    A deliberate large-scale breach of copyright, these unauthorised copies needlessly consume site band-width, to duplicate selected content regardless of any true, authorised, complete archival instances. In the long term, the result will be multiple incomplete copies and effectively random links to outdated content.

    Source: My current 211 Squadron RAF website Enquiries page.
    Aug 2020 archived copy: http://pandora.nla.gov.au/pan/24825/...enquiries.html
    Last edited by Don Clark; 9th October 2020 at 12:33.
    Toujours propos

  2. #2
    Join Date
    Nov 2007
    Location
    Prague, Czech Republic
    Posts
    4,058
    Thanks
    43
    Thanked 53 Times in 52 Posts

    Default Re: Some National Website Archives

    Just would like to add there is also Czech official one run by our National Library.
    The official site has also an English option: https://www.webarchiv.cz/en/

    I am pleased my site is also archived: https://www.webarchiv.cz/en/search/c...lovak%20airmen

    Pavel
    Czechoslovak Airmen in the RAF 1940-1945
    http://cz-raf.webnode.cz

  3. #3
    Join Date
    Apr 2017
    Location
    Canberra
    Posts
    204
    Thanks
    16
    Thanked 32 Times in 29 Posts

    Default Re: Some National Website Archives

    Thanks for that, Pavel.
    Be interested to see other's instances.

    A curiosity for you, Pavel. Your site's archived copy will load happily in MS Edge (a poor browser not usually worth bothering with) but will not load in current Firefox
    There it fails with a Secure Connection Failed moan about TLS versions, which is at least technically intelligible (if you see what I mean).
    Apparently current Firefox dislikes the Czech National Library site set-up!
    Last edited by Don Clark; 9th October 2020 at 21:16.
    Toujours propos

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •