Web archiving preserves web content for future generations and keeps it accessible to the public, even if it is not available on the original website anymore.
The websites of the EU institutions, agencies and bodies are archived 4 times per year. In general, this only concerns websites hosted on the europa.eu domain and subdomains, although there are exceptions (see below).
If you intend to take a website offline or change it substantially, the website can be archived on an ad hoc basis at the request of the website owner.
In principle, only requests to archive websites in the europa.eu domain or subdomains will be accepted. For websites or pages outside the europa.eu domain, the requester should duly justify that:
In principle, all static web content is archived. Embedded social media accounts and databases behind websites are currently not archived.
The archive is freely accessible online
Before revamping or taking offline (all or a part of) your website, you may want to archive it one last time. Prepare your website for archiving by removing all content and files that have no future (historical, legal, political, research, cultural) value. Remove also all content that is protected by intellectual property rights (e.g. copyright); that is confidential or private; or that is affected by data protection rules.
The following guidelines can help you to prepare your site for archiving:
Preparing sites for archiving.pdf
Please also take the following into account: users can navigate the archived sites like a live website. However, archiving with a crawler has some technical limitations and as a result certain features may not work, including the following:
Archiving workflow
a. Regular archiving of living websites
What | How | Who | When |
Archiving request | Provide required information using request form. | Website owner | Upon establishment of a new EC and/or DG website |
Analysis of request | OP WP team | ||
Approval/rejection of request | Email with justification of conclusions to website owner and CEM | OP WP team | |
For accepted requests | |||
Quarterly crawling | Remote crawling | OP WP team and contractor | 15 March, 15 June, 15 September and 15 December |
Quality control | Visual/manual check of quality of the crawl, and feedback to OP WP team | Website owner | Upon invitation, sent by OP WP team |
Patching | If needed and possible: improvement of quality of the archived version | OP WP team and contractor | Upon reception of WO feedback on quality |
Acceptance/rejection of crawl | Email to OP WP team | Website owner | |
Publication/takedown of crawl | OP WP team |
b. Ad hoc archiving of websites that are to be taken offline or changed substantially
What | How | Who | When |
Clean-up of website | Website owner | ||
Archiving request | Provide required information using request form. | Website owner | At least 3 months before the site will be taken offline/changed |
Analysis of request | OP WP team | ||
Approval/rejection of request | Email with justification of conclusions to website owner and CEM | OP WP team | Maximum 1 week after reception of CEM approval |
For accepted requests | |||
Planning | Discussions on deadlines and crawl specifications | Upon approval of the request | |
Crawling | Archiving following crawl specifications | OP WP team and contractor | According to planning agreed with website owner |
Quality control | Visual check of quality of the crawl, and feedback to OP WP team | Website owner | Upon invitation, sent by OP WP team |
Patching | If needed and possible: improvement of quality of the archived version | OP WP team and contractor | Upon reception of website owner feedback on quality |
Acceptance | Email to OP WP team | Website owner | |
Publication | OP WP team | ||
Redirections (if desired) | Following the guidelines provided by OP WP team | Website owner |
There are legitimate circumstances when it may be required to hide pages in the web archive from public view.
Anyone can submit a motivated takedown request. Please use this email link to initiate it: op-web-preservation@publications.europa.eu.
Takedown will only be considered in one of the following cases:
© European Union, 2019
The Publications Office carries out web archiving to preserve the websites of the European Union. Most of the archived content of the websites that are accessible in the EU web archive (EUWA), is under EU (or EU institutions, agencies or bodies) copyright. Ownership and copyright of websites in the EUWA remain the responsibility of the website owners.
Unless otherwise stated, the material obtained from the EUWA may be freely reproduced. This general principle can be subject to conditions, which may be specified in individual copyright notices. It does not apply to photographs, videos, pieces of music or other material subject to intellectual property rights of third parties (non-EU). In such cases, permission to use the material must be sought directly from the copyright holders. The Publications Office does not warrant that all third-party content is appropriately marked.
All logos and trademarks are excluded from the abovementioned permission.
Any queries regarding the above should be addressed to the following email OP-COPYRIGHT@publications.europa.eu
Need further assistance on this topic? Please contact either the team in charge of Europa Domain Management or the team responsible for archiving at the Publications Office. (EU Login required)