2.1 High-Level Requirement Identification

HIGH-LEVEL REQUIREMENT IDENTIFICATION

The basis of architecture definition is built on an analysis of requirements, as this provides the necessary context and scope for the system to be built, including its desired functionality, as well as its main quality attributes. For complex projects such as data spaces, requirement elicitation is an iterative process performed in stages, with a gradually increasing level of detail. The initial set of requirements identified typically corresponds with high-level functionality at the core of the system, without which it cannot achieve its purpose.

These, albeit high-level and often not fully precise requirements, tend to be Architecturally Significant Requirements (ASRs), defined as those that have a profound impact on the final design, for example, that data spaces should support data transfer and exchange between participant infrastructures. Once identified, top-level requirements are further broken down into more, for example, which interfaces and standards to follow, building a hierarchical requirement tree gradually through the design lifecycle, often iterating between architecture definition, implementation, and further requirement elicitation steps

In this documentation we are mostly concerned with top-level requirements. Two effective ways to identify these are stakeholder interviews as well as the analysis of the business goals of the system. Data spaces are a European policy initiative, with a clearly defined objective to increase the availability of data within the economic fabric of the European Union, supporting its competitiveness and digital sovereignty. This is fully reflected in policy documents such as the aforementioned Staff Working Document (SWD) on the Data Spaces as well as the DIGITAL work programme.

ARCHITECTURE DEFINITION

The first stage of architecture definition should effectively transition from high-level requirements known at the start of the project to an initial implementable design that satisfies key stakeholder concerns, in this case those from data spaces participants.

They should provide functionality for a wide range of stakeholders, at societal scale. These include not only policy-makers and public administrations, but also private sector organisations of all sizes, the scientific community, and the public at large. While being a brand new initiative, Data Spaces cannot be considered to be a fully greenfield project to be built from scratch. Indeed, many public and industry-led initiatives have started to build and pilot data sharing ecosystems, and there is an expectation that the know-how —and even the software building blocks— these projects produced could be reused for the European Data Spaces.

Since the EU policy documents and programmes are themselves based on extensive stakeholder consultation, the requirements extracted from these sources are already informed by the needs of data spaces users across all sectors of interest. We therefore scanned relevant policy documents in order to extract requirements of data spaces and we further classified these into functional and non-functional categories. While functional requirements define what data spaces must provide, i.e. they explain the concrete components of data spaces, non-functional requirements define how data spaces should work, i.e. their quality attributes.

For this reason, requirement elicitation is performed at two different levels, including within the data spaces themselves, as well as at the level of the data spaces.

POLICY PROVISIONS ON DATA SPACES

The table below is a collection of policy provisions on data spaces, based on the data spaces policies. Such provisions are translated into groups of functional and non-functional requirements. These could form the basis for more granular requirement elicitation, as well as for an analysis of the functionality provided by existing data sharing initiatives

Policy provision	Functional requirements	Non-functional requirements
A secure and privacy-preserving IT infrastructure to pool, access, process, use and share data	Data Transfer and Exchange Data Storage Data Processing and Analytics Data Processing and Analytics Data Pooling & Collaboration	Security Confidentiality
Data holders will have the possibility, in the data space, to grant access to or to share certain personal or non-personal data under their control	Identity, Authentication & Access Control Usage Control Policies	Confidentiality
… promote the development of tools to pool, access, use and share all types of data favouring the development of common open standards and findable, accessible, interoperable and reusable (FAIR) principles... data holders could use these tools to ease the uploading of data into data spaces, to give or revoke their authorisation to data and to change access rights and specify new conditions of how their data can be accessed and reused over time	Data Transfer and Exchange Identity, Authentication & Access Control Usage Control Policies	Interoperability
Data that is made available can be reused against compensation, including remuneration, or for free.	Transaction Metering and Billing
Participants […] use the common technical infrastructure and building blocks which will allow the data spaces to be built in an efficient and coordinated manner		Maintainability Variability
The common technical infrastructure will have to […] integrate the cybersecurity-by-design principle		Security

Policy provision	Functional requirements	Non-functional requirements
Participation of an open number of organisations/ individuals	Identity, Authentication & Access Control	Scalability
Common European data spaces could be developed on international standards, INSPIRE (for spatial data) and FAIR principles to favour interoperability, ...exploitation of data on EU computing infrastructures (e.g., cloud and HPC) and be interconnected and progressively made interoperable	Data Interoperability Features Data Processing & Analytics	Interoperability Performance
European rules and values, in particular personal data protection, consumer protection legislation and competition law, are fully respected	Compliance Monitoring & Auditing Data Protection	Auditability
Enhance the development of new data-driven products and services in the EU and thereby create the core tissue of an interconnected and competitive European data economy	Data Processing & Analytics
Data Spaces Middleware: Provide a full cloud stack with basic services that can also be operated at the edge, while foreseeing the subsequent integration of High-Performance Computing and far edge computing	Multi-tier Support, Federation, Orchestration	Portability Performance
Data Spaces Middleware: Provide a technical baseline to be used by all EU common data spaces to avoid duplication of effort and overlaps and to ensure a proper alignment of the various implementation approaches		Maintainability Variability
Data Spaces Middleware: Allow state-of-the art data management between cloud and edge, enabling seamless ultra-fast data workload balancing between them, and intelligent data porting between centralised and decentralised data infrastructures Ensure performance and quality of service in the execution of applications across multiple cloud and edge providers Provide a multi-cloud orchestration solution, with built-in identification and security management services	Data Transfer & Exchange Multi-tier Support, Federation, Orchestration Identity, Authentication & Access Control	Portability Performance Security
Data Spaces Middleware: Provide data mapping services, data anonymisation and masking services	Privacy-Preserving Mechanisms Data Interoperability Features	Confidentiality Interoperability
Data Spaces Middleware: Embed business intelligence services for multi-uses based on crosscutting, low power, and software-enabled services	Data Processing & Analytics
Data Spaces Middleware: Integrate an environmental tracking performance system to ensure services operate in a low power mode		Energy Efficiency
Data Spaces Middleware: Provide secure resource efficient data storage services	Data Storage
Data spaces middleware: provide an “High Performance Computing as a service” connector to enable High Performance Computing resources to be accessible to users of the Cloud Federation	Multi-tier Support, Federation, Orchestration	Performance Portability
Data Spaces Middleware: Ensure that Artificial Intelligence (AI) solutions […] can operate over the middleware platform Support sustainable and ultra-low latency digital twins’ business applications Allow the hosting of highly specialised tools for complex business activities simulation, forecasting and modelling	Data Processing & Analytics	Performance
Data Spaces Middleware: Provide secured communication, productivity and collaboration services Provide workflow management services Facilitate the integration with [cloud-to-edge] services and [their] marketplace	Data Pooling & Collaboration
A common European data space brings together relevant data infrastructures and governance frameworks in order to facilitate data pooling and sharing A clear and practical structure for access to and use of data in a fair, transparent, proportionate and/non-discriminatory manner and clear and trustworthy data governance mechanisms	Data Pooling & Collaboration Usage Control Policies Privacy-Preserving Mechanisms Data protection Data governance	Inclusivity Fairness Sustainability Trustworthiness Transparency

Suggested Section: Functional Requirements

Learn about what Functional Requirements define a data space or the relevant technical questions about data spaces covered in the Technical ‘How to’ Information sheets.

Disclaimer: The views expressed are purely those of the authors and may not in any circumstances be regarded as stating an official position of the European Commission.

Page tree

2.1 High-Level Requirement Identification

HIGH-LEVEL REQUIREMENT IDENTIFICATION

ARCHITECTURE DEFINITION

POLICY PROVISIONS ON DATA SPACES