ServiceUpdated on 16 January 2026
Onedata for distributed data ecosystems supporting data & metadata management
IT System Engineer & Researcher @ Onedata.org (Cyfronet AGH) at Polish EOSC Node
Kraków, Poland
About
See it in action: watch the attached video or see http://hdl.handle.net/21.15131/pLrlicyt
**In a nutshell:** Onedata is an open-source software stack used to build a distributed data platform. It integrates the storage resources of data centres and remote repositories into a unified data namespace, enabling collaborative data sharing across organizations. It supports a wide range of backend storage systems and exposes mainstream data access interfaces, like POSIX, Python, REST, and S3, making it easy to integrate with various execution environments. Onedata proves its integration capabilities in EOSC Node | Poland and EOSC DataCommons, where it serves as a unified data layer, bridging heterogeneous data sources with platforms for analysis using a user-friendly logical data namespace.
----------------------
Broader description: Onedata (https://onedata.org) is a data management platform that provides easy and unified access to globally distributed storage resources, supporting a wide range of use cases from personal data management to data-intensive scientific computations. It is an open-source project, started in 2013, and implemented by the team from the Cyfronet Computing Center in Krakow, Poland.
Onedata creates a virtual file system layer spanning geographically dispersed computing centers and data providers that host heterogeneous storage resources. The virtual file system is POSIX-compatible and based on a classic structure of directories and files. The virtualized data can be accessed using multiple interfaces: Web GUI, REST API, CDMI API, fuse-based POSIX mount, Python libraries, or S3. Regardless of the interface, the user gets the same, unified view of all his data.
Onedata uses the concept of spaces for data organization. A space is a logical data volume that appears as a monolithic file system from the user’s PoV. Still, it virtualizes the physical data stored on distributed storage systems of different data providers. Spaces facilitate collaborative data sharing between users and groups across organizational domains — using the Onedata interfaces, users can manage and access the data together in a unified namespace, while it is physically distributed.
The Onedata software can be used to build different ecosystems. Each Onedata ecosystem constitutes an independent data management platform, made up of multiple data centers. One of the flagship examples is EGI DataHub, a Europe-wide ecosystem bringing together 17 data sites and catering for many scientific projects around Europe.
The system is designed to aid with the complete data lifecycle, offering built-in features such as:
- metadata management
- data discovery
- sharing and publishing, with built-in PID/DOI minting
- OAI-PMH interface for metadata harvesting
- integration of remote (external) datasets without copying (by reference)
- archiving / long-term preservation
- data transfers
- replication rules (QoS)
- data-centric automation workflows
More information: https://onedata.org/#/home/documentation
Applies to
- Integrating scientific data repositories
- Federated Compute & Storage
- Federated sync-and-shares
Similar opportunities
Product
Build a distributed data ecosystem with Onedata for seamless collaboration
- VRE
- Hosting
- Use Case
- Piloting
- Onboarding
- Co-development
- Federated sync-and-shares
- Federated Compute & Storage
- Scientific workflows and services
- Integrating scientific data repositories
- Service Catalogues, Interoperability, & Integration
Lukasz Opiola
IT System Engineer & Researcher @ Onedata.org (Cyfronet AGH) at Polish EOSC Node
Kraków, Poland
Partnership
Looking for partners tackling distributed data challenges
- Use case
- Piloting
- Onboarding
- Consulting
- Co-development
- Scientific workflows and services
Lukasz Opiola
IT System Engineer & Researcher @ Onedata.org (Cyfronet AGH) at Polish EOSC Node
Kraków, Poland
Service
- Federated Compute & Storage
- Scientific workflows and services
- Service Catalogues, Interoperability, & Integration
Enol Fernández
Principal Software Architect at EOSC Data Commons
Amsterdam, Netherlands