einfra logoDocumentation

Overview

Onedata logo

About e-INFRA CZ Onedata

What is Onedata?

Onedata is a global data management platform that lets you store, share, and access research data across distributed storage systems — regardless of where the physical storage is located. It presents data from multiple storage providers as a single, unified file system, making it easy to work with large datasets across institutions and borders.

In simple terms: Think of Onedata as a smart cloud storage system designed specifically for research — one that connects storage from multiple locations and lets you access it all from one place.

When to Use Onedata & Its Advantages

When is Onedata a Good Fit?

Onedata is well-suited for:

  • Storing and sharing large research datasets that exceed the capacity of typical cloud storage (e.g., Google Drive, Dropbox)
  • Collaborating with colleagues across different institutions or countries
  • Archiving research outputs that need to be accessible long-term
  • Processing data on HPC (High-Performance Computing) clusters — Onedata can be mounted directly as a file system
  • Publishing datasets so others can access or cite your research data

Key Advantages

FeatureBenefit
Unified accessAccess all your data from one place, no matter where it’s physically stored
High capacityDesigned for large-scale scientific datasets
Flexible sharingShare data with specific users, groups, or make it publicly accessible
Data locality transparencySee which storage provider holds your data
Multiple access methodsWeb portal, desktop sync, command-line, REST API, FUSE mount
Security & access controlFine-grained permissions for files and folders
Integration with HPCCan be used directly with computing resources

Not sure whether to use Onedata or S3? Here is a quick comparison.

Differences between the two:

OnedataS3 Storage
PurposeData management platform across a storage federationRaw data storage
Storage typeMultiple backends with a unifying layer over themObject storage
Access methodsPOSIX FS, S3 API, REST API, CDMI APIS3 API only
Data locationSame data can live in multiple locationsSingle data storage cluster
Other featuresPermissions, groups, roles, replication, inter-site transferTenant/bucket/object model, reliability

When to use:

Onedata
Unified interface across multiple storages.
Multiple access methods.
Integration of already existing storage.
Distributed / shared datasets, operations on data from different locations.
S3 Storage
Backup, archiving.
Fast access, big data.
No need for automatic migration, replication, sharing.
S3 API access is sufficient.

Official documentation at Onedata docs.

Last updated on

publicity banner

On this page

einfra banner