본문으로 건너뛰기

"Filecoin" 태그로 연결된 1개 게시물개의 게시물이 있습니다.

모든 태그 보기

Decentralized Storage Services: Arweave, Pinata, and Comparative Analysis

· 약 79분
Dora Noda
Software Engineer

Decentralized storage networks aim to address issues of data impermanence, censorship, and centralization by distributing data across peer-to-peer networks. Traditional web content is surprisingly ephemeral – for example, studies indicate over 98% of the internet becomes inaccessible after 20 years, highlighting the need for resilient long-term storage. Vendors like Arweave and Pinata (built on IPFS) have emerged to offer permanent or distributed storage solutions, alongside others such as Filecoin, Storj, Sia, Ceramic, and the underlying IPFS protocol. This report analyzes these services in terms of: (1) technical architecture and capabilities, (2) pricing models, (3) developer experience, (4) user adoption, (5) ecosystem maturity, and (6) key use cases (e.g. NFT metadata hosting, dApp backends, archival data, content delivery). Comparative tables and examples are provided to illustrate differences. All sources are linked to official documentation or authoritative analyses.

1. Technical Capabilities and Architecture

Arweave: Arweave is a blockchain-like permanent storage network built on a novel Blockweave data structure. Unlike traditional blockchains that link blocks linearly, Arweave’s blockweave links each block to its immediate predecessor and a random earlier block, creating a web-like structure. This design (coupled with a Succinct Proof of Random Access (SPoRA) consensus) means miners must verify random old data to mine new blocks, incentivizing them to store as much of the archive as possible. The result is high redundancy – in fact, there are currently approximately 200 replicas of the entire Arweave dataset distributed globally. Data uploaded to Arweave becomes part of this “Permaweb” and is immutable and permanent. To improve performance and scalability, Arweave uses Bundling (combining many small files into one transaction) to handle large data throughput (e.g. an Arweave bundler once stored 47 GB of data in a single tx). A Wildfire mechanism ranks nodes by responsiveness to encourage fast data propagation across the network. Overall, Arweave acts as a decentralized hard drive – storing data permanently on-chain, with the expectation that storage costs will keep dropping so that miners can be paid forever from an upfront endowment.

IPFS and Pinata: The InterPlanetary File System (IPFS) provides a content-addressed, peer-to-peer file system for distributed data storage and sharing. Data on IPFS is identified by a content hash (CID) and retrieved via a global distributed hash table (DHT). By design, IPFS itself is file-sharing infrastructure – it does not guarantee persistence of data unless nodes explicitly continue to host (“pin”) the content. Services like Pinata build on IPFS by providing pinning and bandwidth: Pinata runs IPFS nodes that pin your data to keep it available, and offers a fast HTTP gateway with CDN integration for quick retrieval (often called “hot storage” for frequently accessed data). Technically, Pinata’s architecture is centralized cloud infrastructure backing the decentralized IPFS network – your files are distributed via IPFS (content-addressed, and retrievable by any IPFS peer), but Pinata ensures high availability by keeping copies on their servers and caching through dedicated gateways. Pinata also offers private IPFS networks (for isolated use), an IPFS-backed key-value data store, and other developer tools, all of which leverage IPFS under the hood. In summary, IPFS+Pinata provides a decentralized storage protocol (IPFS) with a managed service layer (Pinata) to handle reliability and performance.

Filecoin: Filecoin is often considered the incentive layer for IPFS. It’s a blockchain-powered decentralized storage network where storage providers (miners) rent out disk space in an open market. Filecoin uses a novel Proof-of-Replication (PoRep) to ensure a miner has saved unique copies of client data, and Proof-of-Spacetime (PoSt) to continuously verify that the data remains stored over time. These proofs, built on zero-knowledge proofs, are recorded on Filecoin’s blockchain, giving a cryptoeconomic guarantee that data is being stored as agreed. The Filecoin network is built on IPFS technology for content addressing and data transfer, but adds smart contracts (“storage deals”) enforced on-chain. In a storage deal, a user pays a miner in Filecoin (FIL) to store data for a specified duration. Miners put up collateral that can be slashed if they fail to prove storage, ensuring reliability. Filecoin does not automatically make data public; users typically combine it with IPFS or other retrieval networks for content delivery. It is scalable and flexible – large files can be split and stored with multiple miners, and clients can choose redundancy level by making deals with multiple providers for the same data to guard against node failure. This design favors bulk storage: miners optimize for large datasets, and retrieval speed might involve separate “retrieval miners” or use of IPFS caches. In essence, Filecoin is like a decentralized Amazon S3 + Glacier: a storage marketplace with verifiable durability and user-defined redundancy.

Storj: Storj is a distributed cloud object storage network that does not use a blockchain for consensus but instead coordinates storage through a decentralized network of nodes and a satellite metadata service. When a file is uploaded to Storj (via their service called Storj DCS – Decentralized Cloud Storage), it is first client-side encrypted and then erasure-coded into 80 pieces (by default) such that only a subset (e.g. 29 of 80 pieces) is needed to reconstruct the file. These encrypted pieces are distributed to diverse Storage Nodes all over the world (each node only holds random fragments, not useful data by itself). This gives Storj extremely high durability (claimed 11×9s durability – 99.999999999% data survival) and also parallelism in downloads – a user fetching a file can retrieve pieces from dozens of nodes simultaneously, often improving throughput. Storj uses a proof-of-retrievability concept (storage nodes periodically audit that they still have their pieces). The network operates on a zero-trust model with end-to-end encryption: only the file owner (who holds the decryption key) can read the data. The architecture has no central data center – instead it taps into existing excess disk capacity provided by node operators, which improves sustainability and global distribution (Storj notes this yields CDN-like performance and much lower carbon footprint). Coordination (file metadata, payment) is handled by “satellites” run by Storj Labs. In summary, Storj’s technical approach is encrypted, sharded, and distributed object storage, delivering high redundancy and download speeds comparable to or better than traditional CDNs, without a blockchain consensus but with cryptographic audits of storage.

Sia: Sia is another decentralized cloud storage platform, utilizing its own blockchain and cryptocurrency (Siacoin) to form storage contracts. Sia splits files into 30 encrypted shards using Reed–Solomon erasure coding, and requires any 10 of those shards to recover the file (providing built-in 3x redundancy). Those shards are stored on independent hosts across the network. Sia’s blockchain is Proof-of-Work and is used to enforce smart contracts between renters and hosts. In a Sia storage contract, the renter locks up Siacoin for a period and the host puts up collateral; the host must periodically submit storage proofs (similar in spirit to Filecoin’s proofs) that they are storing the data, or they lose their collateral. At contract end, hosts are paid from the escrowed funds (and a small portion goes to Siafund holders as a protocol fee). This mechanism ensures hosts have economic incentive and penalties to reliably store data. Sia’s design emphasizes privacy (all data is end-to-end encrypted; hosts cannot see user files) and censorship-resistance (no central server). Like Storj, Sia enables parallel downloads of file pieces from multiple hosts, improving speed and uptime. However, Sia does require users to renew contracts periodically (default contracts last 3 months) to maintain storage, meaning data is not “permanent” unless the user continually pays. Sia has also introduced a layer called Skynet (earlier) for web-centric use: Skynet provided content addressing (via “skylinks”) and web portals for easy retrieval of Sia-hosted content, effectively acting as a decentralized CDN for Sia files. In summary, Sia’s architecture is blockchain-secured cloud storage with strong redundancy and privacy, suitable for “hot” data (fast retrieval) in a decentralized manner.

Ceramic: Ceramic is a bit different – it is a decentralized network for mutable data streams rather than bulk file storage. It targets use-cases like dynamic JSON documents, user profiles, identities (DIDs), social content, etc. that need to be stored in a decentralized way but also updated frequently. Ceramic’s protocol uses cryptographically signed events (updates) that are anchored to a blockchain for ordering. In practice, data on Ceramic is stored as “streams” or smart documents – each piece of content lives in a stream that can be updated by its owner (with verifiable history of versions). Under the hood, Ceramic uses IPFS for content storage of each update, and an event log is maintained so that all nodes can agree on the latest state of a document. The consensus comes from anchoring stream updates onto an underlying blockchain (originally Ethereum) to get an immutable timestamp and ordering. There is no native token; nodes simply replicate data for the dapps using Ceramic. Technical features include DID (decentralized identity) integration for update authentication and global schemas (data models) to ensure interoperable formats. Ceramic is designed to be scalable (each stream’s state is maintained independently, so there’s no global “ledger” of all data, avoiding bottlenecks). In summary, Ceramic provides decentralized databases and mutable storage for Web3 applications – it’s complementary to the file storage networks, focusing on structured data and content management (whereas networks like Arweave/Filecoin/Storj focus on static file blobs).

Summary of Architectures: The table below compares key technical aspects of these systems:

ProjectArchitecture & MechanismData PersistenceRedundancyPerformance
ArweaveBlockchain “Blockweave”; Proof of Access (SPoRA) consensus. All data on-chain (permaweb).Permanent (one-time on-chain storage).Very high – essentially 200+ full replicas across network (miners store old blocks to mine new ones).Write: moderate (on-chain tx, bundling helps throughput); Read: via gateways (decentralized web, slightly slower than CDN).
IPFS (protocol)P2P content-addressed file system; DHT for locating content. No built-in consensus or payments.Ephemeral (content persists only if pinned on some node).Configurable – depends on how many nodes pin the data. (No default replication).Write: immediate add on local node; Read: potentially fast if content is nearby, otherwise needs DHT discovery (can be slow without a pinning service).
Pinata (service)Managed IPFS pinning cluster + HTTP gateways. Centralized cloud ensures files stay online, built on IPFS protocol.As long as Pinata (or user’s nodes) pins the data (subscription-based persistence).Pinata likely stores multiple copies across their infrastructure for reliability (details proprietary).Write: fast uploads via API/SDK; Read: fast CDN-backed gateway (suitable for hot content).
FilecoinBlockchain with Proof-of-Replication + Proof-of-Spacetime. Content addressed (IPFS), deals via smart contracts.User-defined duration (e.g. 6 months or 2 years deals, extendable). Not permanent unless continuously renewed.User can choose number of copies (deals with multiple miners) – e.g. NFT.Storage uses 6× redundancy for each NFT file. Network capacity is huge (EB scale).Write: batched into sectors, higher latency for initial storage; Read: not instantaneous unless data cached – often served via IPFS gateways or emerging retrieval nodes (Filecoin is improving here).
StorjDistributed cloud with erasure coding (80 pieces per file) and audits (proofs of retrievability). Central coordination via Satellites (not blockchain).As long as the user pays for service (data automatically repaired if nodes drop). Providers are paid in STORJ tokens or USD.Very high – 80 shards spread globally; file tolerates ~50/80 node failures. Network auto-heals by replicating shards if a node quits.Write: high throughput (uploads are parallelized to many nodes); Read: very fast – downloads pull from up to 80 nodes, and automatically skip slow nodes (“long-tail elimination” for performance).
SiaBlockchain with smart contracts for storage. Erasure-coded 30-of-10 scheme; Proof-of-Work chain for contract enforcement.Time-bound contracts (typically 3 months); users renew to maintain storage. Not perpetual by default.~3× redundancy (30 shards for 10 needed). Hosts may geographically diversify; network also replicates shards to new hosts if one goes offline.Write: moderate (uploads require forming contracts and splitting data); subsequent updates need renewing contracts. Read: fast parallel fetch from 10+ hosts; Skynet HTTP portals enabled CDN-like retrieval for public data.
CeramicEvent stream network on top of IPFS; data updates anchored periodically to a blockchain for ordering. No mining – nodes replicate streams of interest.Data exists as long as at least one node (often developer-run or community) stores the stream. No token incentives (uses a community-run model).Depending on adoption – popular data models likely on many nodes. Generally not for large files, but for pieces of data in many apps (which encourages widespread replication of shared streams).Write: near-real-time for updates (just needs to propagate to a few nodes + anchor, which is efficient); Read: fast, queryable via indexing nodes (some use GraphQL). Ceramic is optimized for many small transactions (social posts, profile edits) at web scale.

2. Pricing Models

Despite similar goals of decentralized storage, these services use different pricing and economic models:

  • Arweave Pricing: Arweave requires a one-time upfront payment in AR tokens to store data *forever*. Users pay for at least 200 years of storage for the data, and the protocol places ~86% of that fee into an endowment fund. The endowment’s accrual (through interest and appreciating value of AR) is designed to pay storage miners indefinitely, under the assumption that hardware costs decline over time (historically ~30% cheaper per year). In practical terms, the price fluctuates with AR’s market price, but as of 2023 it was around $3,500 per 1 TB one-time (note: this buys permanent storage, whereas traditional cloud is a recurring cost). Arweave’s model shifts burden upfront: users pay more initially, but then nothing thereafter. This can be costly for large data, but it guarantees permanence without needing to trust a provider in the future.
  • Pinata (IPFS) Pricing: Pinata uses a subscription model (fiat pricing) common in Web2 SaaS. It offers a Free tier (up to 1 GB storage, 10 GB/month bandwidth, 500 files) and paid plans. The popular “Pinata *Picnic*” plan is $20/month which includes 1 TB of pinned storage and 500 GB bandwidth, with overage rates of ~$0.07 per GB for storage and $0.10/GB for bandwidth. A higher “Fiesta” plan at $100/month raises this to 5 TB storage and 2.5 TB bandwidth, with even cheaper overages. All paid tiers include features like custom gateways, increased API request limits, and collaboration (multi-user workspaces) at additional cost. There is also an enterprise tier with custom pricing. Pinata’s costs are thus predictable monthly fees, similar to cloud storage providers, and not token-based – it abstracts IPFS into a familiar pricing structure (storage + bandwidth, with free CDN caching in gateways).
  • Filecoin Pricing: Filecoin operates as an open market, so prices are determined by supply and demand of storage miners, typically denominated in the native FIL token. In practice, due to abundant supply, Filecoin storage has been extremely cheap. As of mid-2023, storing data on Filecoin costs on the order of $2.33 for 1 TB per year – significantly cheaper than centralized alternatives (AWS S3 is ~$250/TB/yr for frequently accessed storage) and even other decentralized options. However, this rate is not fixed – clients post bids and miners offer asks; the market price can vary. Filecoin storage deals also have a specified duration (e.g. 1 year); if you want to keep data beyond the term, you must renew (pay again) or make long duration deals up front. There is also a concept of Filecoin Plus (FIL+), an incentive program that gives “verified” clients (storing useful public data) a bonus to attract miners at lower effective cost. In addition to storage fees, users may pay small FIL for retrieval on a per-request basis, though retrieval markets are still developing (many rely on free retrieval via IPFS for now). Importantly, Filecoin’s tokenomics (block rewards) heavily subsidize miners – block rewards in FIL supplement the fees paid by users. This means today’s low prices are partly due to inflationary rewards; over time, as block rewards taper, storage fees may adjust upward. In summary, Filecoin’s pricing is dynamic and token-based, generally very low cost per byte, but users must manage renewals and FIL currency risk.
  • Storj Pricing: Storj is priced in traditional currency terms (though payments can be made in fiat or STORJ token). It follows a usage-based cloud pricing model: currently $4.00 per TB-month for storage, and $7.00 per TB of egress bandwidth. In granular terms, that is $0.004 per GB-month for data stored, and $0.007 per GB downloaded. There is also a tiny charge per object (segment) stored to account for metadata overhead (about $0.0000088 per segment per month), which only matters if you store millions of very small files. Notably, ingress (uploads) is free, and Storj has a policy of waiving egress fees if you decide to migrate out (to avoid vendor lock-in). Storj’s pricing is transparent and fixed (no bidding markets), and substantially undercuts traditional cloud (they advertise ~80% savings vs AWS, due to no need for regional replication or large data center overhead). End-users don’t have to interact with tokens if they don’t want to – you can simply pay your usage bill in USD. Storj Labs then compensates node operators with STORJ tokens (the token supply is fixed and operators bear some price volatility). This model makes Storj developer-friendly in pricing while still leveraging a token for the decentralized payouts under the hood.
  • Sia Pricing: Sia’s storage market is also algorithmic and token-denominated, using Siacoin (SC). Like Filecoin, renters and hosts agree on prices via the network’s market, and historically Sia has been known for extremely low costs. In early years, Sia advertised storage at ~$2 per TB per month, though actual prices depend on host offerings. One Reddit community calculation in 2020 found the true cost around $1-3/TB-month for renters, excluding redundancy overhead (with redundancy, effective cost might be a few times higher, e.g. $7/TB-month when accounting for the 3x redundancy) – still very cheap. As of Q3 2024, storage prices on Sia rose ~22% QoQ due to increased demand and SC token fluctuations, but remain far below centralized cloud prices. Renters on Sia also need to allocate some SC for bandwidth (upload/download) and collateral. The economics are such that hosts compete to offer low prices (since they want to attract contracts and earn SC), and renters benefit from that competition. However, because using Sia requires operating a wallet with Siacoin and dealing with contract setup, it’s a bit less user-friendly to calculate costs than, say, Storj or Pinata. In short, Sia’s costs are token-market-driven and very low per TB, but the user must continually pay (with SC) to extend contracts. There is no upfront lump-sum for perpetuity – it’s a pay-as-you-go in crypto form. Many users obtain SC through an exchange and then can lock in contracts for months of storage at predetermined rates.
  • Ceramic Pricing: Ceramic does not charge for usage at a protocol level; there is no native token or fee to create/update streams beyond the minor gas cost of anchoring updates on the Ethereum blockchain (which is typically handled by Ceramic’s infrastructure and is negligible per update when batched). Running a Ceramic node is an open activity – anyone can run one to index and serve data. 3Box Labs (the team behind Ceramic) did offer a hosted service for developers (Ceramic Cloud), which might introduce enterprise pricing for convenience, but the network itself is free to use aside from the effort of running a node. Thus, the “price” of Ceramic is mainly the operational cost developers incur if they self-host nodes or the trust cost if using a third-party node. In essence, Ceramic’s model is more akin to a decentralized database or blockchain RPC service – monetization (if any) is through value-added services, not micropayments for data. This makes it attractive for developers to experiment with dynamic data storage without needing a token, but it also means ensuring long-term node support (since altruistic or grant-based nodes are providing the storage).

Pricing Summary: The table below summarizes the pricing and payment models:

ServicePricing ModelCost ExamplePayment MediumNotes
ArweaveOne-time upfront fee for perpetual storage.~$3,500 per TB once (for indefinite storage). Smaller files cost proportionally (e.g. ~$0.035 per MB).AR token (crypto).86% of fee to endowment for future miner incentives. No recurring fees; user shoulders cost upfront.
PinataSubscription tiers + usage overages.Free: 1 GB; $20/mo: 1 TB storage + 0.5 TB bandwidth included; $100/mo: 5 TB + 2.5 TB BW. Overages: ~$0.07/GB storage, $0.08-0.10/GB egress.USD (credit card) – no crypto required.Simple Web2-style pricing. Billed monthly. “Unlimited files” (count) on paid plans, just limited by total GB. Enterprise plans available.
FilecoinOpen market bidding with prices in FIL. Block rewards subsidize storage (low user cost).~$2.33 per TB/year (market rate mid-2023). Prices vary; some miners even offer near-zero cost for verified data (earning mainly block rewards).FIL cryptocurrency. Some services (e.g. NFT.storage) abstract this and offer “free” storage backed by Filecoin deals.Renewal needed at contract end (e.g. 1 year). Users must maintain FIL balance. Network has huge supply, keeping prices low. Retrieval deals (if any) also in FIL.
StorjFixed utility pricing (usage-based).$4.00 per TB-month storage, $7.00 per TB egress. Free ingress, free repair, minimal per-file metadata fee.USD (can pay by credit card or STORJ token; payouts to node operators in STORJ).Post-pay billing (with credits for free tier/trial). Clear predictable costs and significantly cheaper than AWS/Google Cloud.
SiaDecentralized market in Siacoin.~$1–3 per TB/month historically (excluding redundancy overhead). With 3× redundancy, effective ~$3–7/TB/month to the user.Siacoin (SC) cryptocurrency. Users must acquire SC to form contracts.No set price – user software auto-chooses hosts by price. Very cheap, but requires ongoing payments (e.g. fund an allowance for N months). Hosts may also charge for bandwidth in SC.
CeramicNo direct fees for data – open network.N/A (No cost per stream or per update; you mainly pay indirectly for any Ethereum tx fees for anchoring, often cents).N/A (Protocol has no token; some nodes might charge for hosting data on behalf of users, but core is free).Ceramic is run by community and the developing company’s nodes. Pricing is not an obstacle – monetization could come from SaaS offerings around Ceramic (if using a hosted API endpoint, e.g. Infura-style).

3. Developer Experience

A key factor for adoption is how easily developers can integrate these storage solutions – via APIs, SDKs, documentation, and tooling:

  • Arweave Developer Experience: Arweave provides a graphQL API endpoint (at arweave.net/graphql) which allows querying the permaweb for transactions and data – developers can search stored content by tags, wallet addresses, etc. There are official SDKs like Arweave.js for browser and Node.js that simplify uploading files and posting transactions to the network. For example, a developer can use the Arweave SDK to bundle and upload a file with just a few lines of code. Because each upload is an on-chain transaction, the UX for large-scale uploads was historically challenging, but the introduction of Bundlr (Bundlr Network) has greatly improved throughput. Bundlr (now rebranded to “Iris” for Arweave scaling) is essentially a network of bundling nodes that let developers pay once and upload many files off-chain, then periodically commit them to Arweave in bulk. This allows dApps (especially NFT platforms) to upload thousands of files quickly without spamming the chain, while still getting eventual permanence. Arweave’s tooling ecosystem also includes Arweave Deploy CLI, and ArDrive (a user-friendly app for file management on Arweave). The Permaweb concept extends to hosting web apps – developers can deploy HTML/JS to Arweave via tools like Ardor or the Web3 bundler, and have it available at a permanent URL. Documentation for Arweave is extensive, covering how to price uploads (there’s even a calculator), how to retrieve data (via gateways or running a lightweight node), and community-made “cookbooks” for common tasks. One learning curve is handling the wallet key for signing transactions; Arweave uses RSA-based keys that developers manage (though web wallets and cloud key management solutions exist). Overall, dev experience is improving as Arweave matures, with reliable SDKs, a straightforward REST-like interface (GraphQL), and community tooling. One noteworthy aspect: since users pay in AR, developers must integrate a crypto payment flow – some solve this by pre-paying for users or using third-party services that accept credit cards and convert to AR.
  • Pinata Developer Experience (IPFS): Pinata is built with developers in mind – its slogan is “Add IPFS file uploads and retrieval in minutes” and it provides a simple REST API and a robust JavaScript SDK. For instance, using Node.js, a dev can npm install @pinata/sdk and then do pinata.pinFileToIPFS(file) or the newer pinata.upload methods to store files on IPFS via Pinata’s service. The SDK handles authentication (Pinata uses API keys or JWTs) and abstracts away running any IPFS node. Pinata’s documentation is clear, with examples for uploading files, pinning by CID (if the content is already on IPFS), and managing pins (unpinning, pin status, etc.). It also supports a content gateway: developers can use a custom subdomain (e.g. myapp.mypinata.cloud) to serve content over HTTP, with built-in CDN and even image optimization. This means devs can treat IPFS-stored images almost like they would with Cloudinary or Imgix (Pinata’s image optimizer can resize/crop on the fly via URL parameters). Pinata recently introduced features like “Pinata KV” (key-value storage for JSON or metadata, useful alongside file storage) and Access Controls (to set content as public or restricted). These higher-level features make it easier to build full applications. Additionally, since Pinata is just interfacing with IPFS, developers maintain the flexibility to leave – they can always take a CID pinned via Pinata and pin it elsewhere (or on their own node) since IPFS is interoperable. Pinata’s support (guides, community) is well-regarded, and they even partner with Protocol Labs on initiatives like NFT.Storage migration (providing guides to help users move data between services). For those wanting not to touch crypto at all, Pinata is ideal – no blockchain to integrate, just simple API calls and a credit card. The flip side is less decentralization for the integration itself, since you rely on Pinata’s availability and service quality (though your content is still hash-addressed and replicable on IPFS). In summary, Pinata offers excellent DX: easy setup, comprehensive docs, SDKs, and features (gateway, CDN, analytics) that abstract the complexities of IPFS.
  • Filecoin Developer Experience: Using Filecoin directly can be complex – it traditionally required running a Filecoin node (e.g. Lotus) and dealing with concepts like sectors, deals, miners, etc. However, the ecosystem has created many developer-facing services and libraries to simplify it. Notably, web3.storage and NFT.storage (by Protocol Labs) allow developers to store data on IPFS with Filecoin backup without needing to handle any FIL tokens or deal mechanics. These services provide a simple API (similar to Pinata’s) – e.g. an NFT project can call NFT.storage’s API to upload an image and metadata; NFT.storage will pin it on IPFS and make Filecoin deals with multiple miners to store it long-term, all free of charge (subsidized by PL). This has been a game-changer for dev adoption in the NFT space. Beyond that, there are tools like Estuary, Powergate (from Textile), and Glacier that offer developer-friendly gateways to Filecoin storage. There’s also a growing ecosystem around the Filecoin Virtual Machine (FVM), which launched in 2023, enabling smart contracts on Filecoin – developers can now write programs that run on the Filecoin blockchain, opening up possibilities for data-centric dApps (like auto-renewing storage deals, or incentivizing retrieval). For basic storage and retrieval, most devs will use either an IPFS layer on top (thus treating Filecoin as “cold storage” backup) or a hosted solution. It’s worth noting that because Filecoin is an open network, many third-party services exist: e.g. Lighthouse.storage offers a “pay once, store forever” service built on Filecoin (it charges an upfront fee and uses an endowment concept much like Arweave, but implemented via Filecoin deals). For developers who want more control, the Filecoin documentation provides libraries (in Go, JavaScript, etc.) to interact with the network, and there are frameworks like Slate (for building user-facing storage apps) and Space (Fleek’s Filecoin+IPFS user storage SDK). The learning curve is higher than for Pinata or Storj, especially if going low-level – devs must understand content addressing (CIDs), deal lifecycle, and possibly run an IPFS node for fast retrieval. The IPFS docs emphasize that IPFS and Filecoin are complementary; indeed, a dev using Filecoin will nearly always pair it with IPFS for actual data access in their app. So effectively, a Filecoin developer experience often becomes an IPFS developer experience with additional steps for persistence. The ecosystem is large: as of 2022 there were 330+ projects built on Filecoin/IPFS, spanning NFTs, Web3 gaming, metaverse storage, video, and more. This means abundant community examples and support. In summary, Filecoin’s DX ranges from turnkey (NFT.storage) to highly customizable (Lotus and FVM) – it is powerful but can be complex, though the availability of free IPFS+Filecoin storage services has eased adoption for many common use cases.
  • Storj Developer Experience: Storj DCS positions itself as a drop-in replacement for traditional object storage. It offers an S3-compatible API – meaning developers can use familiar AWS S3 SDKs or tools (boto3, etc.) by simply pointing the endpoint to Storj’s gateway. This drastically lowers the barrier to entry, as virtually any software that works with S3 (backup tools, file browsers, etc.) can work with Storj with minimal config changes. For those who prefer using Storj’s native interfaces, they provide libraries (in Go, Node, Python, etc.) and a CLI called uplink. The documentation on storj.io and storj.dev is thorough, including example code for common tasks (upload, download, sharing, setting access grants). One unique feature is Storj’s access grant tokens – a security mechanism that encapsulates encryption keys and permissions, enabling client-side trust: a dev can create a limited permission token (say read-only access to a certain bucket) to embed in an app, without exposing root keys. This is developer-friendly for creating sharable links or client-side uploads directly to the network. Storj’s dashboard helps monitor usage, and their support resources (community forum, Slack/Discord) are active with both devs and node operators. Integration guides with third-party services exist – for example, FileZilla (the FTP client) integrated Storj so users can drag-and-drop files to Storj like any server. Rclone, a popular command-line sync tool, also supports Storj out-of-the-box, making it easy for developers to incorporate Storj into data pipelines. Because Storj handles encryption automatically, developers don’t need to implement that themselves – but it also means if they lose their keys, Storj can’t recover the data (a trade-off for zero-trust security). Performance-wise, devs might notice that uploading many tiny files has overhead (due to the segment fee and erasure coding), so best practice is to pack small files together or use multipart upload (similar to how one would use any cloud storage). The learning curve is quite small for anyone familiar with cloud storage concepts, and many are: Storj intentionally mirrors the AWS developer experience where possible (SDKs, docs) but offers the decentralized backend. In essence, Storj provides a familiar DX (S3 API, well-documented SDKs) with the benefits of encryption and decentralization – making it one of the smoother onboarding experiences among decentralized storage options.
  • Sia Developer Experience: Sia historically required running a Sia client (daemon) on your machine, which exposes a local API for uploads and downloads. This was manageable but not as convenient as cloud APIs – developers had to incorporate a Sia node in their stack. The Sia team and community have worked on improving usability: for instance, Sia-UI is a desktop app for manual file uploading, and libraries like sia.js exist for interacting with a local node. However, the more significant DX improvement came with Skynet, introduced in 2020. Skynet allowed developers to use public web portals (like siasky.net, skyportal.xyz, etc.) to upload data without running a node; these portals handle the Sia interaction and give back a Skylink (a content hash/ID) that can be used to retrieve the file from any portal. This made using Sia storage as easy as an HTTP API – one could curl a file to a Skynet portal and get a link. Additionally, Skynet enabled hosting web apps (similar to Arweave’s permaweb) – developers built dApps like SkyID (decentralized identity), SkyFeed (social feed), and even entire app marketplaces on Skynet. From a developer standpoint, Skynet’s introduction meant you didn’t have to worry about Siacoin, contracts, or running nodes; you could rely on community-run portals (some free, some commercial) to handle the heavy lifting. There were also SDKs (SkyNet JS, etc.) for integrating this into web apps. The challenge, however, is that the primary backer of Skynet (Skynet Labs) shut down in 2022 due to funding issues, and the community and Sia Foundation have been working to keep the concept alive (open-sourcing portal code, etc.). As of 2025, Sia’s developer experience is bifurcated: if you want maximum decentralization, you run a Sia node and deal with SC and contracts – powerful but relatively low-level. If you want ease of use, you might use a gateway service like Filebase or Skynet portals (if available) to abstract that. Filebase, for instance, is a service that provides an S3-compatible API but actually stores data on Sia (and now other networks too); so a developer could use Filebase like they would Storj or AWS, and under the hood it handles Sia’s mechanics. In terms of docs, Sia has improved its documentation and has an active community channel. They also offer a Host ranking (HostScore) and network stats (SiaStats/SiaGraph) so developers can gauge network health. Another new initiative in Sia is the S5 project, which aims to present Sia storage in a content-addressed way akin to IPFS (with compatibility for S3 too) – this suggests ongoing efforts to streamline developer interaction. Overall, Sia’s DX has historically lagged some others due to the need to handle a blockchain and currency, but with Skynet and third-party integrations, it’s become easier. Developers valuing privacy and control can use Sia with some effort, while others can leverage services on top of Sia for a smoother experience.
  • Ceramic Developer Experience: Ceramic targets web3 dApp developers, especially those building social features, identities, or dynamic content. Developers interact with Ceramic by running a Ceramic node or using a hosted node (offered by 3Box Labs or community providers). The key concept is “ComposeDB”, a semantic data layer for Ceramic: devs can define a data model (schema) for their application’s data (e.g. a profile model with name, avatar, etc.), and then use GraphQL queries to store and retrieve that data from Ceramic. Essentially, Ceramic feels like using a database that’s global and decentralized. The Ceramic team provides a CLI and SDK to help bootstrap applications – for example, glaze/JS to manage data models and self.id (an identity SDK) for authenticating users with their crypto wallets/DIDs to control their data. Because it’s relatively new, the tooling is still evolving, but there’s solid documentation and a growing set of example apps (for social networks, blog platforms, credential storage, etc.). One important part of Ceramic DX is DID (Decentralized ID) integration: every update to data is signed by a DID, often using IDX (Identity Index) which 3Box Labs built to manage user identity data across streams. For developers, this means you often incorporate a library like did-js to authenticate users (commonly via their Ethereum wallet, which gives a DID using Ceramic’s did:3 method). Once authenticated, you can read/write that user’s data in Ceramic streams as if it were any database. The learning curve here is understanding decentralized identity and the concept of streams vs tables. However, those familiar with web development will find that ComposeDB’s GraphQL abstractions make it quite natural – you can query Ceramic for all posts in a blog app, for instance, using a GraphQL query that the Ceramic node resolves by looking at the relevant streams. Ceramic’s documentation covers “How it Works”, and emphasizes that it’s not for large files – rather, you store references to IPFS or Arweave for large media, and use Ceramic for metadata, indexes, and user-generated content. In practice, a dApp might use Ceramic for things like user profiles or comments (so they can be updated and shared across platforms), and use Filecoin/IPFS for the big files like images or videos. The community around Ceramic is active, with hackathons and grants, and tools like Orbis (a decentralized Twitter-like protocol built on Ceramic) provide higher-level SDKs for social features. In summary, Ceramic offers a high-level, Web3-native DX: developers work with DIDs, models, and GraphQL, which is quite different from low-level storage management – it’s more akin to building on a decentralized Firebase or MongoDB. For those use cases that need mutable, interoperable data, the developer experience is cutting-edge (if a bit bleeding-edge), and for others it may be unnecessary complexity.

4. User Adoption and Usage Metrics

Assessing adoption of decentralized storage is multifaceted: we consider data stored, number of users/developers, notable use cases or partners, and market share. Below we compile key adoption metrics and examples for each:

  • Arweave Adoption: Arweave’s network, launched in 2018, stores a smaller total volume of data compared to Filecoin but has carved out a critical niche in permanent storage. As of early 2023, approximately 140 TB of data was stored on the Arweave permaweb. While that is orders of magnitude less than Filecoin, Arweave emphasizes that this data is fully paid-up and permanently preserved. The rate of growth has been steady – developers and archival projects contribute data ranging from web pages (e.g. Arweave is used to archive webpages via the “archivist” community, akin to a decentralized Wayback Machine) to blockchain history (the Solana blockchain, for instance, uses Arweave to offload its historical data). A significant adoption milestone: Meta (Facebook) integrated Arweave in 2022 to permanently store Instagram’s NFT digital collectible media, signaling trust from a Web2 giant in Arweave’s permanence. (Though Meta later halted the NFT initiative, the fact remains they chose Arweave for immutable storage.) In the blockchain world, Solana’s NFT platform Metaplex uses Arweave to store NFT metadata and assets – Solana’s popular Candy Machine standard automatically uploads media to Arweave for permanence. This has resulted in millions of NFTs referencing Arweave URIs (often via arweave.net). Another example: KYVE, a Web3 archiving project, launched its mainnet on Arweave and by late 2023 had uploaded over 2,000 TB (2 PB) of data to Arweave – notably huge, this includes snapshots of other blockchains and datasets. Arweave’s ecosystem counts hundreds of developers; the official website Ar.io notes an endowment of 44,000+ AR accumulated by Jan 2023 to sustain storage. On social metrics, Arweave’s community is strong among NFT creators and archival enthusiasts – the term “permaweb” has become synonymous with preserving NFT artwork, web content (e.g. mirror.xyz uses Arweave to store decentralized blog posts permanently), and even permaweb-based applications (email, forums). Arweave has received backing from major crypto VCs and its founder Sam Williams is a prominent figure advocating for data permanence. While not as large in raw bytes, Arweave’s adoption is high-impact: it’s used wherever guaranteed permanence is required. It is also integrated into many Web3 stacks indirectly (for example, Ledger’s hardware wallet uses Arweave to store some NFT provenance data, and The Graph indexing protocol can use Arweave for storing subgraph data). In summary, Arweave’s adoption is strong in the NFT and blockchain metadata space, in permanent web archives, and has growing enterprise interest for long-term records. The current network utilization (140+ TB) may seem small, but each byte is intended to last forever, and usage has been accelerating.
  • Pinata and IPFS Adoption: IPFS is arguably the most widely adopted decentralized storage technology by sheer numbers, as it’s free and open for anyone to use. It’s hard to measure IPFS “storage” since anyone can run a node and add content – but it’s pervasive in the Web3 world. Pinata, as one of the leading IPFS pinning services, offers a window into IPFS usage by developers. Pinata’s website touts “Trusted by 600,000+ developers” – a huge number reflecting its popularity, likely boosted by the 2021 NFT boom when many projects used Pinata to host NFT assets. From indie artists using Pinata’s free tier to major NFT marketplaces integrating Pinata for content delivery, the service has become an industry standard. The NFT.Storage team noted in 2023 that “Pinata has been a trusted name in the IPFS community since 2018, powering many top projects and marketplaces.”. This includes well-known NFT platforms, game developers, and even some DeFi projects that needed to serve frontend assets over IPFS. For example, OpenSea (the largest NFT marketplace) uses IPFS for many stored assets and has at times recommended pinning services like Pinata to NFT creators for ensuring their content’s availability. Many profile-picture NFT collections (from CryptoPunks derivatives to countless generative art sets on Ethereum) use IPFS CIDs for images, and it’s common to find Pinata’s gateway URLs in token metadata. Pinata hasn’t publicly released stats on total data pinned, but anecdotally it is responsible for pinning petabytes of NFT data. Another dimension: IPFS is integrated into web browsers (Brave, Opera) and has a global peer network; Pinata’s role in that is as a reliable backbone for content hosting. Because IPFS is free to use self-hosted, Pinata’s large user count indicates that many developers prefer the convenience and performance it adds. Pinata also has enterprise users in media and entertainment (for example, some music NFT platforms used Pinata to manage audio content). It’s worth noting IPFS adoption extends beyond Pinata: competitors like Infura’s IPFS service, Cloudflare’s IPFS gateway, and others (Temporal, Crust, etc.) also contribute, but Pinata is among the most prominent. Summarily, IPFS is ubiquitous in Web3, and Pinata’s adoption reflects that ubiquity – it’s a backbone for NFT and dApp content, with hundreds of thousands of users and integration in production apps worldwide.
  • Filecoin Adoption: Filecoin has seen the largest uptake in terms of raw storage capacity. It reportedly has 22 exabytes (22,000,000+ TB) of available storage in its network, of which about 3% (660+ PB) was utilized by mid-2023. (By comparison, that used storage is three orders of magnitude above Arweave’s, showing Filecoin’s focus on big data.) Much of this capacity comes from large-scale miners; however, useful stored data has also grown significantly thanks to programs like Filecoin Plus. By early 2022, 45 PiB (~45,000 TB) of real data was stored, and it has likely grown much more since then as large archives onboard data. In terms of users, Filecoin’s adoption is bolstered by ecosystem projects: for instance, NFT.storage (which uses Filecoin under the hood) has over 150 million NFT assets uploaded as of 2023. Many NFT marketplaces rely on NFT.storage or similar services, indirectly making Filecoin a backend for those NFTs. Web3.storage (general IPFS/Filecoin storage for apps) has tens of thousands of users and stores data for applications like Web3 games and metaverse content. Notably, Filecoin has attracted enterprise and institutional partnerships: it partnered with University of California Berkeley to store research data, with the NYC government to preserve open data sets, and with companies like Seagate (a hard drive manufacturer exploring Filecoin for enterprise backup solutions) and Ernst & Young (EY) for decentralized storage in business use cases. OpenSea also became a Filecoin client, using it to back up NFT data. These high-profile clients show confidence in Filecoin’s model. Moreover, by number of projects: over 600 projects and dApps were built on Filecoin/IPFS by late 2022, including everything from video platforms (e.g. VideoCoin, Huddle01) to DeFi oracle data archives, to scientific data repositories (Shoah Foundation’s Holocaust archives via Starling project). Filecoin’s blockchain has a wide community of over 3,900 storage providers globally, making it one of the most decentralized infrastructures by geography. However, Filecoin’s user adoption is sometimes tempered by complexity; many users interact through the easier IPFS layer. Still, with the advent of FVM and a push toward Filecoin as a full cloud platform (storage + compute), developer and enterprise interest is accelerating. In summary, Filecoin leads in capacity and enterprise engagement: it is the decentralized storage network in terms of scale, and while much of that capacity is underutilized, initiatives are in place to fill it with valuable content (open science data, Web2 archives, Web3 app data). Its proven ability to handle exabyte-scale makes it a strong contender to disrupt traditional cloud storage if demand catches up.
  • Storj Adoption: Storj has grown steadily by targeting web2/web3 hybrid use cases (especially media). The network consists of around 13,000+ storage nodes (individual operators running the Storj software at home or data centers) across more than 100 countries – providing strong decentralization. On the customer side, Storj has landed enterprise partnerships in media and IT: for example, Videon’s LivePeer (video streaming) uses Storj to distribute live video chunks globally, Fastly’s Compute@Edge partnered with Storj for storing assets, and as seen on their website, Storj is trusted by organizations like Cloudwave, Caltech, TrueNAS, Vivint, and several media production houses. The presence of Caltech (a leading research university) suggests use in scientific data storage, while Vivint (a smart home company) implies IoT or camera footage storage – diverse real-world applications. Storj has won industry recognition, such as Product of the Year 2025 at NAB (National Association of Broadcasters), for its solution in media workflows. They highlight case studies: e.g. Inovo streaming video to millions of users cost-effectively, Treatment Studios using Storj for global video collaboration, and Ammo Content streaming 30+ million hours of content via Storj’s network. These examples indicate Storj is capable of handling high-bandwidth, high-volume content delivery – a critical proof point. Developer adoption is also significant: over 20,000+ developers had accounts on Storj DCS by 2022 (from a Storj stat report). The open-source community has embraced Storj in integrations (as mentioned, FileZilla, ownCloud, Zenko, etc.). Node operator interest is high because Storj pays out in tokens; at times there have been waitlists to become a node due to demand. In terms of data stored, Storj hasn’t publicly announced total PB stored lately, but it’s known to be in the multiple petabytes and rapidly growing especially with recent pushes into the Web3 space. It might not rival Filecoin’s raw numbers (because Storj focuses on active data, not just capacity), but it’s likely the largest encrypted cloud storage network by data count. Storj’s multi-region, CDN-like performance has attracted Web2 users purely for cost-performance benefits (some don’t even care about it being decentralized, they just enjoy 80% cost savings). This “Trojan horse” into traditional industries means adoption can grow outside the typical crypto circles. Overall, Storj’s adoption is strong in media streaming, backup, and developer tooling. It demonstrates that a decentralized service can meet enterprise SLAs (reflected by their 11 9’s durability and partnerships with firms like Evergreen for backup solutions). With its pivot to also offer decentralized cloud GPUs, Storj is positioning as a broader decentralized cloud provider, which could further drive adoption.
  • Sia Adoption: Sia is one of the oldest projects here (launching in 2015), but its adoption trajectory has been more modest. As of Q3 2024, Sia’s network stored 2,310 TB (2.31 PB) of data, which was a ~17% quarterly increase, indicating usage is growing steadily, albeit from a smaller base. Sia’s utilization rate relative to capacity also improved, suggesting more hosts are getting business. The Sia network historically had many individual users using it for personal backups due to its low cost – imagine tech-savvy users storing their photo collections or running Sia as a cheaper “Backblaze alternative”. On the enterprise side, Sia hasn’t seen the same level of public partnerships as Filecoin or Storj. Partly this is due to the early-stage UX and the fact that Sia’s parent company Nebulous pivoted to Skynet (which targeted Web3 dApps and content hosting). Skynet adoption was promising in 2020–2021: it powered a Web3 social media ecosystem (e.g. SkyFeed had thousands of users), and even some NFT projects used Skynet for hosting artwork (Skylinks appear in some NFT metadata as an alternative to IPFS). Audius, the decentralized music platform, experimented with Skynet for some content delivery. However, the shutdown of Skynet’s main portal has put some of that momentum into community hands. The Sia Foundation (established in 2021) is now driving development, and they introduced Sia v2 (a hardfork in 2025) with improvements to performance and perhaps economics, which could spur future adoption. The ecosystem is smaller: Sia’s stats show 32 projects built on Sia (not counting user-facing apps), and a total of $3.2M in grants allocated by 2025 to foster growth. This includes projects like Filebase (which uses Sia as one backend), SiaStream (for media streaming storage on Sia), and community tools like HostScore and SiaFS. Sia’s community, while smaller, is passionate – for instance, there was a notable user-run operation storing the Library of Congress’s public data on Sia. The number of hosts on Sia is in the hundreds (not thousands like Storj), and many provide enterprise-grade setups (data center nodes) because profitability as a host is thin unless you have very cheap storage to offer. In summary, Sia’s adoption is niche but steady: it’s used by a core community for low-cost cloud storage and by some Web3 projects for hosting decentralized web content. Its usage (2+ PB stored) is non-trivial but lags far behind Filecoin; however, Sia distinguishes itself by being non-profit and community-driven, which resonates with those who prioritize decentralization ethos. The ongoing improvements (Sia v2) and focus on being “the world’s safest cloud” may yet attract more users concerned with self-sovereign data.
  • Ceramic Adoption: Ceramic being a specialized network for data/composable content, its adoption is measured by developers and applications rather than sheer storage volume. According to the Ceramic website (2025), over 400 apps and services are built on Ceramic, managing around 10 million streams of content. This indicates a growing interest in decentralized data among Web3 app developers. Some notable projects using Ceramic include Orbis (decentralized social networking protocol, akin to Twitter on Ceramic), CyberConnect (social graph protocol initially built on Ceramic DIDs), Gitcoin (which explored Ceramic for decentralized user profiles), and Self.ID (an identity hub for users to manage profiles across dApps). Additionally, DID adoption via Ceramic’s 3ID has been significant – for example, many Ethereum-based applications leveraged Ceramic to store user profiles (so your profile could port between say Uniswap and Boardroom and Snapshot for DAOs). There have been partnerships like NEAR Protocol integrating Ceramic for cross-chain identity, showing that Layer-1 blockchains see Ceramic as a solution for off-chain user data. Another domain is DeSci (decentralized science): projects use Ceramic to store research metadata, lab notes, etc., where data needs to be shared and verifiable but not immutable (updates needed). The fact that 3Box Labs (Ceramic’s founding team) recently joined with Textile (a team known for IPFS/Filecoin tooling) is also telling – it suggests an effort to combine forces and perhaps will expand Ceramic’s reach in the data infrastructure domain. The number of active Ceramic nodes is not public, but many apps run their own or use the community nodes. In the big picture, Ceramic is newer and its concept of a “dataverse” is still catching on; it doesn’t have household name enterprise users yet, but it’s seeing grassroots Web3 adoption in areas that existing storage networks don’t serve well (like social media content and cross-app data interoperability). As a benchmark, if we consider each stream as a piece of data, 10 million streams is substantial, though many streams are tiny (like a user’s profile doc or a single post). The metric to watch is how many end-users those 400 apps bring – potentially in the hundreds of thousands, if apps like decentralized social networks scale up. In summary, Ceramic’s adoption is promising in the Web3 dev community (hundreds of apps, integration in various Web3 ecosystems), but it’s inherently limited to specific use cases and is not competing on storage size or throughput with the likes of Filecoin/Arweave.

To visualize adoption, the table below highlights some metrics and notable adopters:

NetworkData Stored / CapacityUser Base & DevelopersNotable Usage Examples / Partners
Arweave~140 TB stored (2023) (fully permanent).Thousands of users; strong NFT and archival dev community.Solana NFT metadata via Metaplex Candy Machine; Meta/Instagram NFT media storage; KYVE (2 PB of blockchain data); Permanent web archives (e.g. web pages, documents) by Internet Archive enthusiasts.
Pinata/IPFSDifficult to measure (IPFS global network in PBs). Pinata pins probably many PB of NFT data.600k+ developers on Pinata; IPFS used by millions via browsers and apps.Top NFT projects & marketplaces (Ethereum and others) rely on IPFS+Pinata; Browser integrations (Brave uses IPFS for content); Cloudflare and Infura run public IPFS gateways serving billions of requests.
Filecoin~22 EB capacity, ~0.66 EB (660 PB) used (2023). Used storage growing fast (45 PB in early 2022; now much higher with FIL+).Thousands of clients (direct or via services); 3,900+ miners globally; 600+ ecosystem projects.OpenSea (backing up NFT data); UC Berkeley (research data); NYC Open Data; Shoah Foundation archives; Seagate & EY partnerships for enterprise storage; NFT.storage & Web3.storage (over 150M NFT files).
StorjSeveral PB stored (exact not public; growing via media use). Network: ~13k nodes in 100+ countries.20k+ developers; mix of Web3 and Web2 customers. Node operator community worldwide.Video/Media platforms (e.g. 30M+ hours streamed via Storj for one client); Telecom/Smart Home (Vivint); Academia (Caltech); ownCloud integration for enterprise file sharing; FileZilla integration for backups; recognized by Forrester as a top disruptor.
Sia~2.3 PB used (Q3 2024); capacity somewhat higher (a lot of free space still on hosts).Hundreds of active hosts; user count not published (likely in thousands). Developer count relatively small (32 projects listed).Personal & small business backups (via Filebase, Sia-UI); Skynet dApps (decentralized social media, web hosting – e.g. SkyFeed had thousands of users at peak); VPN/Proxy services using Sia for logs (privacy-sensitive storage); Library of Congress data (community-driven archival on Sia).
Ceramic~10 million streams (pieces of content) across network (data size is small per stream).400+ apps built on it; user reach in tens of thousands (through those apps). Growing dev community via grants and hackathons.Decentralized Social (Orbis for Twitter-like feeds); Cross-app profiles (e.g. used in multiple Ethereum dApps for unified profiles); DAO tools (governance forums storing proposals/comments via Ceramic); Identity (DID wallets, verifiable credentials in DeFi KYC); Near Protocol using Ceramic for profiles.

5. Ecosystem Maturity and Activity

Beyond raw usage, the maturity of each ecosystem – including third-party tools, integrations, funding, and community activity – is crucial in evaluating long-term viability:

  • Arweave Ecosystem: Arweave’s ecosystem is robust for its size. On the infrastructure side, a number of projects enhance Arweave’s functionality: Bundlr (Iris), as mentioned, operates a network of bundling nodes and has raised its own funding to scale Arweave throughput (processing over 1 billion bundled transactions by late 2023). ArDrive is a popular user-facing app that offers a Dropbox-like experience on Arweave – it became fully decentralized in 2023 and launched a 2.0 with features like large file support. EverPay and Warp enable a layer-2 style instant transactions and smart-contract-like functionality on Arweave by using the permaweb as a base layer (Arweave itself doesn’t support traditional smart contracts, but these projects store contract states and allow interactions). In 2024, Arweave introduced the “Atomic Oasis (AO) Compute” – a compute-over-data network that sits atop Arweave, allowing parallel on-chain computation while using Arweave for data availability. This essentially brings Arweave into the realm of providing cloud computing (similar to how Filecoin is adding compute with FVM) and indicates a forward-looking roadmap. On the funding front, Arweave has strong backing: it raised $37.3M from a16z, Union Square Ventures, and others, ensuring runway for continued development. The community is engaged via a profit-sharing token (PST) system – developers can create PSTs for their permaweb apps, which entitle holders to a share of fees, incentivizing app development. There are numerous permaweb apps live: from Decent.land (decentralized social profiles on Arweave) to CommunityXYZ (a DAO platform for Arweave PSTs). Arweave has a DAO-like governance for its endowment as well, involving the community in decision-making. The network has undergone major upgrades smoothly (e.g. the SPoRA consensus upgrade in 2022). Integration-wise, Arweave has been integrated with other chains: Smart contracts on Ethereum, Polkadot, Avalanche have used Arweave for storing large data or metadata (often via the Arweave <-> Ethereum bridge and via The Graph indexing). Lens Protocol (Web3 social on Polygon) offers Arweave as an option for permanent post storage. Arweave’s collaboration with Solana is deep: it’s essentially Solana’s archival layer, and now with Solana’s new phone (Saga), there was mention of Arweave being used to permanently store mobile dApp content. Overall, the Arweave ecosystem is active and growing with dedicated storage apps, cross-chain integrations, and even exploring new verticals like compute. The community culture is centered on “the permaweb” mission – evidenced by initiatives like Arweave Boost (a program that subsidized storage costs for valuable data sets) and partnerships preserving cultural data (e.g. archives of Ukraine war documents have been stored on Arweave by activists). All these signs point to a mature, mission-driven ecosystem, albeit smaller than Filecoin’s.
  • Pinata/IPFS Ecosystem: Pinata itself is a single company’s offering, but it sits within the larger IPFS ecosystem, which is very extensive. Pinata’s ecosystem activity includes partnerships (as seen with NFT.storage – they have a referral deal that supports the NFT.storage mission) and integrations in creator platforms (for example, some NFT minting platforms have built-in Pinata uploading for user convenience). Pinata has expanded its own product features (KV storage, private IPFS, gateway plugins, etc.) indicating a drive to provide more than basic pinning. Meanwhile, IPFS as a whole has a huge open-source community: projects like IPFS-Cluster (for orchestrating your own pinning network), Textile (which built ThreadsDB and other tools on IPFS), Fleek (which provides hosting on IPFS for web apps), and many others flourish. The Protocol Labs ecosystem that IPFS belongs to also includes libp2p (networking layer), and Filecoin – developments there often benefit IPFS (e.g. IPFS retrieval caching via Filecoin Saturn is a new initiative). The maturity is such that IPFS is in version 0.15+ and has been battle-tested. Enterprise use of IPFS beyond crypto is emerging: e.g. NFL (American football league) used IPFS to distribute video highlights to fans (to reduce bandwidth costs). Cloudflare’s IPFS gateway shows interest from Web2 players in bridging to IPFS. There are even IPFS RFCs and academic research regularly produced, showing it’s a well-established protocol. In terms of support, countless libraries exist (Go, JS, Python, Rust IPFS implementations). IPFS is essentially the de facto standard for content addressing now. Pinata benefits from all this maturity while contributing a user-friendly layer. One challenge historically was discoverability on IPFS (content addressing doesn’t provide search) – ecosystems tools like IPFS Search engines and pinning service indexes have been created, and Pinata likely participates in those networks (they might share data about pins if users opt in, to help permanence). Pinata’s move to support NFTs specifically (with dedicated guides and case studies for NFT devs) shows adapting to user needs. They have also been active in community events (sponsoring hackathons, etc.). Summation: IPFS’s ecosystem is very mature (8+ years old, widespread adoption), and Pinata is a key commercial player in that space, well-integrated with others. Pinata’s own ecosystem is more about its customer base (devs and creators), which is large and growing, rather than third-party developers building on Pinata (since it’s not open-source). But given IPFS’s openness, switching costs are low – Pinata stays competitive by offering reliability and ease. This competitive environment includes Infura, web3.storage, etc., which spurs continuous improvement. In summary, IPFS is as mature as decentralized storage gets, and Pinata rides atop it, focusing on developer UX and adding features to remain a go-to service in a thriving interoperative ecosystem.
  • Filecoin Ecosystem: Filecoin’s ecosystem is arguably the most active and well-funded in decentralized storage. Since launch, Protocol Labs and the Filecoin Foundation have organized numerous hackathons (HackFS, Space Race, etc.) and accelerator programs (e.g. Filecoin Launchpad with Tachyon) to seed startups. By 2022, as noted, over 330 projects were building on Filecoin – by 2025, this number is even higher, especially with FVM enabling DeFi and new primitives on Filecoin. A significant development was the launch of the Filecoin Virtual Machine (FVM) in 2023, which brought general programmability (smart contracts) to Filecoin. This has spawned projects like Filecoin DeFi (marketplaces for storage deals, tokenized storage, etc.), data DAOs (decentralized organizations pooling funds to pay for storing valuable data), and cross-chain bridges to use Filecoin storage in Ethereum dApps. Additionally, retrieval markets (like Lighthouse or others) are being built so that content stored on Filecoin can be delivered efficiently by incentivized nodes (complementing IPFS). On the enterprise side, as mentioned, Filecoin’s partnerships with major companies (Seagate, etc.) imply a developing ecosystem of enterprise tools – e.g. there is talk of Filecoin integration with IBM’s cloud or other storage vendors for hybrid solutions. Filecoin’s governance and community is also notable: a Filecoin DAO (governance proposals) exists, and Filecoin Plus is managed by community-selected notaries who verify real data – a social trust system unique in this space, showing a maturing governance process. The network’s tokenomics, while complex, have been holding with thousands of miners participating, indicating a healthy supply side. Another ecosystem component is L2s on Filecoin: projects like Polybase or Tableland (decentralized databases) considering using Filecoin for data availability, and Estuary providing an API on top of Filecoin for easier storage. There are even Filecoin side-chains (one called Filecoin Saturn focuses on content delivery, using Filecoin as payment). The Protocol Labs research team remains active in improving the tech (e.g. improving proof performance, exploring new coding schemes for durability). Community events like Filecoin Orbit meetups and the annual Sustainable Blockchain Summit (which often highlights Filecoin’s role in open data and sustainability) further cement the ecosystem’s vibrancy. Funding-wise, beyond the ICO, a huge Filecoin Ecosystem Fund ($100M+) was launched in 2022 to invest in projects building on Filecoin. Notable investments went to companies like ChainSafe (building Filecoin tooling), Open Forest Protocol (using Filecoin to store climate data), etc. Summarizing, Filecoin’s ecosystem is large, well-capitalized, and rapidly evolving – it’s become more than just storage, aiming to be a full-fledged decentralized cloud (storage, retrieval, compute, maybe even databases). This breadth is a sign of maturity, but also means it’s contending on multiple fronts (competing with specialized networks in each domain). However, the synergy with IPFS and the backing of Protocol Labs provide strong momentum.
  • Storj Ecosystem: Storj’s ecosystem, while not as “web3-trendy” as Filecoin’s, is quite mature in terms of integration and enterprise readiness. On the supply side, Storj has a stable base of node operators thanks to consistent token payouts. The node software (now in its third major version) is well-documented, and operators have community tools (like Grafana dashboards, etc.) to monitor their nodes. Storj Labs has also been creative in incentivizing adoption: they offered free storage for open source projects, encouraging communities to try Storj for things like hosting release binaries or datasets. On the demand side, Storj’s focus on media and big data workflows has led to integrations: e.g., Iconik (a media asset management software) supports Storj as a backend, Gsuite alternative Skiff Mail/Drive uses Storj to store encrypted email attachments and files, and the partnership with ownCloud enables enterprises to plug Storj in without changing their workflows. The open-source library ecosystem is growing: for instance, Fastly’s Terrarium project uses Storj for edge caching. Storj also emphasizes community developers: they have an active forum where third-party devs share projects (like a WordPress plugin to offload media to Storj, a Veeam backup integration, etc.). One sign of maturity is third-party services on top of Storj: e.g., Filebase not only uses Sia, it also added Storj as a backend in 2021 – meaning Filebase’s users can choose Storj via the same S3 interface. This shows Storj is stable and attractive enough to be included in a multi-backend storage service. Storj’s token, while used for payouts, is largely abstracted away for customers, which might limit DeFi-style composability but increases traditional adoption. In 2022-2023, Storj repositioned itself as not just storage but part of a distributed cloud platform, launching Storj Next with plans for compute and databases. Indeed, their Cloud GPUs product (in beta as of 2025) extends the ecosystem to edge computing – renting GPUs from decentralized providers. If successful, this will create a mini-ecosystem of GPU providers and users under the Storj umbrella, further entrenching their platform. Storj Labs itself remains the primary steward of the network (somewhat more centralized governance compared to Filecoin or Sia which have separate foundations), but they have made the code open source and welcome community contributions. They also underwent third-party audits (security, compliance like SOC2), which is important for enterprise trust. In summary, Storj’s ecosystem is mature in integrations and enterprise features, though smaller in pure Web3 developer community presence. It’s carving a niche where decentralized tech is sold on its merits (cost, security) rather than ideology, which could prove a sustainable approach.
  • Sia Ecosystem: Sia’s ecosystem has had ups and downs. After Skynet Labs shutdown, the Sia Foundation took over and has since been executing on a roadmap for Sia v2 (code name often “Nebulous” for the hardfork). They’ve rebranded some pieces (the Skynet portal code is being reworked into Sia v2 which will unify the renter-host protocol with improved performance and maybe an integrated portal functionality). The current ecosystem includes Sia central apps like Sia-UI and the host software, as well as community projects like HostScore (benchmarking hosts) and SiaStats/SiaGraph (network stats sites). The grant program launched by the Sia Foundation (with $3.2M allocated by 2025) is spurring new tools: for example, SiaFS (a FUSE file system for Sia), Décentral (decentralized web frontends on Sia), and S5 (a content-addressed layer on Sia that mimics IPFS functionality). These indicate a recognition of where Sia needed to catch up (e.g., making it easier to reference and share content). The community, while smaller, remains dedicated – the r/siacoin subreddit is active, and many long-term users stick with Sia for ideological reasons (true decentralization, no heavy VC influence, etc.). Sia’s tokenomics (with Siafunds) have remained stable; Siafunds even trade as a kind of “dividend-earning” token from contracts – a unique aspect of Sia’s ecosystem financial model. Competition within ecosystem: A few companies built on Sia, like Skynet’s spin-offs, didn’t survive, which slowed ecosystem growth. But new ones are coming: e.g., Cloudless is a recent startup building a user-friendly Sia storage app; PixelSlime uses Sia to store NFT game assets, etc. The Filebase integration (multi-network) means Sia is part of a broader ecosystem indirectly. The Sia Foundation publishes monthly “State of Sia” updates which increases transparency and fosters trust in development progress – a healthy sign of community engagement. One challenge is that Sia hasn’t achieved the mindshare of IPFS or Filecoin in Web3 – some developers who want decentralized storage don’t consider Sia simply because it’s less talked about. However, those who do use it often praise its reliability and low cost, which suggests potential for word-of-mouth growth if the DX hurdles are reduced. Summation: Sia’s ecosystem is in a rebuilding and growth phase under the Foundation. It is smaller and more grassroots compared to others, but with a long history and some distinct features (like no reliance on other protocols, an altruistic ethos). The next year or two (with Sia v2 launch) will be critical to see if it accelerates.
  • Ceramic Ecosystem: Ceramic, being relatively new (launched around 2021), has shown good ecosystem traction among decentralized app builders. 3Box Labs secured significant funding (from firms like Coinbase Ventures, Multicoin, etc.) to develop Ceramic and its tooling. The ecosystem includes the Ceramic Network itself plus ComposeDB as a flagship product for developers. They’ve cultivated community through Discord and regular developer calls. One interesting aspect is Data composability standards: Ceramic has a “data models marketplace” where developers can publish and reuse each other’s schemas (e.g., a profile model, a blog post model), which fosters an ecosystem of interoperable data. This is quite a unique approach – it encourages apps to build on common data structures (just like many dApps share the ERC-20 token standard, Ceramic apps can share a “SocialPost” model or “Profile” model). This means as more apps adopt these, a user’s profile or content can be ported across many services (a true Web3 network effect). The ecosystem also interacts with other networks: e.g., Ceramic uses Ethereum for anchoring by default, so improvements in Ethereum L1 or L2 (they have plans to use scaling solutions for cheaper anchors) directly benefit Ceramic. They’ve also integrated Chainlink (for timestamping on multiple chains) and IDX which can link Ceramic identities with blockchain addresses. Another synergy is with wallets: since Ceramic’s user authentication often is via crypto wallets, wallet providers are partners in a sense. For instance, MetaMask’s Snaps could eventually include Ceramic integration to manage user data, and identity wallets like Spruce or IDen3 might bridge to Ceramic’s identity. The Textile merger (joining the “Textile family”) suggests alignment with other data/storage projects (Textile originally built on IPFS/Filecoin; their Threads DB concept complements Ceramic’s streams). This could yield new hybrid solutions (e.g., using IPFS for content and Ceramic for metadata seamlessly). In terms of community projects, we see hackathon winners using Ceramic for things like NFT ticketing (store ticket metadata that updates on Ceramic) or DAO member profiles. Ceramic’s mainnet is still young, but it has multiple gateway providers (similar to Infura for Ethereum, there are hosted Ceramic nodes one can use) – including one by 3Box and others by community – which shows decentralization in access is being addressed. The roadmap includes features like “ceramic anchoring on multiple chains”, “light nodes” for easier participation, etc., which are signs of maturing technology. To sum up, Ceramic’s ecosystem is dynamic and dev-focused, with an emphasis on interoperability and integration with the broader Web3 stack. It’s not a general-purpose storage ecosystem but rather a composable data ecosystem, which it seems to be achieving with hundreds of devs and an ethos of collaboration (data models marketplace). Its success will depend on whether those 400 apps onboard large user bases, but the infrastructure and community groundwork is being actively laid.

6. Price Comparison

ServicePricing modelStorage price (USD per TB‑month)Key notes
Amazon S3 (Standard, us‑east‑1)Pay‑as‑you‑go$23.00 (first 50 TB)$0.023/GB‑month (tiered). AWS bills in GiB; that’s $23.55/TiB‑month. Egress & requests are extra. (AWS Documentation)
Wasabi (Hot Cloud Storage)Pay‑as‑you‑go$6.99Flat rate $6.99/TB‑month (~$0.0068/GB). No egress or API request fees. (Wasabi Technologies)
Pinata (IPFS pinning)Plan$20.00 (included 1 TB on Picnic)Picnic plan: 1 TB included for $20/mo, +$0.07/GB overage (=$70/TB). Fiesta: 5 TB for $100/mo (=$20/TB), +$0.035/GB overage (=$35/TB). Bandwidth & request quotas apply. (Pinata)
Arweave (permanent)One‑time≈ $12,081 per TB (once)Calculator example: ~2033.87 AR/TB at AR≈$5.94. If you amortize: ≈$1,006/TB‑mo over 1 yr; ≈$201/TB‑mo over 5 yrs; ≈$101/TB‑mo over 10 yrs. Model is “pay once for ~200 years.” Prices vary with AR & fee market. (Arweave)
Walrus (example via Tusky app)Plan$80.00Tusky “Pro 1000” lists 1 TB for $80/mo (≈$64/mo on annual, –20%). Network‑level prices may differ; this is an app’s retail price on Walrus. (Tusky)
Cloudflare R2 (Standard)Pay‑as‑you‑go$15.00$0.015/GB‑month. No egress fees; operations are billed. Infrequent Access tier is $10/TB‑mo. (Cloudflare Docs)
Backblaze B2Pay‑as‑you‑go$6.00$6/TB‑mo, free egress up to 3× your stored data/month. Requests billed. (Backblaze)
StorjPay‑as‑you‑go$6.00$6/TB‑mo storage, $0.02/GB egress, and a $5 minimum monthly usage fee (as of Jul 1 2025). (Storj)

7. Use Cases and Applications

Decentralized storage networks can serve a variety of use cases, each with different requirements (permanence, mutability, speed, etc.). Below we explore some prominent use cases and how each of the discussed vendors fits into them:

a. NFT Metadata and Media Hosting: Perhaps the killer app of 2021 for decentralized storage was NFTs. NFTs on chains like Ethereum and Solana typically store only a token ID on-chain, with the metadata JSON (which contains attributes, name, description) and the media file (image, video, audio) stored off-chain. Decentralized storage is crucial here to avoid NFTs pointing to disappearing links.

  • IPFS + Pinata became the standard for most Ethereum NFTs – creators upload their media to IPFS and use a hash (CID) in the token’s URI. Pinata is often used to ensure the content is persistently pinned and quickly accessible via IPFS gateways. This way, even if a creator goes away, the NFT content can be retrieved by anyone with the CID. For example, high-profile collections like Bored Ape Yacht Club used IPFS for images. Pinata’s role was to make sure tens of thousands of images were reliably available without creators running their own IPFS infrastructure. Marketplaces (OpenSea, etc.) fetch metadata from these IPFS links to display NFTs. The advantage: content addressing adds trust (buyers can verify the hash of the asset matches what’s on sale) and censorship resistance (no single server hosts the images). The challenge: if no one pins the data, it could still vanish – hence services like Pinata or NFT.storage stepped in to maintain availability.
  • Arweave emerged as a strong solution for permanent NFT storage. Projects that wanted to ensure NFT assets live forever gravitated to Arweave, despite higher cost. Solana’s NFT ecosystem is a prime example: Solana’s Candy Machine (minting program) directly integrates with Arweave to upload media and metadata, returning an Arweave TXID URL (often proxied via arweave.net) for the NFT’s token URI. This means once minted, the NFT’s JSON and image are in Arweave’s permaweb permanently (paid for by the minter). Solana’s MetaPlex claims this design was to guarantee collectors that their NFT art won’t disappear or change. Even on Ethereum, some projects used Arweave for high-value art or generative pieces (e.g., Async Art stored components of programmable art on Arweave). Additionally, Arweave’s bundled upload capability allowed for efficient batch storing of thousands of images for NFT drops.
  • Filecoin (via NFT.storage) became a popular back-end for NFTs as well, especially after mid-2021 when NFT.storage launched free storage. NFT.storage uses a hybrid: it pins data to IPFS (for fast retrieval) and concurrently stores that data with multiple Filecoin miners for long-term durability. Many NFT projects (including some on Ethereum, Polygon, and Flow) use NFT.storage, trusting Protocol Labs to keep their content alive (which is facilitated by Filecoin deals). The benefit here is projects get decentralized redundancy without needing to pay (subsidized by Filecoin token economics). Some projects also like the idea of data being on a blockchain-backed network (Filecoin) with cryptographic proofs. There’s also concept of “NFT Checkers” being developed to verify preservation status (e.g., NFT.storage’s upcoming checker to show which NFTs are safely stored).
  • Storj and Sia have been less commonly used for NFTs. However, they are fully capable of hosting NFT media; it’s more a matter of what integrations exist. Sia’s Skynet had some NFT integrations (like SkyNFT, which allowed minting Sia-hosted NFTs with Skylinks as the token URI). Storj, with its focus on enterprise, didn’t directly target NFTs, but conceivably a marketplace could use Storj for hosting content (enjoying the CDN performance). The reason IPFS/Arweave dominated NFT storage is largely due to network effects and tooling: IPFS had widespread support in NFT minting libraries, and Arweave had a clear value proposition of “forever storage” that resonated with collectors. In contrast, using Storj would require a custom integration (though technically one could use a Storj link or gateway URL as the token URI – it’s just not common). Sia’s advantage is cost; an NFT project concerned with budget might quietly use Sia via Filebase to store assets at low cost, but it would be unusual as it’s not the standard approach.
  • Ceramic comes into play for NFTs in the context of dynamic or evolving NFTs. If an NFT’s metadata needs to update (say a game item that levels up), Ceramic could store those evolving properties since it allows mutable streams. Also, Ceramic can be used for NFT ownership tracking off-chain or linking an NFT to a user’s profile. But for the actual media files, Ceramic would typically just store references (like CIDs or Arweave links) because it’s not meant for large binary blobs.

b. Decentralized Application (dApp) Backends: Many decentralized applications require storing data that is too large or inappropriate for the blockchain (either due to cost or because it’s user-generated content). Decentralized storage networks fill this gap, acting as the “backend” or database for dApps:

  • Web3 Front-end Hosting: A common pattern is to host the front-end code (HTML/JS/CSS) of a decentralized app on a decentralized storage network, so that the app can be accessed in a decentralized way (often via IPFS gateways or Arweave URLs). IPFS (via services like Fleek or Pinata) is widely used to host static sites for DeFi apps, NFT marketplaces, etc., ensuring that even if the main website is down, users can retrieve the UI via IPFS. Arweave is also used for hosting front-ends that need to be censorship resistant – e.g. many Ethereum DeFi project UIs are uploaded to Arweave and pinned with an ENS link. In one notable event, when Turkey banned certain crypto sites, users shared Arweave links of Uniswap’s interface so that it could still be accessed. Skynet was enabling similar distributed web hosting (Skynet’s SkyLive and SkyPages hosted videos and personal pages). Storj could also serve web assets, although typically IPFS/Arweave are the go-to for front-ends because of easier linking and content addressing. By using decentralized storage for front-ends, projects reduce reliance on centralized servers – if done fully, a user can interact with a dApp by loading the UI from IPFS/Arweave, which then connects to smart contracts on-chain, achieving a fully decentralized stack.
  • User Data and Social Applications: Decentralized social media or collaboration apps need to store posts, messages, profile info. Ceramic shines here – it provides schemas for common social data types and allows updates. For example, an app like Lens Protocol (decentralized social on Polygon) uses IPFS for storing posts content (often via an IPFS gateway), but could use Ceramic for user profiles or an index of posts, enabling cross-platform social identity. Orbis uses Ceramic to store tweets and comments so that multiple front-ends can display the same content. Arweave can be used for social content that one might want permanent (someone might choose to Arweave-post something akin to an “immutable tweet”), but generally social needs mutability (edit/delete), which Arweave doesn’t allow. Sia/Skynet had a social media demo (SkyFeed) which stored posts on Skynet (somewhat mutable via updating a registry entry pointing to latest feed). For chat applications, Matrix (an open decentralized chat protocol) can be configured to store media in IPFS or Sia for truly decentralized media sharing – these experiments have been done by communities concerned with data ownership. Storj could serve as a backend for apps needing secure user file storage (for instance, a decentralized Dropbox-like app can use Storj to store user files with client-side encryption, benefiting from the network’s resilience). The Holochain community even looked at using Storj or Sia as a file storage layer for large data that doesn’t fit into Holochain’s peer db model.
  • Decentralized Identity (DID) and Credentials: This is a use case where Ceramic and IPFS play key roles. DID documents (which are small JSON documents describing a user’s public keys and so forth) can be stored on IPFS (some DID methods do that). Ceramic’s DID method (did:3) actually stores the DID Document events on Ceramic streams. Also, verifiable credentials (cryptographically signed claims, often a couple of KB in size) can be stored/distributed via IPFS or Ceramic, rather than on a blockchain, and then referenced by identity protocols. Arweave might be used to archive public credentials or attestations for posterity.
  • Back-end Data for DeFi and DAOs: Many DeFi protocols produce a lot of data (trade records, analytics) that are too expensive for chain. Some projects use Filecoin/IPFS to store trading history or backups of state. The Graph (which indexes blockchain data) recently introduced support for Arweave as a storage for queryable data – Graph Nodes can persist subgraph data to Arweave, leveraging its permanence. DAOs often need to store proposals, charters, voting data snapshots – services like Snapshot use IPFS to store proposal JSON and IPFS links are embedded on-chain in governance votes. Aragon (DAO platform) stored some DAO config files on IPFS. DAOs handling large files (like a research DAO storing datasets) might use Filecoin for bulk storage and then share CIDs in the DAO.

c. Archival Data Preservation: A core promise of decentralized storage is preserving information indefinitely and redundantly – be it historical records, open data, scientific knowledge, or cultural artifacts:

  • Arweave explicitly markets itself for archival. The term “permaweb” is about creating a permanent archive of human knowledge. We’ve seen Arweave used to store archives of websites (e.g., the Internet Archive collaborated with Arweave to store data, and there’s a browser extension “ArweaveSave” to archive web pages to Arweave). Arweave’s profit-sharing communities also funded projects like ArweaveNews (to archive news articles immutably). And given that once data is on Arweave it’s practically uncensorable and permanent, it’s ideal for preserving important documents (e.g. journalists have used Arweave to store documents from war zones or protests so they cannot be erased). Arweave has even been described as a “Library of Alexandria that cannot burn”. Organizations with large archival needs (libraries, museums, academic institutions) are interested, though cost is a barrier for really large data sets unless they secure funding.
  • Filecoin also targets archival use cases. It was famously used to store Open Access scientific datasets and open government data via the Filecoin Discover program (where people who stored certain approved datasets got mining advantages). Starling Lab (a partnership between USC Shoah Foundation and Stanford) uses Filecoin to archive sensitive historical media (like Holocaust survivor testimonies) with authenticity proofs. The Filecoin Archives initiative has community groups selecting and uploading important public data (for example, pandemic data, or massive literature corpuses). Because Filecoin storage is cheap, it’s practical to store multi-terabyte archives like entire public dataset repositories. One challenge is retrieval – if these archives aren’t accessed often, they may sit with miners and require some effort to retrieve (like staging a retrieval deal or pinning to IPFS when needed). But efforts like Filecoin Saturn aim to keep popular archival data cached on IPFS for quick access.
  • Storj can serve archival needs with its high durability and automatic geo-distribution. For instance, medical archives or media archives can be stored on Storj to reduce risk of loss and avoid maintaining multiple copies in different regions (Storj does that automatically). It’s also financially attractive for archives that currently pay big cloud bills, as Storj is cheaper. One niche Storj touts is backup storage – companies can use Storj as a backup target (it even has integrations for common backup software). So not only new archives, but also backups of existing important data, can be put on Storj to leverage that redundancy and security (with the bonus of encryption – helpful if the data is sensitive).
  • Sia is inherently good for archival because of its low cost and redundancy. If someone wants to keep a personal archive of documents or photos for decades, Sia can do it at minimal expense (assuming they keep renewing contracts). In fact, there’s a story of someone archiving all of Wikipedia on Sia, leveraging its affordability. Sia’s fully private nature (encrypted by default) is appealing for archives with privacy concerns (e.g., personal health records). The drawback for organizational archives is the complexity of managing Siacoin and contracts over long periods – but a service like Filebase again could fill that gap by providing a subscription model on top of Sia.
  • Ceramic is less about archival (since it’s for dynamic data). It wouldn’t be used to archive large files or big data; rather, it could archive small pieces of historical data in verifiable way (like an official document or policy text, which can be versioned on Ceramic). But if something needs to be truly permanent and unalterable, you’d choose Arweave or similar instead.

d. Content Delivery and Streaming: Decentralized storage can also function in delivering content to end-users, sometimes even live or on-demand streaming:

  • Storj has emphasized this use case. With its architecture, content is effectively on a distributed CDN – nodes around the world can serve video segments to users from the nearest / fastest node, yielding low latency. Storj’s partnership with platforms streaming video (like the example of 30M hours of content streamed) shows it can handle high throughput. It’s suitable for video on-demand (store the video files, and when user plays, the player fetches pieces in parallel – Storj even had a demo where they compared to Cloudflare Stream and had competitive performance). For live streaming, decentralized storage itself isn’t typically used because live requires immediate relay, but after live broadcast, the recordings can be distributed via Storj or Filecoin. Storj is also good for software distribution (it can deliver large binaries or game files to users quickly, acting like a decentralized Akamai).
  • IPFS can be used for content delivery if content is popular enough to be seeded by many nodes. There have been experiments with IPFS for live streaming (e.g., Livepeer used IPFS in their earliest versions to distribute video segments from broadcasters to viewers). IPFS’s challenge is ensuring availability – for popular content, multiple peers might have it (especially if a BitTorrent-like swarming is set up), but for less popular content, a fallback to a gateway that has it pinned (like Pinata’s) is needed. IPFS also introduced IPFS Cluster for pinning content across multiple nodes which can help with load balancing serving to many users.
  • Filecoin is currently more for storage than live delivery, but a concept of Filecoin as a CDN has been floated (using storage miners geographically to serve content). The likely scenario is that Filecoin’s data will be cached on faster networks when needed (e.g. IPFS or specialized retrieval providers). There’s also work on Beta Storage Providers (who might keep some popular datasets always online and close to users).
  • Sia/Skynet had a unique approach: Skynet portals did act as CDN nodes. If a Skylink was frequently accessed, a Skynet portal would cache it and serve quickly to others. Since multiple portals exist, they collectively provided redundancy and some performance. Skynet also allowed decentralized web apps to fetch content from whichever portal was fastest. However, with main portal down, the performance now depends on community-run ones.
  • Arweave wasn’t designed for high-speed content distribution – retrieving a large file from Arweave means you have to query the network (often via a gateway) and miners fetch the pieces from the weave, which could be slower than a dedicated CDN. That said, Arweave’s Wildfire protocol incentivizes miners to share data quickly amongst themselves, which improves retrieval time for the next request. For moderately popular content (like an image that a few people fetch), the gateways often cache it. There are also community-operated Arweave gateways (similar to IPFS gateways) that could be globally distributed to improve performance. For truly latency-sensitive streaming, Arweave is not the first choice; but for delivering content that is okay with a second or two latency and where permanence matters (like a public document or site), Arweave is fine.

In practice, solutions can be layered: for instance, an app might store content permanently on Arweave or Filecoin, but use IPFS or Storj to distribute it to end-users quickly. Or use Ceramic for the index and references, Arweave for backup, and IPFS for hot access.

Summary of Use-Case Fit: Each solution has strengths in particular areas:

  • Arweave: Best for permanent web content, NFT media that must last forever, and tamper-proof archives. Often used for NFT metadata, blockchain history archive, web pages/blog posts preservation, and censorship-resistant hosting of sites. Less suited for frequently changing data or high-bandwidth streaming.
  • Pinata/IPFS: Ideal for content addressing and distribution. Used for NFTs (widespread), web asset hosting for dApps, general file sharing in Web3 apps, and any scenario needing a quick, decentralized CDN. With proper pinning, works for semi-permanent storage, though not guarantee permanent unless someone keeps pinning (hence often combined with Filecoin). Great for collaborative data (multiple people can fetch and re-host content). Less ideal if you need guarantee of long-term persistence without maintenance.
  • Filecoin: Great for long-term storage at scale, especially when combined with IPFS for retrieval. Used for archival of large datasets, backup of NFT assets (many NFTs are quietly backed by Filecoin through NFT.storage even if front-end uses IPFS URIs), enterprise cold storage, and increasingly, data-heavy Web3 services (like video, where Filecoin can store source files or transcoded outputs cheaply, while a separate layer handles streaming). Also being tried for data DAOs (communities funding storage of valuable data like OpenStreetMap, genomic data, etc.). Not typically used for real-time data or frequent small writes due to deal overhead, but evolving in that direction with new tools.
  • Storj: Excels in secure, performant cloud storage, targeted at developers and enterprises who want a plug-and-play decentralized solution. Used for media streaming platforms, app backends needing S3 API (some blockchain explorers could use Storj to store user-uploaded content, for example), globally accessible collaboration files (e.g., video production houses sharing large footage globally through Storj rather than Dropbox to get speed + cost benefits). Also suitable for IoT data storage (lots of small writes from many locations – Storj can handle parallel writes well). Possibly less used in the purely crypto context (e.g. NFTs) simply due to IPFS’s mindshare there, but technologically it could do many of the same tasks with potentially better performance.
  • Sia: Good for budget-conscious decentralized storage and those prioritizing privacy. Often the choice for personal backups (by crypto enthusiasts), redundant backup for businesses (some might run Sia alongside traditional storage to have an extra copy off-site encrypted), and as the layer behind user-friendly services like Filebase. Skynet extended it to web hosting and app data, which had a brief flourish and might return via Sia v2. Sia is also used in some VPN or cloud services as an encrypted data dump (there were projects to use Sia as a decentralized Dropbox for end-users like Pixeldrain). In summary, a solid general-purpose backend if configured, but with fewer turn-key integrations than Storj or IPFS.
  • Ceramic: Tailored for dApp data and identities. Used for user profiles in Web3 (one profile accessible across many dApps), social content (posts, comments that need updates and deletions, e.g., decentralized Twitter, Reddit analogues), DAO proposals and discussions (so they can be edited, threaded, without bloating blockchain), game state data (like tracking an on-chain game’s off-chain stats or achievements in a mutable way), and metadata registries (e.g., an NFT project could use Ceramic to allow token owners to add additional info or provenance notes to their NFT which are then linked to the token). Essentially, anywhere you need a database in a decentralized context (but the data is not huge). It’s not for file storage at all – instead it complements file storage networks by handling the relational/structured data aspects.

To conclude, the landscape of decentralized storage vendors offers complementary strengths. A complex dApp might actually use multiple of these: e.g. NFT marketplace could use IPFS+Pinata for quick asset serving, Filecoin to back up the assets long-term, Ceramic for user profiles and comments on NFTs, and maybe Arweave for its website and a permanent record of all NFT metadata. Each project – Arweave, Pinata/IPFS, Filecoin, Storj, Sia, Ceramic – contributes a piece to the emerging decentralized web infrastructure, balancing trade-offs in protocol design, cost, performance, and permanence.

Sources:

  • Reflexivity Research (2024) – Arweave Overview
  • Gate.io Research (2023) – Arweave: Pay Once, Store Forever
  • FiveT Investment (2023) – Decentralized Storage: Filecoin vs Arweave
  • Pinata Cloud – Official site and pricing
  • NFT.storage Blog (2023) – Partnerships with Pinata and Lighthouse
  • Storj Docs – Pricing and architecture; ownCloud – Storj integration
  • Messari (2024) – State of Sia Q3 2024
  • Ceramic Network – Official site (2025); LogRocket – Managing data with Ceramic
  • IPFS Docs – Comparisons
  • The Block (2022) – Meta uses Arweave for Instagram NFTs