Filecoin – A Cryptocurrency Operated File Storage Network

bogle · on Feb 24, 2017

Sia [http://sia.tech/]; filecoin; Storj [https://storj.io/]; maidsafe [https://maidsafe.net/]. Sounds like someone could do a good comparison article. Does anyone have any experience of using any of these? They do take quite a bit to set up compared to, say, Backblaze B2 or AWS S3.

Taek · on Feb 24, 2017

https://forum.sia.tech/topic/21/sia-vs-storj-vs-maidsafe

(I am the founder of Sia)

Sia's key distinguishing feature is that it's the only platform today which is fully/properly decentralized. If the MaidSafe devs today disappear, I believe all the servers running MaidSafe go with them. If the Storj team disappears, payments stop entirely and anyone using the Storj bridge (most of their users) will not be able to access files. If the Sia devs disappear, everything will continue to work as it currently does (though, bugfixes and feature adds will stop)

Sia distinguishes in two other major ways as well. First, we are the only platform that allows hosts to put up collateral - if a host loses your data, they don't just lose revenue they also lose collateral money that they put up as a promise to keep the data safe. Second, Sia is the only network that gives you full control over which machines your data ends up on. This is good if you want to run a CDN, need to comply with data laws, or have some other reason for favoring a particular region or set of hosts.

I encourage you to dive deep in the technical details, I think you'll find that Sia is pretty far ahead of everyone else.

xvolter · on Feb 24, 2017

Seems really annoying to get setup with Sia though. You need to have at least 50,000 SC to begin hosting, and you can't earn SC until you start hosting. You even need SC to announce your host (15 SC), and the apps don't explain if the software deals with dynamic IPs or if it supports hostnames instead of just IPv4 addresses. Also, it seems like it is only using IPv4 addresses, kind of short minded for modern technology.

How are you even supposed to get started using Sia if there's no way to do it except to go and spend money. The point of being able to host files is that I'll offer up some storage on my NAS, earn coins, then be able to backup some of my own important documents on the network. Seems like if I have to dish out money to get started I doubt I'd ever be able to maintain the service without continuously spending money. So many other syncing technologies seem easier than this.

Taek · on Feb 24, 2017

Hostnames, IPv4, and IPv6 are all supported. Dynamic IP addresses are also supported, but they are generally a lot less reliable (the renter needs a way to discover that the host IP has changed, and often this discovery is late). http://siapulse.com/page/network has examples of hostnames, IPv4, and IPv6 addresses. Not all home connections are able to use IPv6, so if you don't see any it's probably a problem with your home setup.

> You need to have at least 50,000 SC to begin hosting, and you can't earn SC until you start hosting.

This is one of the ways that we address churn on the network. It's actually really bad for the network for a host to get started, be around storing files for a day or two, and then leave. If you are going to be a host on the Sia network, it's important that you are committed.

> The point of being able to host files is that I'll offer up some storage on my NAS, earn coins, then be able to backup some of my own important documents on the network.

Sia is not designed as a tit-for-tat network. The economics are supposed to work much like Bitcoin + Bitcoin mining. That is, you have a dedicated set of specialists who work hard to provide a high quality service at a very good price. If you are hosting, the goal is to make money, not to be able to afford to back up your own data.

> So many other syncing technologies seem easier than this.

Yes but none of them are offering storage at $1 / TB / Mo. But also, Sia is still an early product and we're still iterating heavily on both the core algorithms and on the user experience.

Sia's core user market is people who are willing to pay for cloud storage. This would include businesses and enterprises. Sia's long term game is to offer a cloud storage platform that competes with the likes of Amazon S3 at less than 10% of the price, all while offering the security, privacy, decentralization, and open source codebase of a blockchain project.

problems · on Feb 24, 2017

Sia seems really cool - out of curiosity, is there a way to see the available storage prices on the network?

Taek · on Feb 24, 2017

SiaPulse.com has some very rough estimates.

The master branch client has a command 'siac renter prices' that will look at the network and estimate the costs. This feature hasn't been released yet because it's very recent.

My client is saying $0.50 to create a set of contracts (needs to be done once every six weeks), $0.60 per TB to download from the network, $1 / TB / Mo to store data on the network, and $0.30 per TB to upload to the network.

These prices include a redundancy of 3x in the upload and storage costs.

Long term I expect prices to not be more than about 2x what they are today, though it really depends on how the supply-demand mechanics play out. It's a fully open and competitive marketplace.

waynenilsen · on Feb 25, 2017

Maidsafe is fully decentralized. It is still in testing but full networks can and have been run without maidsafe servers. Any node can be used to bootstrap

brilliantcode · on Feb 24, 2017

what problems did end users solve with Sia?

super3 · on Feb 24, 2017

CEO of Storj here. You can sign up to use Storj and get the tools working to upload/download your files in 60 seconds. Last time I tried to use S3 it was 12 steps, and a phone call. Its faster, more secure, 50% cheaper than traditional cloud storage.

Basics are data stored for each. Storj: ~1400 TB, Sia: ~100 TB, Filecoin: Not Functional, Maidsafe: Not Functional.

mintplant · on Feb 24, 2017

Working clickable links:

https://sia.tech/ https://storj.io/ https://maidsafe.net/

j_s · on Feb 24, 2017

A good comparison article would include costs and the difficulty of obtaining that info... feels fuzzy for most of these, and that's a big problem for adoption.

Taek · on Feb 24, 2017

Happy to fill in:

Prices on Sia:

- siapulse.com contains some information about pricing

- running `siac renter prices` on client on the master branch (yet unreleased) will give you a price estimate given the set of hosts you know about. This feature will be available in the next release.

- Storage cost: about $1 / TB / Mo (for 10-of-30 redundancy)

- Bandwidth cost: about $0.60 / TB

Prices on Storj:

- Not sure how the prices get set, but the prices are advertised on their website

- Storage cost: $15 / TB / Mo

- Bandwidth cost: $50 / TB

Prices on MaidSafe:

- No price given, but the claim is that it will adjust according to supply and demand. Should be within a small factor of Sia if that is the case (Sia's prices are also set by supply + demand).

myowncrapulence · on Feb 24, 2017

Instead of simply using bitcoin (internet), these companies all created arbitrary currencies (intranet) to essentially generate profit for themselves. Having seen dozens of literally identical projects I can promise you all of these "networks" which utilize their own arbitrary currency are scams.

sneak · on Feb 24, 2017

The "altcoin == scam" assertion doesn't hold up under scrutiny.

Bitcoin is IPv4. There is lots of research into the next, better version.

_prometheus · on Feb 24, 2017

Hey Everyone, thanks for the great discussion and excellent questions. Was surprised to see this rise up to FP, given we have not updated the site in ages. :] Thanks for the great discussion here! Unfortunately i can't give much in form of answers as time is super punishing right now :(. Good news is we're hard at work, and we'll have a big announcement within 2 mo. ;) See you then! --- Oh also, big shoutout to Sia, Storj, & Swarm. They're all doing solid work on this. One thing that we've been working on is how to increase collaboration, interop, and shared upside across our systems. Stay tuned!

(I'm jbenet, of IPFS, Filecoin, & Protocol Labs)

lgierth · on Feb 24, 2017

The latest news on Filecoin is that it's being built on top of Ethereum: https://www.youtube.com/watch?v=Itb_2EMgBUI

While Filecoin isn't available yet, IPFS and libp2p have been functional for more than two years, and can be used independently of Filecoin: https://github.com/ipfs/ipfs && https://github.com/libp2p/libp2p

(IPFS dev here)

infinisil · on Feb 24, 2017

I believe filecoin is going to be a part of IPFS [1] at some point in the future, but development hasn't started yet. I really hope all the decentralized technology is going to blow up.

[1]: http://ipfs.io/

api · on Feb 24, 2017

I feel like it's approaching a tipping point. We have good decentralized networking, storage, currency, etc.

A missing piece is decentralized compute at scale. That's hard, especially for security reasons.

It's too bad Amazon branded "serverless" because lambda is not serverless. It's one big mainframe server. This stuff is serverless.

kefka · on Feb 24, 2017

In all honesty, I need to try building something that does semi-distributed computation. Here's my idea to throw in the pool.

First, you don't get something for nothing. So you need computers (surprise). Now, how do you share code? You can share code directly via IPFS. Now, admittedly, binary is the fastest, but limits your architecture rather greatly. So an interpreted language seems better.

I'm looking at Erlang, given its functional attributes and ability of swapping code with no downtime. It also works in a cluster.

I also look at Tor for entry points to get requests on these machines. We already can interact with IPFS for files, but it has no logic capability. And Ethereum is a joke in many ways (and it simply can't interact outside its blockchain).

The idea: IPFS filestore for storing functions, and chaining IPFS hashes to be ran by a Erlang cloud connected by Tor Onion links.

It's not completely serverless computation, but you can ideally abstract out the server so it doesn't matter where it is, or who handles it.

Other ways to go about this would be homomorphic encryption (way too computationally expensive at this time)... Or some sort of trusted computing (shudder).

chriswarbo · on Feb 24, 2017

> Ethereum is a joke in many ways

Ethereum is/was pretty interesting; but as with all blockchains, it's solving a globally-distributed byzantine/trustless problem, at the expense of being massively inefficient.

It's a good idea for that domain, but such constraints don't apply to the majority of computing problems. For example, Web app A might want its events to appear in the same order to all of its clients, and Web app B might want the same for its events, but there's no reason to enforce that all clients of all apps see all events of all apps in the same interleaved order, in an open world where new apps can be created without any central authority, and where no app or client trusts any other app or client.

Regarding a practical language, I think something like Morte/Annah/Dhall would be nice as a way to:

- Use pure functional computation as a powerful 'sandbox' against causing nasty effects or having results affected by outside interference

- Use IPFS URLs as function names

- Use Church (et al) encoding and strong normalisation as a form of statically-checkable duck-typing (i.e. if it encodes to a duck, then it's a duck)

bergie · on Feb 24, 2017

We did an experiment on distributing work via IPFS and an Ethereum contract:

* JavaScript function to run, stored on IPFS

* Input data stored on IPFS

* Result of the execution gets stored on IPFS and hash written as contract fulfillment

https://github.com/flowhub/jsjob-ethereum

woah · on Feb 24, 2017

What's your goal here? Right now it just sounds like you want to use a bunch of cool technologies.

zapt02 · on Feb 24, 2017

Why is Ethereum a joke? It seems to do exactly what you are trying to, and you just discount it?

kefka · on Feb 24, 2017

Primarily because they have engaged in the "We didn't really like the way project XYZ worked out, so we're going to re-do."

Look at the DAO. It's more than enough reason to dismiss this as a libertarian capitalist's experiment, with straight up dictatorialism if they don't get their way.

splintercell · on Feb 24, 2017

> Primarily because they have engaged in the "We didn't really like the way project XYZ worked out, so we're going to re-do."

As someone who has tried to build a simple DAPP on bitcoin, my god, this is not what bitcoin was meant to do. I mean I love bitcoin, but building smart contracts on bitcoin is like using a screwdriver to implement a calculator.

Bitcoin took a route which isn't suited for things which Ethereum wants to achieve. A great example of this is Augur vs Truthcoin/Hivemind. The team behind former looked at Bitcoin first, but realized how much difficult it would be for them to build this on bitcoin so they went Ethereum route and now they are very close to their alpha, where as Truthcoin/Hivemind went nowhere.

UweSchmidt · on Feb 24, 2017

Elaborate?

Do you support Ethereum Classic and consider "being hacked by the DAO hack" == "things not going [their] way"?

Just curious.

kefka · on Feb 24, 2017

By Ethereum's own documentation, their code is the contract and the contract is the code.

Yes, it sucks that they did not write tighter code and allowed a gotcha.... But when that happens in the legal world, people do pay for it.

Instead, because one of founders put up a lot of ETH, they changed the rules of the game. Now, the question is, "How close to the CEO of ConsenSys do you need to be to roll out bad contracts?".

That's what makes it worthless.

UweSchmidt · on Feb 24, 2017

Right. This also means that you can't currently trust code that runs on the blockchain since there are likely more bugs like the one that was exploited. Thoughts on that?

The change in Ethereum was "voted" for by a majority of hash power (but of course as influenced by the Ethereum leadership, so people had the popup in the client along with a recommendation what to vote for). So isn't that correction also in the spirit of Ethereum? Miners decide.

How about fixing obvious bugs, but otherwise upholding contracts?

PeCaN · on Feb 24, 2017

> This also means that you can't currently trust code that runs on the blockchain since there are likely more bugs like the one that was exploited.

To elaborate, it's especially hard to trust ETH code because it's not provable in any way. Bugs will hide, and people will find them. If ETH wasn't turing-complete this would be less problematic (because static analysis would be much easier, and possible), but as it stands there's no way to trust Ethereum contracts.

danielpatrick · on Feb 25, 2017

You don't understand what happened. The CEO (lol) doesn't just get to choose. The network's hashing power gets to choose.

The rules aren't: "the code is the contract".

The rules are: "the code that is accepted by a majority of hashing power is the contract."

chuhnk · on Feb 24, 2017

https://golem.network/

adrianN · on Feb 24, 2017

There is https://boinc.berkeley.edu/

api · on Feb 24, 2017

Yes, but I mean something more general purpose and developer friendly.

If I'm writing a mobile app that needs heavy compute (too heavy for small mobile devices) can I distribute that compute across larger peers like desktops, NAS boxes, etc.? Not easily and not if it involves anything that anyone might ever want to attack or spy on.

saganus · on Feb 24, 2017

Then that would require a breakthrough in FHE no? I believe current implementations would be way too slow for this.

Or is there another way to outsource computation in a secure manner?

hex-m · on Feb 24, 2017

Another concept would be to slice your task it into small enough junks so that one would need many of them to extract meaningful information. That's not strictly secure, but nothing is. ;)

adrianN · on Feb 24, 2017

Either your chunks are big enough that the network delay doesn't matter, or you'll be killed by communication overhead. Having each server simulate a single transistor is reasonable safe, but you'll never complete your task.

tech_man7 · on Feb 26, 2017

I think this is their slack channel: http://golemproject.org:3000/

might be worth asking there

tromp · on Feb 24, 2017

I wonder how these proof-of-retrievabililty blockchains deal with the potential problem of miners injecting a ton of fake files (e.g. whose i'th block is H(secret||i)) into the system in order to boost their chances of being able to do a retrievabililty proof without actually needing storage.

Is it simply a matter of mining rewards not being able to compensate for the cost of injection?

Taek · on Feb 24, 2017

Is there more than just Filecoin? Sia and Storj both only do file contracts, consensus is driven by other means (Proof of Work for Sia, Counterparty for Storj).

As for Filecoin, I believe that it has several significant issues, both with the game theory (super linear returns when investing in hashrate and storage simultaneously - a massive centralization pressure), and with data withholding attacks (to mine on Filecoin, you need to have the Filecoin data. What if the existing miners decide not to give it to you?).

tromp · on Feb 24, 2017

I see this is answered in Section 2.7:

"Altering the reward scheme must include careful analysis of the resulting market equilibria. For instance, if the currency is too inflationary, then attackers may benefit from adding large amounts of “dummy data” that they can easily reproduce without incurring the cost of storage (e.g., the outputs of a pseudorandom function for which they know the secret key), and thereby gain a net advantage in the challenge reward system over the long term."

_kbso · on Feb 24, 2017

Seems to be the same as Sia [1]. Except that you can already use Sia today.

[1] https://sia.tech/

iamgopal · on Feb 24, 2017

What's your experience of it ?

xkxx · on Feb 24, 2017

Hmm, and their last activity on Twitter was 3 Nov 2015: https://twitter.com/MineFilecoin/with_replies. Why is it important? They provide Twitter and email newsletter as main ways to follow their news. If there was no activity for over a year, the project might be stalled.

tyingq · on Feb 24, 2017

If it takes off, will be interesting to see how they deal with the inevitable questionable content and resulting DMCA takedown requests.

Taek · on Feb 24, 2017

Takedown requests would need to be sent to the nodes holding the data or to servers holding links to the data. I don't think the dev teams or parent companies would have too much to worry about.

tyingq · on Feb 24, 2017

At a high level, the idea seems to hinge on "incentive to hold the data". Dealing with DMCA becomes a disincentive.

Kinnard · on Feb 24, 2017

Filecoin's actually been around for several years. I think Storj is based off of it.

tyingq · on Feb 24, 2017

Ahh, thanks. Storj doesn't appear to support file sharing though, looks more like personal storage, which would make this less of an issue.

Edit: Might want to put 2014 in the title.

Kinnard · on Feb 24, 2017

I think filecoin's even older than that though this site appears to be from 2014.

Taek · on Feb 24, 2017

I believe that Filecoin is from early 2014, same as Sia and Storj.

super3 · on Feb 24, 2017

Filecoin whitepaper came out a few months before the Storj one. Filecoin however is not functional, while Storj has been functional for about 8 months.

brador · on Feb 24, 2017

In that case, how are they dealing with questionable content and resulting DMCA takedown requests?

jakeywankenobi · on Feb 24, 2017

I'm always curious about how things like this would do if an effort was made to market it to the general populace. The makers are all amped about the underlying tech (and rightfully so), but it's a hard thing to really care about outside that niche space. What if you hid all the underlying tech, layered on a good UX, and spoke to the value proposition?

vasili111 · on Feb 25, 2017

Does anyone use one of the decentralized storage networks? Please tell us about your personal experience.

OJFord · on Feb 24, 2017

This looks really interesting, but the misuse of 'rent' all over the landing page is confusing at first, ("I get paid to use someone else's storage space?") and subsequently distracting.

homakov · on Feb 24, 2017

I know how hard it is to explain an outsider what your product does, but maybe you could try to explain to me who should be using it? Personally, I have enough space on my laptop.

INTPenis · on Feb 24, 2017

Isn't KBFS (keybase) also based on blockchain technology?

mundo · on Feb 24, 2017

I believe Keybase (the identity and key management tool) stores signatures on the BTC blockchain, but KBFS (the encrypted shared filesystem built on Keybase) does not, it is just S3 storage that they're offering as a freebie (10GB/user) to drive adoption.

vocatus_gate · on Feb 24, 2017

Prediction: Dead or abandoned in <= 2 years.

verelo · on Feb 24, 2017

I see you've heard of startups before...

Seriously though, that is a likely outcome, but most good ideas do not sound like good ideas to begin with.

My bigger concern here is the types of files people use this service for, and as distributed host of these files, what are the legal ramifications on me for doing this?

brilliantcode · on Feb 24, 2017

say you want to "sprinkle" cryptocurrency on a really bland idea.

What is the fastest and cheapest way in doing so?

Just so you can put "cryptocurrency" on your website to get marketing exposure.

That's about the only value I see with the current crypto business. Hype.

eptcyka · on Feb 24, 2017

This isn't anything novel. How are they different from https://maidsafe.net/ ?