managing the # of actors, safely dealing with tombstones and reducing space comp...

rzezeski · on July 21, 2014

Not only that, why make every user reimplement the same resolution strategies? Or make Basho dev's reimplement them for each official client? You'll increase the chance for bugs or subtle semantic differences. Also, many users don't know or want to understand conflict resolution. They want a data structure that just "does the right thing". This is most easily achieved by baking it into the database itself.

Disclaimer: Former Basho developer with a lot of pride in the 2.0 release.

lomnakkus · on July 21, 2014

Can you add (or link to) something with more details?

"Managing the # of actors" doesn't make much sense to me without further context. I know that Erlang is actor-based, but that shouldn't matter if I'm e.g. a client of the system? Also, I'm not sure what "safely dealing with tombstones" would mean -- what additional safety is added by "server-side" CRTDs which cannot be achieved with client-side code (and how so?). (Etc.)

cmeiklejohn · on July 21, 2014

Full disclosure: I’m a current employee of Basho Technologies.

Other than some supporting work [1] [2] introduced into 2.0 to provide advanced causality tracking, you’re correct in assuming we could have introduced more [3] CRDTs as part of the Riak 1.x series. We could have also implemented all of the CRDTs we provide in the client as well, which is similar to what the SwiftCloud CRDT reference platform does.

There's a couple important things to note here, however:

* When talking about merging conflicting writes, we are specifically referring to state-based CRDTs, not operation-based, which is what we have implemented in Riak.

* Retrieving conflicting writes from the client, or siblings as we call them in Riak, requires bringing all of the siblings to the client, performing the merge operation, and shipping the updated state back. Given this, the number of siblings an object can have on disk, assuming all merge operations happen at the client, is potentially unbounded if you never ever read, and only ever write. When implemented on the server, we can ensure that we perform this merge operation during both the read and write cycle, keeping the sibling count down to one and reducing the amount of state we need to ship to the client.

* In addition, we use the coordinating node (really, a combination of virtual node and partition index) of the write as the "participant" or "actor" for the operation. This is not to be confused with actor model based languages. This allows us to have better control over actor growth; when dealing with clients all writing to CRDTs, every single particpant needs to have a unique actor id. Recall, that most of the CRDTs track actor counts, for instance the G-Counter which is structurally equivalent to a vector clock, although semantically different. This introduces a problem of garbage collection. Interval tree clocks, is one such solution for addressing the problem, but can not be used as the basis for some CRDTs. [4]

* Finally, there is work underway in making state-based CRDTs more efficient through "delta-CRDTs" [5], which allow for a more efficient optimistic and anti-entropy repair mechanism.

While the most notable resource for exploring CRDTs continues to be the comprehensive report by Shapiro, et al, [6] in practice many of the data structures outlined here have unbounded growth in garbage (specifically referring to items such as the OR set, which tracks an object for every operation performed). Therefore, we rely on some of the more optimized representations which don’t accumulate garbage. [7] In addition, the conflict-free, composable, replicated map structure, which is provided by Riak 2.0 was specifically invented by Basho, and it is the first of its kind. [8] It took many hours and iterations on QuickCheck models to ensure that, given somewhat arbitrary composition, that merge operations happened correctly. This is why there has been interest in exploring alternative ways of checking or building these models. [9]

By storing these CRDTs at the server-side, we also are able to provide a operation-based interface for interacting with these objects from all of our clients, and leave the complexity of implementing the CRDTs out of the client. This additionally allows for our search offering, Yokozuna, to be able to index these data types and provide query over their values.

[1] https://github.com/basho/riak_kv/pull/746

[2] https://github.com/basho/riak_core/pull/463

[3] http://basho.com/counters-in-riak-1-4/

[4] http://gsd.di.uminho.pt/members/cbm/ps/itc2008.pdf

[5] https://twitter.com/xmal/status/467331615535149059

[6] http://hal.inria.fr/inria-00555588

[7] http://arxiv.org/abs/1210.3368

[8] http://dl.acm.org/citation.cfm?id=2596633

[9] http://arxiv.org/abs/1406.4291

* Edited to fix citation formatting.

cmeiklejohn · on July 22, 2014

For what it's worth, I'm putting up a more permanent resource with information, links to papers, etc., on my blog:

http://christophermeiklejohn.com/crdt/2014/07/22/readings-in...