This is explained a bit in a blog post[1]. Basically, this is because FaunaDB us...

ergl · on March 15, 2017

Yes, it has to do with the way Calvin handles transactions, which are required to declare their read and write sets before executing.

This kind of transaction is also called static, and is normally of the form "I have these read/write operations against multiple keys, go and do them", as opposed to dynamic transactions, which might depend on values read from the db to figure out what to do next.

jchrisa · on March 15, 2017

We allow you to make different writes depending on reads. The query language has loop and control flow structures, so you can push a lot of the logic to the database. You can read more about the query language here: https://fauna.com/documentation/queries

ergl · on March 15, 2017

You can issue writes depending on reads, but are they performed in the same transaction?

If I do `if(read(A) = 0) then write(B, 1) else write(C, 2)`, would that be executed in 1 or 2 transactions?

dwenzek · on March 16, 2017

It should be executed in a single transaction. But the query may be scheduled more than once.

According to the Calvin paper, there are two strategies here:

* Either the write set of the query is considered to be {B,C}.

* Or the query is rewritten and scheduled several times, the `if` operation being replaced by an assert operation, until the assertion is indeed true, in which case the transaction proceeds using a smaller write set. If not, the query is scheduled again after having been rewritten to match the current values. In other words, if A equals 0 then the query is rewritten as `assert A == 0 then Write(B,1)`. Otherwise the query is rewritten as `assert A != 0 then Write(C,2)`.
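
To make the second strategy concrete, here is a minimal sketch in Python. It assumes a toy in-memory dict as the database; the names `rewrite`, `run` and `interfere` are hypothetical and come neither from the Calvin paper, nor from the prototype, nor from FaunaDB. The `interfere` hook only simulates a concurrent writer that changes A between the rewrite and the execution.

```python
# Toy sketch of the "rewrite the `if` as an assert and reschedule" strategy.
# The dict-based store and all function names are assumptions for illustration.

def rewrite(db):
    """Read A once and specialize the query into an assertion plus a single write."""
    if db["A"] == 0:
        return {"assert": lambda d: d["A"] == 0, "write": ("B", 1)}
    return {"assert": lambda d: d["A"] != 0, "write": ("C", 2)}


def run(db, interfere=None, max_attempts=10):
    """Reschedule the rewritten query until its assertion holds when it executes."""
    for attempt in range(1, max_attempts + 1):
        plan = rewrite(db)              # specialized for the values seen right now
        if interfere is not None:
            interfere(db, attempt)      # simulated concurrent writer
        if plan["assert"](db):          # still true: the write set is a single key
            key, value = plan["write"]
            db[key] = value
            return attempt, {key}
        # Assertion failed because A changed: rewrite and schedule again.
    raise RuntimeError("gave up after too many reschedules")


def flip_on_first_attempt(db, attempt):
    """Hypothetical interference: another transaction sets A to 5 once."""
    if attempt == 1:
        db["A"] = 5


quiet_db = {"A": 0, "B": None, "C": None}
quiet_result = run(quiet_db)

busy_db = {"A": 0, "B": None, "C": None}
busy_result = run(busy_db, interfere=flip_on_first_attempt)
```

In the quiet case the query runs once and only B is in the write set. In the busy case the first assertion fails, the query is rewritten against the new value of A, and the second attempt writes C instead.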

Months ago, I implemented a prototype based on the Calvin paper and used the first strategy. It may cause more contention and can even lock the whole dataset with queries like `Write(Read(A), a)`. Sadly, such queries are not rare: updating a distributed index is the typical case.
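
As a rough illustration of why the first strategy can end up locking everything, here is a small Python sketch that computes a conservative write set without reading any data. The tuple-based query representation, the `write_set` function and the four-key `ALL_KEYS` dataset are all hypothetical; this is not the code of the prototype.

```python
# Toy sketch of the "declare a conservative write set up front" strategy.
# Queries are nested tuples tagged by operation name (an assumed mini-AST).

ALL_KEYS = frozenset({"A", "B", "C", "D"})  # assumed key space of a toy dataset


def write_set(query, all_keys=ALL_KEYS):
    """Over-approximate the keys a query may write, without reading any data."""
    op = query[0]
    if op == "write":
        _, target, _value = query
        # A literal key is known up front; a computed key could be any key.
        return {target} if isinstance(target, str) else set(all_keys)
    if op == "if":
        _, _cond, then_q, else_q = query
        # Either branch may run, so both write sets must be declared.
        return write_set(then_q, all_keys) | write_set(else_q, all_keys)
    return set()  # reads, comparisons and constants write nothing


# if (read(A) = 0) then write(B, 1) else write(C, 2)
conditional = ("if", ("eq", ("read", "A"), 0), ("write", "B", 1), ("write", "C", 2))

# Write(Read(A), a): the target key is only known after reading A.
indirect = ("write", ("read", "A"), "a")

conditional_keys = write_set(conditional)
indirect_keys = write_set(indirect)
```

The conditional query declares both B and C, while the indirect write has to declare every key, which is the "lock the whole dataset" case.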

What is FaunaDB's approach?

ergl · on March 16, 2017

Ah, yeah, I suspected it could be fixed with your first approach, but the second one also sounds plausible.

But if you go with the first option, then you can end up locking the entire database, as you pointed out with `write(read(A), _)`. Wouldn't your second option cause multiple transaction aborts until you find the rewrite that satisfies the assert? And in that case, the scheduler would need to know if the abort was actually caused by a faulty rewrite.

It would be interesting to know what Fauna does in these cases.

Is your implementation open somewhere?

dwenzek · on March 16, 2017

The prototype is unfortunately tied to a more global project, which I'm not ready to make open.

This is a POC implemented in OCaml using Kafka. I'll take a few hours to see what I can open.

ergl · on March 16, 2017

Sounds great! Looking forward to it.

dwenzek · on March 16, 2017

Here it is:

https://github.com/didier-wenzek/poc-calvin-transactions

ergl · on March 16, 2017

Thanks for opening it up! I'll give it a look as soon as I get the time.