estimated reading time: ≈ 34 min.
2021-01-22
TL;DR: A tutorial for building a database that replicates like CouchDB in Python. Also introduces ChairDB.
Years ago, I first encountered CouchDB. Soon, I was hooked. CouchDB is a database that allows you to store and retrieve JSON documents through an HTTP API. If you have multiple installations, say on a server and your own computer, you can easily and quickly synchronize your database between them through a process called replication. On top of that, it allowed you to host your web application directly from your database using so-called CouchApps[1]. It was amazing.
At the time, I heavily used an application I developed for my personal use. It was hosted both by my laptop’s and my web server’s CouchDB. When I made a change on my laptop, either to the application itself or by entering some data, it would transparently push the change to the server and vice versa. If I did not have an internet connection[3], I could access the application locally. When at someone else’s computer, I could access the application online. The same HTTP API that is used to keep two CouchDB databases in sync can also be used to get notified of every change to your database. This makes it easy to build applications that update in real-time when someone enters data, without having to reload. You simply listen to the ‘changes feed’, and process updates as they come in. This was very nice compared to having to implement long polling in your own backend API. Also, you would get an admin interface for free. The cost was adhering to CouchDB’s data model, but that was an acceptable trade-off.
Skip forward a couple of years, and somehow this nice way of developing applications seemed to have mostly failed to achieve traction. Maybe because users don’t like it if you ask them to install a database just to get your application working offline. Or perhaps developers don’t like being that constrained on the server, e.g. when it comes to authentication[4] and authorization[5]. Still, the CouchDB ecosystem was active and evolving. You could embed a CouchDB-like database in your smartphone application using TouchDB[6] (nowadays Couchbase Lite). And with the rise of JavaScript data storage APIs[7] in the browser came PouchDB, which allows you to do the same inside a web application. I jumped on that and for a while worked on trying to port the (by now mostly abandoned) CouchApp paradigm over to it, culminating in a demo which allowed you to run an existing CouchApp entirely from the user’s disk, with only minimal changes being necessary thanks to the then brand new service workers. But I never got around to extending ‘PouchApps’ into a full-blown project without the rough edges.
Forwarding to today, CouchDB is still going strong, and currently in the process of having its internals rewritten on top of FoundationDB. There’s even ongoing work to support document-level access control. PouchDB has matured, and is now a popular IndexedDB wrapper. The feature that sets it apart among such wrappers is replication.
My aim so far has been to explain why I think CouchDB and its replication are interesting, and to show the state of the field. While I have been a long-time CouchDB user, I never got around to actually trying to understand how it and its replication work from first principles. When I recently decided to examine what so far had seemed like a magic trick, my first thought was to look at PouchDB’s internals[8]. Sadly, that alone was not enough. Partly because PouchDB is a mature implementation which handles a lot of edge cases, and partly because its control flow is hard to follow, as it was written at a time when callbacks were still the only way of doing asynchronous control flow in JavaScript. To get the overview, I decided to build my own minimal prototype of a CouchDB-compatible database, including replication, instead. This blog post summarizes my attempt to do so using the Python programming language.
Before we get to it, I should mention Alexander Shorin previously embarked on a similar project. In my opinion it oversimplifies conflict handling a bit, which comes at a cost to correctness, but it’s an impressive resource worth checking out nonetheless. That project is what finally convinced me to give it a try myself. More recently, Garren Smith wrote a mini-CouchDB in Rust. That prototype isn’t complete enough to support replication, but it gives a nice introduction to using FoundationDB like CouchDB will soon do. Very much worth a look as well!
The first thing to figure out is how to represent documents and their metadata in our database[10]. If you’ve used CouchDB, you know that each document has (at least) two special fields: ‘_id’ and ‘_rev’. The first specifies the key under which the document is stored[11] in the database. It’s what you use to retrieve the document again. The second is short for ‘revision’. A revision consists of two parts: an incrementing integer telling us the version of this document, and a hash of the document and all its metadata. For example: ‘1-85a961d0d9b235b7b4f07baed1a38fda’. In this prototype, we will not actually bother with calculating hashes: as long as they uniquely identify documents, that’s good enough. Even PouchDB got away with using random UUIDs for quite some time without major issues. The revision is used to resolve conflicts. Let’s walk through an example to understand how that works.
Imagine a municipality that keeps track of all the trees it plants in a database. In the morning, employee Bob comes along and plants a tree somewhere at a roadside. He dutifully updates the record for the current plot of land on his smartphone (revision ‘1-1a9c’ gets replaced with ‘2-e3b0’), but as this is in a remote spot, the information is not synced to the municipality’s server immediately. To make matters worse, his phone’s battery dies before this can happen. While the phone is switched off, Jane reaches the same location with a second tree. She plants it, updates the database record (revision ‘1-1a9c’ gets replaced with ‘2-6e05’), and goes on her way. Soon afterward, her phone syncs her changes to the server (which now also has ‘2-6e05’ as its latest revision).
When Bob’s phone is turned on again, it also replicates its record to the database. Contrary to what you might expect, this succeeds. But now the server has two versions of the same document: ‘2-6e05’ and ‘2-e3b0’. This situation is called a conflict. If a user now requests the record for the roadside location, a CouchDB-like database will arbitrarily give them the one with the highest revision (the so-called ‘winner’). In this case, Bob’s:
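{'_id': 'roadside', '_rev': '2-e3b0', 'trees_count': 41}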
Clearly, this is unacceptable. If nothing were done, Jane’s record update would be lost, through no fault of her own. This is why CouchDB’s documentation recommends you either resolve conflicts on each read, or have a background process that watches for conflicts and resolves them. Let’s say the municipality uses the latter approach. This background process then requests the two latest versions of the document. It creates a record which includes both of the newly planted trees, and replaces ‘2-e3b0’ (Bob’s revision) with a new revision ‘3-5bd6’. This doesn’t yet resolve the conflict. For that, we also need to remove Jane’s revision (‘2-6e05’) by replacing it with a new revision (‘3-b617’). This revision marks the removal using an (otherwise empty) document with a ‘_deleted’ field. Such a revision is called a ‘tombstone revision’ in CouchDB terminology.
Note that if the background process had removed[12] Bob’s revision (‘2-e3b0’) and stored the new revision as a continuation of Jane’s revision (‘2-6e05’) instead, this also would have resolved the conflict. The point is that there should be at most one leaf revision that is not a tombstone revision[13].
The municipality story makes clear that just storing a revision per document isn’t enough. If the background process doesn’t tell the database which ‘branch’ to extend (Bob’s or Jane’s), the database cannot know which one to pick. In fact, the only logical solution from the database’s perspective in such a situation would be to add the new record as another conflicting revision, making the problem worse.
Instead, CouchDB-like databases keep all previous revisions. The most efficient way to store this is in a tree structure called the ‘revision tree’. The revision tree for our municipality example is the one we just built up step by step. It contains two (‘last’) leaf nodes, one of which is a tombstone revision, and has a single root node. To make sure revision trees don’t increase in size indefinitely as data is added to the database, CouchDB-like databases implement two mechanisms:
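- Revision pruning (also called ‘stemming’): each branch only keeps the hashes of its most recent revisions, up to a configurable limit (‘revs_limit’, 1000 by default); older ones are cut off.
- Compaction: the document contents of non-leaf revisions are thrown away; only their revision hashes remain in the tree. We model this by storing a document for the leaf of each branch only.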
With all that out of the way, it’s time to get busy. Let’s implement a revision tree. The easiest way is to implement it as a list of branches[14]. It makes the revision tree for the roadside record look like this in code:
>>> bob_branch = Branch(leaf_rev_num=3, path=['5bd6', 'e3b0', '1a9c'],
...                     leaf_doc_ptr={'trees_count': 42})
>>> jane_branch = Branch(leaf_rev_num=3, path=['b617', '6e05', '1a9c'],
...                      leaf_doc_ptr=None)
>>> tree = RevisionTree([bob_branch, jane_branch])
You can see that we store, for each leaf, its revision number, the revision hashes of itself and all its ancestor nodes, and the (user-supplied) document. If this is a tombstone revision, the document is None. Branch is simply a named tuple:
import typing

class Branch(typing.NamedTuple):
    leaf_rev_num: int
    path: list
    leaf_doc_ptr: typing.Optional[dict]

    def index(self, rev_num):
        """Convert a revision number to a Branch.path index"""

        return self.leaf_rev_num - rev_num
The index method allows us to get the revision hash for a revision number. Let’s verify it works:
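>>> bob_branch.index(2)
1
>>> bob_branch.path[bob_branch.index(2)]
'e3b0'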
It does. Now what about RevisionTree? We decided our RevisionTree representation is essentially a list (of branches), so let’s inherit from that. If we keep the branches ordered by their leaves’ revision number and hash, that should make determining the winning revision easier later on.
class RevisionTree(list):
    def __init__(self, branches):
        super().__init__(branches)

        # used to keep the tree sorted by each leaf's revision number and hash
        self._keys = [self._by_max_rev(branch) for branch in self]

    def _by_max_rev(self, branch):
        # branch.path[0] is the leaf's revision hash
        return branch.leaf_rev_num, branch.path[0]
So far so good. Now let’s implement some of the revision tree operations we discussed. First, finding the branch with the winning revision:
def winner_idx(self):
    """Returns the index of the winning branch, i.e. the one with the
    highest leaf rev that isn't deleted. If no such branches exist, a
    deleted one suffices too.

    Assumption: branches are sorted already. (Longest branches & highest
    rev hashes last)

    """
    for i in range(len(self) - 1, -1, -1):
        if self[i].leaf_doc_ptr is not None:
            return i  # we have a non-deleted winner
    return len(self) - 1  # no non-deleted ones exist
Note that the loop iterates from the end of the list to the start. This means it iterates from the branch with the highest leaf revision to the branch with the lowest leaf revision. Let’s make sure this correctly points to Bob’s branch as the winner:
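>>> tree.winner_idx()
0
>>> tree[tree.winner_idx()] == bob_branch
True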
Useful. But what if we don’t want to get just the winner, but want to find branches by revision? That’s possible too:
def find(self, rev_num, rev_hash):
    """Find the branches in which the revision specified by the arguments
    occurs.

    """
    for branch in self.branches():
        i = branch.index(rev_num)
        if 0 <= i < len(branch.path) and branch.path[i] == rev_hash:
            yield branch
Not so fast: what’s this self.branches() thing? Well, it’s a trivial method, but I gave it a name because it’s very often useful:
def branches(self):
    """All branches in the tree. Those with the highest revision number and
    hash first.

    """
    return reversed(self)
OK. We can now find out Bob’s branch, knowing only the revision he added, as follows:
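>>> list(tree.find(2, 'e3b0')) == [bob_branch]
True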
Success! One final method for querying the RevisionTree: what if we want to iterate over all revisions in the tree? That might not sound useful now, but I promise we’ll find a use for it when implementing replication later on:
def all_revs(self):
    """All revisions in the tree as (branch, rev_num) tuples."""

    for branch in self.branches():
        for i in range(len(branch.path)):
            yield branch, branch.leaf_rev_num - i
Alright, by now we have a feel for what a revision tree looks like, but we need to figure out how to build one up incrementally.
What information is available during insertion? Well, definitely the new revision’s revision number. We also talked about requiring one or more ancestor revision hashes next to the ‘new’ one. And of course we need the document contents. Oh, and finally the revision limit used for pruning.
It turns out there are four possible cases during insertion, but our first function will only handle two of them itself:

1. The revision we are inserting is already in the tree. In that case, there is nothing to do.
2. The new revision(s) extend an existing branch, i.e. they replace a revision (‘1-1a9c’) that was a leaf revision with one (or more) new revision(s): ‘2-e3b0’ or ‘2-6e05’, in their case.

The implementation is as follows. There are inline examples to understand how the code handles both cases:
def merge_with_path(self, doc_rev_num, doc_path, doc, revs_limit=1000):
    for i in range(len(self) - 1, -1, -1):
        branch = self[i]

        # 1. check if already in tree. E.g.:
        #
        # branch.leaf_rev_num = 5
        # branch.path = ['e', 'd', 'c']
        #
        # doc_rev_num = 3
        # doc_path = ['c', 'b', 'a']
        j = branch.index(doc_rev_num)
        if 0 <= j < len(branch.path) and branch.path[j] == doc_path[0]:
            return  # it is. Done.

        # 2. extend branch if possible. E.g.:
        #
        # branch.leaf_rev_num = 3
        # branch.path = ['c', 'b', 'a']
        # doc_rev_num = 5
        # doc_path = ['e', 'd', 'c', 'b']
        k = doc_rev_num - branch.leaf_rev_num
        if 0 <= k < len(doc_path) and doc_path[k] == branch.path[0]:
            full_path = doc_path[:k] + branch.path
            del self[i]
            del self._keys[i]
            self._insert_branch(doc_rev_num, full_path, doc, revs_limit)
            return  # done extending

    # otherwise insert as a new leaf branch:
    self._insert_as_new_branch(doc_rev_num, doc_path, doc, revs_limit)
Note that for the typical case (2), we no longer need the ‘old’ branch, as it is completely incorporated into the ‘extended’ branch. That’s the reason the delete statements are in there. There are some unfamiliar functions in this code block. We’ll get to self._insert_as_new_branch soon; it implements the third and fourth cases we talked about previously. But this is a good moment to introduce self._insert_branch, which creates a branch and inserts it in the location that maintains the tree ordering.
import bisect

def _insert_branch(self, doc_rev_num, full_path, doc, revs_limit):
    # stem using revs_limit
    assert revs_limit > 0
    del full_path[revs_limit:]

    branch = Branch(doc_rev_num, full_path, doc)

    # actual insertion using bisection
    key = self._by_max_rev(branch)
    i = bisect.bisect(self._keys, key)
    self._keys.insert(i, key)
    self.insert(i, branch)
As you can see, revision pruning becomes a simple operation courtesy of our ‘list of branches’ approach. If you don’t know about bisect.bisect, here’s the short version: given a sorted list, it gives you the place to insert an item which will keep the list sorted. And it can do so efficiently, without going through the whole list[15].
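For example:

>>> import bisect
>>> bisect.bisect([1, 3, 5], 4)  # inserting 4 at index 2 keeps the list sorted
2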
That handles the simple cases. So what are those other two?

3. The new revision(s) conflict with the branches already in the tree. Think of Bob’s phone finally pushing his update to the server: Jane’s revision (‘2-6e05’) isn’t in ‘his’ branch. In this case, we could just insert Bob’s branch as-is, but we don’t quite do that. Instead, we check if we can find any common revisions in the past. The reason we do this is to make sure the revision information in the ‘new’ branch is as complete as we can make it. In this case it doesn’t make any difference, but if a lazy user only supplies the bare minimum of ancestor revisions, it’s useful.
4. The RevisionTree is still completely empty. We simply insert the new branch as-is in that case.

The code to handle these two more cases is as follows:
def _insert_as_new_branch(self, doc_rev_num, doc_path, doc, revs_limit):
    for branch in self.branches():
        # 3. try to find common history
        start_branch_rev_num = branch.leaf_rev_num + 1 - len(branch.path)
        start_doc_rev_num = doc_rev_num + 1 - len(doc_path)
        maybe_common_rn = max(start_branch_rev_num, start_doc_rev_num)

        branch_i = branch.index(maybe_common_rn)
        doc_i = doc_rev_num - maybe_common_rn

        common_rev = (
            0 <= branch_i < len(branch.path) and
            0 <= doc_i < len(doc_path) and
            branch.path[branch_i] == doc_path[doc_i]
        )
        if common_rev:
            # success, combine both halves into a 'full_path'
            full_path = doc_path[:doc_i] + branch.path[branch_i:]
            break
    else:
        # 4. a new branch without shared history
        full_path = doc_path

    self._insert_branch(doc_rev_num, full_path, doc, revs_limit)
It’s worth noting that I make use of a for-else loop here, a statement which as far as I know is exclusive to Python. Anyway, that’s all there is to the revision tree class. Let’s try out our nice new merge method:
>>> tree2 = RevisionTree([])
>>>
>>> # the initial situation, uses case 4.
>>> tree2.merge_with_path(1, ['1a9c'], {'trees_count': 40})
>>> print(tree2)
[Branch(leaf_rev_num=1, path=['1a9c'],
        leaf_doc_ptr={'trees_count': 40})]
>>>
>>> # Jane uploads her record, which uses case 2.
>>> tree2.merge_with_path(2, ['6e05', '1a9c'], {'trees_count': 41})
>>> print(tree2)
[Branch(leaf_rev_num=2, path=['6e05', '1a9c'],
        leaf_doc_ptr={'trees_count': 41})]
>>>
>>> # Bob uploads using case 3, which creates the conflict.
>>> tree2.merge_with_path(2, ['e3b0', '1a9c'], {'trees_count': 41})
>>> print(tree2)
[Branch(leaf_rev_num=2, path=['6e05', '1a9c'],
        leaf_doc_ptr={'trees_count': 41}),
 Branch(leaf_rev_num=2, path=['e3b0', '1a9c'],
        leaf_doc_ptr={'trees_count': 41})]
All seems to be in order. Next up, building a database out of the primitives we just wrote. The hardest part is behind us!
As this post tries to keep things simple, we won’t worry about persisting data to disk. Instead, we’ll build an in-memory database. For indexes, we use sortedcontainers’s SortedDict type. This allows us to both efficiently iterate over keys in sorted order[16], and to quickly retrieve their associated values[17].
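For example (the irange method will come in handy for the changes feed later):

>>> import sortedcontainers
>>> index = sortedcontainers.SortedDict({2: 'b', 3: 'c', 1: 'a'})
>>> index[2]
'b'
>>> list(index.irange(minimum=1, inclusive=(False, True)))
[2, 3]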
On the other hand, I will try to separate generic functions from ‘memory database’ specific functions, allowing you to re-use the former when working on a disk-backed database.
Time to introduce some more CouchDB concepts, just in case you haven’t heard of them yet:

- The update sequence: a counter that is incremented on every change to the database. It allows asking the database what changed since a certain point in time.
- Local documents: documents that are neither versioned nor replicated. You can recognize them by their ID, which starts with ‘_local/’, unlike the ordinary ‘_id’ values we discussed previously. Because of this, we implement a separate index for them. They will also have their own specialized (but simpler) code.

With that out of the way, let’s look at what our main class is going to look like[18]:
import uuid

import sortedcontainers

class SyncInMemoryDatabase:
    def __init__(self, id=None):
        self.id_sync = (id or uuid.uuid4().hex) + 'memory'
        self.update_seq_sync = 0
        self.revs_limit_sync = 1000

        # id -> document (dict)
        self._local = sortedcontainers.SortedDict()
        # id -> DocumentInfo
        self._byid = sortedcontainers.SortedDict()
        # seq -> id (str)
        self._byseq = sortedcontainers.SortedDict()
You see we have the separate index for local documents, as promised. That leaves the other two. First, the ‘by seq’ index maps the update sequence at the time of the last update to a document to that document’s ID. This allows us to reconstruct the order in which documents were updated. Replication needs that to keep track of what changed. But perhaps the most important index is the ‘by id’ index[19]. It maps document IDs to a DocumentInfo object, which is another named tuple type:
class DocumentInfo(typing.NamedTuple):
    rev_tree: RevisionTree
    winning_branch_idx: int
    last_update_seq: int
You already know all about the revision tree. We cache the output of RevisionTree.winner_idx() after each write in the winning_branch_idx field for efficiency. Finally, last_update_seq is the same value that’s used as key in the ‘by sequence’ index. We keep it around so we can remove old records from that index.
Time to implement some methods[20]. Let’s start with adding documents to our database. To do so, we first need to pre-process the user’s input, so let’s write a function for that. This consists of extracting information we need, and cleaning up the document before insertion. The exact steps are:

- extract the ‘_id’ field;
- for normal (non-local) documents, parse the ‘_rev’ field and normalize the ‘_revisions’ field, verifying that the two are consistent;
- if the document carries a ‘_deleted’ flag, replace it with None.

We then return the ID, the revision information (None for a local document) and the cleaned up document.
def prepare_doc_write(doc):
    """Normalize _revisions field & handle delete flag"""

    id = doc.pop('_id')
    if is_local(id):
        revs = None
    else:
        rev_num, rev_hash = parse_rev(doc.pop('_rev'))
        revs = doc.pop('_revisions', {'start': rev_num, 'ids': [rev_hash]})
        assert rev_num == revs['start'], 'Invalid _revisions'
        assert revs['ids'][0] == rev_hash, 'Invalid _revisions'
    if doc.get('_deleted'):
        doc = None
    return id, revs, doc

def is_local(id):
    return id.startswith('_local')

def parse_rev(rev):
    num, hash = rev.split('-')
    return int(num), hash
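A quick demonstration of that last helper:

>>> parse_rev('2-6e05')
(2, '6e05')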
The user-facing API is actually pretty simple. We take a document and store it, returning nothing. As explained before, we create a separate code path for local documents:
def write_sync(self, doc):
    id, revs, doc = prepare_doc_write(doc)
    if revs:
        self._write_normal(id, revs, doc)
    else:
        self._write_local(id, doc)
This separate code path just updates the key value store:
def _write_local(self, id, doc):
    if doc is None:
        self._local.pop(id, None)  # silence KeyError
    else:
        self._local[id] = doc
The more interesting code path uses the revision tree logic we implemented previously:
def _write_normal(self, id, revs, doc):
    try:
        tree, _, last_update_seq = self._byid[id]
    except KeyError:
        tree = None
    else:
        # update the by seq index by first removing a previous reference to
        # the current document (if there is one), and then (later)
        # inserting a new one.
        del self._byseq[last_update_seq]

    new_doc_info = update_doc(id, revs, doc, tree, self.revs_limit_sync)
    self.update_seq_sync += 1

    # actual insertion by updating the document info in the 'by id' index.
    self._byid[id] = DocumentInfo(*new_doc_info, self.update_seq_sync)
    self._byseq[self.update_seq_sync] = id
There’s quite a bit going on here. Let’s break it down. At this point, it’s clear we’re going to modify the database. So, we’ll have to update both of the main indices. Our first step is to get the existing revision tree (if there is one) for this document ID. Soon, we’ll overwrite it with a new one. We also remove the soon-to-be outdated entry in the ‘by sequence’ index.
After that’s done, we update the revision tree to include the new revision. We’ll get to the details of that soon. The important thing is that we get back a fresh revision tree and winning branch index from this process. The only thing left to do for us is to increment the ‘update sequence’, as the database has been modified, and to store the new values in the indices.
Alright, now that that’s clear, let’s go back to the part I glossed over:
def update_doc(id, revs, doc, rev_tree, revs_limit):
    if rev_tree is None:
        rev_tree = RevisionTree([])  # new empty tree

    rev_tree.merge_with_path(revs['start'], revs['ids'], doc, revs_limit)
    return rev_tree, rev_tree.winner_idx()
It’s not that complicated: we simply create a new revision tree if there wasn’t one, and merge the user’s new revision into the retrieved tree. Then we return the values we need in _write_normal. All the real logic was already implemented previously.
And that’s all there is to writing a document[22]. Let’s try it out, playing out what happens on Jane’s phone before syncing:
>>> db = SyncInMemoryDatabase()
>>> db.write_sync({'_id': 'roadside', '_rev': '1-1a9c', 'trees_count': 40})
>>> db.write_sync({'_id': 'roadside', '_rev': '2-6e05', 'trees_count': 41,
...                '_revisions': {'start': 2, 'ids': ['6e05', '1a9c']}})
>>> db._byid['roadside'].rev_tree
[Branch(leaf_rev_num=2, path=['6e05', '1a9c'],
        leaf_doc_ptr={'trees_count': 41})]
So far so good. But that last line isn’t a user-friendly way of reading documents at all. Let’s implement a better API.
Probably the most common operation on a database is to read a document given the document ID. But as we’ve seen, there isn’t necessarily a single document version. Instead, we allow the user to specify which version(s) they want to retrieve using a revs argument. It can have a couple of different values:

- 'winner': in this case, we just return the winning revision’s document. It’s what you get by default in CouchDB if you request a document without any further information. Use with caution, because this can give unexpected results if there are conflicts.
- 'all': in this case, we return all leaf revision documents. This may include tombstone revisions. The CouchDB equivalent is setting open_revs=all.
- a list of revisions: we return the documents for exactly those revisions, insofar as they exist. The CouchDB equivalent of this revs value is also the open_revs option.

There is one final parameter to our read method. It’s called include_path and tells us whether ancestor revisions should be included as a _revisions field of the returned document. This is useful during replication.
Phew. Who knew reading a document could get so complex? Time to look at the code:
def read_sync(self, id, revs, include_path=False):
    try:
        if is_local(id):
            # load from the _local key-value store
            yield to_local_doc(id, revs, self._local[id])
        else:
            # find it using the 'by id' index
            rev_tree, winner, _ = self._byid[id]
            yield from read_docs(id, revs, include_path, rev_tree, winner)
    except KeyError as e:
        raise NotFound(id) from e
As you can see, it mostly just queries the indexes and passes the result on to other functions, before yielding their response. Why a yield, and not a return? Because when revs does not equal ‘winner’, multiple revisions of the document might be returned. If the index doesn’t contain a document, the NotFound exception is raised. That exception’s definition is typical:
class ChairDBError(Exception):
    """Base class for all custom errors."""

class NotFound(ChairDBError):
    """Something (a document or database, probably) doesn't exist."""
Now, let’s first get the local document case out of the way. It isn’t really that interesting, it’s just a read from a key/value store after all. The biggest surprise is that we add a dummy ‘_rev’ value to imitate CouchDB…
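A sketch of what that helper could look like (the exact dummy revision value is arbitrary):

def to_local_doc(id, revs, doc):
    # local documents are unversioned: only a 'winner' can be requested
    assert revs == 'winner'

    return {**doc, '_id': id, '_rev': '0-1'}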
Alright, time to get to the more difficult case of reading normal documents from the revision tree. We first handle the ‘winner’ case. We can directly pull the correct branch from the revision tree:
def read_docs(id, revs, include_path, rev_tree, winning_branch_idx):
    if revs == 'winner':
        # winner information is passed in directly
        branch = rev_tree[winning_branch_idx]
        yield to_doc(id, branch, include_path)
    else:
        # ... walk the revision tree
        yield from read_revs(id, revs, rev_tree, include_path)
We’ll ignore the other (read_revs()) cases for now, and look at to_doc first. It reconstructs a CouchDB-style document given a branch. This mostly consists of re-creating the revision information. If we’re dealing with a tombstone revision, we also need to represent that information as JSON.
def to_doc(id, branch, include_path):
    doc = {'_id': id, '_rev': rev(branch, branch.leaf_rev_num)}
    if branch.leaf_doc_ptr is None:
        doc['_deleted'] = True
    else:
        doc.update(branch.leaf_doc_ptr)
    if include_path:
        doc['_revisions'] = {'start': branch.leaf_rev_num,
                             'ids': branch.path}
    return doc

def rev(branch, rev_num):
    return f'{rev_num}-{branch.path[branch.index(rev_num)]}'
Next, let’s look at the cases where revs is not ‘winner’. First, the ‘all’ case, which is easy thanks to our trusty RevisionTree.branches() method. And actually, the other case isn’t much harder thanks to the RevisionTree.find() method we wrote previously. Although it does require us to perform some parsing: our API expects revisions in string format, but find() expects a revision number and revision hash.
def read_revs(id, revs, rev_tree, include_path):
    if revs == 'all':
        # all leafs
        for branch in rev_tree.branches():
            yield to_doc(id, branch, include_path)
    else:
        # search for specific revisions
        for rev in revs:
            for branch in rev_tree.find(*parse_rev(rev)):
                yield to_doc(id, branch, include_path)
And that’s it. It’s everything you need to read from a CouchDB. You might notice that we didn’t implement a lot of options supported by CouchDB. That’s because you don’t need them during replication or casual use. Most of the remaining options are either attachment-related (which we don’t implement), convenience options for users, or expose more internal information to the caller.
Anyway, let’s read back the (single) document we just wrote to the database as a quick test:
>>> list(db.read_sync(id='roadside', revs='all', include_path=True))
[{'_id': 'roadside', '_rev': '2-6e05', 'trees_count': 41,
  '_revisions': {'start': 2, 'ids': ['6e05', '1a9c']}}]
It works! Much better than manually deciphering the ‘by id’ index.
By now we can read and write documents. But the replication process needs two more endpoints. Luckily, they are simpler to add.
First, as previously discussed, the replication process needs to have a record of all the changes that were made to the database. This record is the reason why we keep the ‘by sequence’ index, which we expose to the user through the changes feed API. This API has many options, but we limit it to the bare minimum required for replication. That includes implementing the endpoint such that it gives us all leaf revs (the non-default style='all_docs' option).
def changes_sync(self, since=None):
    for seq in self._byseq.irange(minimum=since, inclusive=(False, False)):
        id = self._byseq[seq]
        rev_tree, winner, _ = self._byid[id]
        yield build_change(id, seq, rev_tree, winner)
Alright, that’s doable. The user can specify from which update sequence to start listing all database changes[23], and then we get the document information for each of these changes. The only thing left is to give this information back to the user in a structure close to what CouchDB returns:
def build_change(id, seq, rev_tree, winning_branch_idx):
    winning_branch = rev_tree[winning_branch_idx]
    deleted = winning_branch.leaf_doc_ptr is None
    leaf_revs = [rev(b, b.leaf_rev_num) for b in rev_tree.branches()]
    return Change(id, seq, deleted, leaf_revs)

class Change(typing.NamedTuple):
    """A representation of a row in the _changes feed"""

    id: str
    seq: int
    deleted: bool
    leaf_revs: list
You might be interested to see the result of running this on Jane’s database:
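>>> for change in db.changes_sync():
...     print(change)
...
Change(id='roadside', seq=2, deleted=False, leaf_revs=['2-6e05'])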
Looks good. Note that the previous edit of the ‘roadside’ record isn’t shown. This is by design: recall how we removed old entries from the ‘by sequence’ index.
Now let’s implement our final endpoint: the revision difference reporter. Given a document ID and a list of revisions, it tells us which of those revisions are unknown to the database. This is where the all_revs() method comes in. But I’m getting ahead of myself. First, we need to get the revision tree for the ID:
def revs_diff_sync(self, id, revs):
    try:
        rev_tree = self._byid[id].rev_tree
    except KeyError:
        rev_tree = None
    return revs_diff(id, revs, rev_tree)
Once we have it, we extract from it all known revisions for this document ID. Note that if there is no revision tree, there are no known revisions in the database.
def revs_diff(id, revs, rev_tree):
    if rev_tree:
        revs_in_db = (rev(*r) for r in rev_tree.all_revs())
    else:
        revs_in_db = ()
    return id, {'missing': set(revs).difference(revs_in_db)}
Finally, we compare the list of revisions given by the caller with the list of revisions known to be in the database. This returns all the unknown revisions on the user’s list using a set difference operation. Note that because these are set operations, it doesn’t matter if revisions occur more than once in either of the lists.
Using this endpoint looks like this:
>>> list(db.revs_diff_sync('roadside', revs=['3-unknown', '2-6e05']))
['roadside', {'missing': {'3-unknown'}}]
As you’d expect, only the unknown revision is returned. We now have all the elements in place to move our attention to the replication process itself.
A replicator is an implementation of the CouchDB replication protocol. This is a process that relies only on the user-facing APIs we just designed. In fact, you could wrap up the methods we just wrote in an HTTP API, and CouchDB’s replicator would be able to sync databases to and from it. So would PouchDB’s replicator. This is something I did in fact do while working on this project, because it’s a great way to test it. Which brings me to…
This post makes it seem like I wrote all this code down at once. That’s not the case; I experimented quite a bit before eventually settling on this design. It might be interesting to know a bit about my work process here.
As I just explained, I wrapped the API inside a CouchDB-compatible HTTP API so I could test it using CouchDB’s replicator. There is a relatively straightforward mapping between the two. What I did not yet explain is that you only need to implement a subset of the API to perform one-way replication. For example, if you implement just the id_sync, update_seq_sync, changes and read properties/methods, that’s enough for CouchDB to replicate data from your database into one of its own. Similarly, to replicate data from some database to your own, you only need to implement id_sync, revs_diff_sync and write_sync. As you can see, this nicely splits the work to be done in half.
While simply using CouchDB’s replicator to replicate a couple of databases I had lying around back and forth provided the bulk of the testing, I also wrote a couple of unit tests, mostly targeting RevisionTree. They are quite similar to the inline snippets you’ve seen throughout this post that demonstrate what individual functions can do[24].
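For example, a test for the branch-extension case could look something like this (assuming the classes above live in an importable revtree module):

from revtree import Branch, RevisionTree  # hypothetical module layout

def test_extend_branch_replaces_leaf():
    tree = RevisionTree([])
    tree.merge_with_path(1, ['1a9c'], {'trees_count': 40})
    tree.merge_with_path(2, ['6e05', '1a9c'], {'trees_count': 41})

    # case 2: the old branch is absorbed into the extended one
    assert len(tree) == 1
    assert tree[0] == Branch(2, ['6e05', '1a9c'], {'trees_count': 41})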
Finally, while writing my own replicator, I noticed that it would also be convenient if that replicator could receive changes from a remote (HTTP) CouchDB, without having to expose its own HTTP API. That way, I could test my own replicator instead of CouchDB’s. My solution was to wrap the CouchDB HTTP API into the same API as we just wrote. There’s one problem though: HTTP requests can be slow. There are two possible solutions. The first is the most mature one: just throw a thread pool at the problem. But I thought this would be a good moment for me to try out Python’s comparatively recent asyncio ecosystem instead. I’d been wanting to do so for some time, anyway.
The first step was to define an asynchronous API for the database. Just to be clear: this is lunacy for an in-memory database on its own. Almost by definition, such a database is synchronous. But it makes some sense when wrapping the CouchDB HTTP API. After all, the CouchDB database in question could be hosted at the other side of the planet, introducing some real latency to the process.
In the end, I implemented the asynchronous API for the in-memory database for code that requires compatibility with async implementations. This way, I could run the new replicator against both CouchDB and in-memory databases. But I also exposed the synchronous API for code that does not need that compatibility. Let’s go through this wrapper code.
import asyncio

class InMemoryDatabase(SyncInMemoryDatabase):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)

        self._update_event = asyncio.Event()

    def _write_normal(self, *args, **kwargs):
        super()._write_normal(*args, **kwargs)

        self._update_event.set()
        self._update_event.clear()

    async def changes(self, since=None, continuous=False):
        """Like CouchDB's _changes with style=all_docs"""

        while True:
            # send (new) changes to the caller
            for change in self.changes_sync(since):
                since = change.seq
                yield change
            if not continuous:
                # stop immediately.
                break
            # wait for new changes to come available, then loop
            await self._update_event.wait()
I thought I’d start with the only part that arguably isn’t just a wrapper. We introduce an asyncio.Event that we set whenever the database is modified. This allows us to implement the continuous option of the changes feed. When this option is True, the changes() function never returns, but instead blocks until a new change is available. The replicator can use this to implement continuous replication, which means that a change is replicated over immediately as it is entered into the database.
The other wrapper methods are simpler. The most complex ones are arguably the read and write methods, which support reading or writing multiple documents with a single call. That’ll come in handy while writing the replicator, as it’s supported by CouchDB as well.
@property
def id(self):
    return as_future_result(self.id_sync)

@property
def update_seq(self):
    return as_future_result(self.update_seq_sync)

async def revs_diff(self, remote):
    async for id, revs in remote:
        yield self.revs_diff_sync(id, revs)

async def write(self, docs):
    async for doc in docs:
        try:
            self.write_sync(doc)
        except Exception as exc:
            yield exc

async def read(self, requested, include_path=False):
    async for id, revs in requested:
        try:
            for doc in self.read_sync(id, revs, include_path):
                yield doc
        except NotFound as exc:
            yield exc

async def ensure_full_commit(self):
    pass

@property
def revs_limit(self):
    return as_future_result(self.revs_limit_sync)

async def set_revs_limit(self, value):
    self.revs_limit_sync = value
Ah, I forgot to describe two things. First of all, there is a new method called ensure_full_commit. It’s there because the replication protocol requires it, but for an in-memory implementation, it’s a no-op. You might also wonder what as_future_result does. It’s a way of making synchronous properties awaitable. The implementation is as follows:
def as_future_result(value):
    future = asyncio.get_event_loop().create_future()
    future.set_result(value)
    return future
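This way, a caller can uniformly write ‘await db.id’, regardless of whether the database it is talking to lives in memory or behind an HTTP API.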
Now that we have a unified API, we can work on the actual replicator. Replication has four inputs:

- a source database; changes from here are applied to…
- …a target database
- create_target: a boolean that tells us whether to create the target database if it’s not there. This is mostly an HTTP API artifact, but we support it by implementing an async create(self) -> bool method on the HTTP database. Of course, an in-memory database will always exist when passed in, so no such method is necessary in that case. This also requires the HTTP database to raise a NotFound error when the database doesn’t exist.
- continuous: a boolean that tells us whether to stop replication when the target DB contains all documents in the source DB, or whether to stay active waiting for new changes to arrive.

Bidirectional replication, which is also known as synchronization, is nothing more than two unidirectional replication jobs running simultaneously with the ‘source’ and ‘target’ roles exchanged.
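A tiny helper along these lines would synchronize two databases using the replicate function defined below:

import asyncio

async def sync(db_a, db_b, continuous=False):
    # two unidirectional replications, with 'source' and 'target' exchanged
    await asyncio.gather(
        replicate(source=db_a, target=db_b, continuous=continuous),
        replicate(source=db_b, target=db_a, continuous=continuous),
    )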
We define the replicator as a single function. This function is long. As such, we will discuss it in multiple parts, sometimes interleaved with the helper functions it calls. While reading this section, I recommend having the replication protocol documentation open. It gives examples for each part of the process, and the code comments refer to its sections.
import uuid

REPLICATION_ID_VERSION = 1

async def replicate(source, target, create_target=False, continuous=False):
    hist_entry = {'session_id': uuid.uuid4().hex,
                  'start_time': timestamp(),
                  'doc_write_failures': 0,
                  'docs_read': 0}

    # 2.4.2.1. Verify Peers & 2.4.2.2. Get Peers Information
    # 2.4.2.1.1. Check Source Existence & 2.4.2.2.1. Get Source Information
    await source.update_seq
    # 2.4.2.1.2. Check Target Existence & 2.4.2.2.2. Get Target Information
    last_seq = await get_target_seq(target, create_target)
    hist_entry['start_last_seq'] = last_seq
A good start. We store some information about how the replication process goes in hist_entry. We also generate a unique replication session identifier. As an aside, timestamp just needs to produce an RFC 2822-formatted date, like the ones CouchDB records in its replication logs. A minimal version could look like this:
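import email.utils

def timestamp():
    # e.g. 'Fri, 22 Jan 2021 14:09:00 GMT'
    return email.utils.formatdate(usegmt=True)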
Apart from that, we check whether both databases exist by requesting their update_seq. In the case of the target database, we create it first if necessary:
async def get_target_seq(target, create_target):
    try:
        return await target.update_seq
    except NotFound:
        # 2.4.2.1.3. Create Target?
        if create_target:
            await target.create()
            # second chance
            return await target.update_seq
        # 2.4.2.1.4. Abort
        raise
OK, so by now both databases exist. We first want to know if replication has already occurred previously between these databases by generating a replication ID:
    # 2.4.2.3. Find Common Ancestry
    # - 2.4.2.3.1. Generate Replication ID
    replication_id = await gen_repl_id(source, target, create_target,
                                       continuous)
gen_repl_id simply hashes all information known about this replication job:
import hashlib

async def gen_repl_id(source, target, create_target, continuous):
    # 2.4.2.3.1. Generate Replication ID
    repl_id_values = ''.join([
        await source.id,
        await target.id,
        str(create_target),
        str(continuous),
    ]).encode('UTF-8')
    return hashlib.md5(repl_id_values).hexdigest()
Next, we query both databases for local documents named after this replication ID, which gives us the previous replication logs (if any). We compare them to see if they agree that a previous replication occurred. If so, we have found a so-called checkpoint. This means we can start replication from the update_seq that was current at the time of that last replication (stored in startup_checkpoint):
    # - 2.4.2.3.2. Retrieve Replication Logs from Source and Target
    log_id = f'_local/{replication_id}'
    log_request = [(log_id, 'winner')]
    source_log = await source.read(async_iter(log_request)).__anext__()
    target_log = await target.read(async_iter(log_request)).__anext__()
    # - 2.4.2.3.3. Compare Replication Logs
    startup_checkpoint = compare_replication_logs(source_log, target_log)
    hist_entry['recorded_seq'] = startup_checkpoint
compare_replication_logs does the actual hard work:
def compare_replication_logs(source, target):
    # 2.4.2.3.3. Compare Replication Logs
    no_checkpoint = (
        # because there is no record of a previous replication
        isinstance(source, NotFound) or
        isinstance(target, NotFound) or
        # or because said replication happened under different (possibly
        # buggy) conditions
        source['replication_id_version'] != REPLICATION_ID_VERSION or
        target['replication_id_version'] != REPLICATION_ID_VERSION
    )
    if no_checkpoint:
        return
    if source['session_id'] == target['session_id']:
        return source['source_last_seq']  # shortcut

    # try to find commonality in diverging histories:
    session_ids = {item['session_id'] for item in source['history']}
    for item in target['history']:
        if item['session_id'] in session_ids:
            # found a previous shared session
            return item['recorded_seq']
    # no such luck: there's no known checkpoint.
At this point, we either need to get all the changes from the source database, or only the changes since the last checkpoint. Note that the changes() function is an async generator function. That means it only generates changes when requested, so no changes are actually retrieved yet. We’ll see that this applies to a lot more functions.
    # 2.4.2.4. Locate Changed Documents
    # - 2.4.2.4.1. Listen to Changes Feed
    # - 2.4.2.4.2. Read Batch of Changes
    changes = source.changes(since=startup_checkpoint, continuous=continuous)
    diff_input = revs_diff_input(changes, hist_entry)
The revs_diff_input function converts Changes into IDs and revisions as expected by the revs_diff method on the target database. It also keeps our history entry up-to-date. Note that this is another async generator function.
async def revs_diff_input(changes, history_entry):
    async for change in changes:
        yield change.id, change.leaf_revs
        history_entry['recorded_seq'] = change.seq
Next up, revs_diff will tell us which documents to read from the source database, because they aren’t in the target database yet. This again requires transforming the results slightly to match the read() function’s input. Note that at this point, still nothing is actually happening because of all the async generators.
async def read_input(revs_diff):
    async for id, info in revs_diff:
        if info['missing']:
            yield id, info['missing']

    # - 2.4.2.4.3. Calculate Revision Difference
    r_input = read_input(target.revs_diff(diff_input))

    # 2.4.2.5. Replicate Changes
    # - 2.4.2.5.1. Fetch Changed Documents
    write_input = count_docs(source.read(r_input, include_path=True),
                             hist_entry)
All documents that were read from the source database now need to be inserted into the target database. This time, no conversion is necessary, but we still pipe them through an extra async generator function, count_docs, to keep our history entry up-to-date:
async def count_docs(docs, history_entry):
    async for doc in docs:
        history_entry['docs_read'] += 1
        yield doc
… before performing the actual write action. Note that this is the point at which all the generator functions start running. So this is the point at which the first change is retrieved, analysed by the revisions difference reporter, then read from disk and finally written. All in a single stream. This isn’t actually strictly according to the protocol, which requires batching as it was written with an HTTP API in mind. For in-memory databases this is a nice and clear way of doing things, though. The HTTP implementation of the database API could work around this by performing batching internally, but that isn’t fully implemented at the moment.
    # - 2.4.2.5.2. Upload Batch of Changed Documents
    async for error in target.write(write_input):
        # - 2.4.2.5.3 TODO (attachments)
        hist_entry['doc_write_failures'] += 1
It’s also worth mentioning that when performing continuous replication, none of the code we will discuss next is currently run, because this loop stays blocked waiting for new changes. Ideally, it would write a checkpoint every few changes instead. But this will do for demonstration purposes.
We’re done writing. The following ensures all the writes we did are saved to disk, at least for ‘normal’ (not memory-based) CouchDB installations:
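    # - 2.4.2.5.4. Ensure In Commit
    await target.ensure_full_commit()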
One thing left to officially complete the replication process: merging the history entry with previous replication logs and replacing these logs in the local documents so future replicator calls have a checkpoint to start from. We do that as follows:
    # - 2.4.2.5.5. Record Replication Checkpoint
    hist_entry['end_time'] = timestamp()
    write_count = hist_entry['docs_read'] - hist_entry['doc_write_failures']
    hist_entry['docs_written'] = write_count
    hist_entry['end_last_seq'] = hist_entry['recorded_seq']
    new_log_shared = {
        'replication_id_version': REPLICATION_ID_VERSION,
        'session_id': hist_entry['session_id'],
        'source_last_seq': hist_entry['recorded_seq'],
    }
    if hist_entry['recorded_seq'] != startup_checkpoint:
        new_source_log = {'history': build_history(source_log, hist_entry),
                          '_id': log_id, **new_log_shared}
        new_target_log = {'history': build_history(target_log, hist_entry),
                          '_id': log_id, **new_log_shared}
        await to_list(source.write(async_iter([new_source_log])))
        await to_list(target.write(async_iter([new_target_log])))
The build_history function inserts the history entry into the existing log, throwing away the oldest replication statistics to prevent the log from growing too large:
def build_history(existing_log, new_entry):
    try:
        existing_history = existing_log['history']
    except TypeError:
        return [new_entry]
    else:
        return [new_entry] + existing_history[:4]
You might wonder what async_iter and to_list are doing. They are small helper functions that let you use the async API without writing async loops or having async input:
async def async_iter(iterable):
    for item in iterable:
        yield item

async def to_list(asynciterable):
    return [x async for x in asynciterable]
Finally, we return the replication statistics to the caller:
    # - 2.4.2.4.4. Replication Completed
    return {
        'ok': True,
        'history': [hist_entry],
        **new_log_shared,
    }
… and that’s all there is to it! Let’s come back to our municipality example one last time and completely simulate what happens.
First, we set up the initial situation by creating databases for the server, Jane and Bob, and inserting the ‘original’ document to the server. We replicate this change to the databases of both Jane and Bob:
>>> server_db = InMemoryDatabase()
>>> jane_db = InMemoryDatabase()
>>> bob_db = InMemoryDatabase()
>>>
>>> server_db.write_sync({'_id': 'roadside', '_rev': '1-1a9c',
... 'trees_count': 40})
>>> await replicate(source=server_db, target=jane_db)
{'ok': True, ..., 'source_last_seq': 1}
>>> await replicate(source=server_db, target=bob_db)
{'ok': True, ..., 'source_last_seq': 1}
Now, Bob plants a tree, but doesn’t get the chance to replicate his change yet.
>>> bob_db.write_sync({'_id': 'roadside', '_rev': '2-e3b0',
... 'trees_count': 41, '_revisions':
... {'start': 2, 'ids': ['e3b0', '1a9c']}})
Jane also plants a tree, but replicates immediately:
>>> jane_db.write_sync({'_id': 'roadside', '_rev': '2-6e05',
... 'trees_count': 41, '_revisions':
... {'start': 2, 'ids': ['6e05', '1a9c']}})
>>> await replicate(source=jane_db, target=server_db)
{'ok': True, ..., 'source_last_seq': 2}
Now Bob comes back online and also replicates:
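>>> await replicate(source=bob_db, target=server_db)
{'ok': True, ..., 'source_last_seq': 2}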
The background process responsible for fixing the conflict is listening to the changes feed and notices that there is now (possibly) a conflict:
>>> for change in server_db.changes_sync():
... print(change)
...
Change(id='roadside', seq=3, deleted=False, leaf_revs=['2-e3b0', '2-6e05'])
It retrieves the conflicting leaves:
>>> for doc in server_db.read_sync('roadside', 'all', include_path=True):
... print(doc)
...
{'_id': 'roadside', '_rev': '2-e3b0', 'trees_count': 41,
'_revisions': {'start': 2, 'ids': ['e3b0', '1a9c']}}
{'_id': 'roadside', '_rev': '2-6e05', 'trees_count': 41,
'_revisions': {'start': 2, 'ids': ['6e05', '1a9c']}}
… and fixes the conflict using two writes:
>>> server_db.write_sync({'_id': 'roadside', '_rev': '3-b617',
... '_deleted': True, '_revisions':
... {'start': 3, 'ids': ['b617', '6e05', '1a9c']}})
>>> server_db.write_sync({'_id': 'roadside', '_rev': '3-5bd6',
... 'trees_count': 42, '_revisions':
... {'start': 3, 'ids': ['5bd6', 'e3b0', '1a9c']}})
To make sure all the databases are again in sync, we also replicate the changes to the smartphones of Jane and Bob:
>>> await replicate(source=server_db, target=jane_db)
{'ok': True, ..., 'source_last_seq': 5}
>>> await replicate(source=server_db, target=bob_db)
{'ok': True, ..., 'source_last_seq': 5}
Let’s check if that worked:
>>> next(jane_db.read_sync('roadside', 'winner'))
{'_id': 'roadside', '_rev': '3-5bd6', 'trees_count': 42}
>>> next(bob_db.read_sync('roadside', 'winner'))
{'_id': 'roadside', '_rev': '3-5bd6', 'trees_count': 42}
Success!
It surprised me that replication itself is actually a relatively straightforward process once you have all the endpoints in place. When I started out, I also expected coming up with the right indices to be the most complex part of this experiment. Clearly, I underestimated the complexity of working with the revision tree. Most of my time was actually spent on getting that correct (hopefully!).
While the above is a full implementation of a CouchDB-compatible database and replication, you might be interested to see the HTTP API and CouchDB database wrapper I described in the testing section. The former is implemented using the Starlette web framework. The latter uses the excellent HTTPX library. Both can be found in my ChairDB repository. It also includes a database implementation on top of SQLite, in case you (rightfully) think an in-memory database doesn’t cut it. Finally, it contains all the tests I wrote. I’m not yet certain if I will continue to work on ChairDB. Perhaps.
If you read this far, congratulations! You now know how to write a CouchDB-compatible database, including replication. If you write one of your own, or if you have any remarks, comments or questions, I’d love to hear from you. Drop me an email. Thank you for reading!
(venv) marten@procyon:~/Bureaublad/couch/chairdb$ cloc replicate.py \
    db/memory.py db/revtree.py db/datatypes.py db/shared.py --md

cloc: count lines of code | github.com/AlDanial/cloc v 1.82

Language | files | blank | comment | code
---------|------:|------:|--------:|-----:
Python   |     5 |   161 |     175 |   365
1. See https://github.com/couchapp/couchapp. Sadly, the technique fell out of favour.
2. Pages can be downloaded from: https://github.com/couchone/pages
3. This was a while ago, clearly… Definitely before affordable mobile data plans.
4. There was no way for a user to ask for a password reset, for example. For a more thorough discussion, see this blog post by Nolan Lawson from 2013.
5. For example, you can only restrict reading databases, not documents. This led to db-per-user workarounds, which come with downsides of their own.
6. iOS: https://github.com/couchbaselabs/TouchDB-iOS; Android: https://github.com/couchbaselabs/TouchDB-Android
7. First WebSQL. It was deprecated due to the specification essentially being ‘do what SQLite does’. These days, there’s IndexedDB.
8. See https://github.com/pouchdb/pouchdb. CouchDB’s source code is pretty accessible as well, but it’s written in Erlang, which I cannot write and only somewhat read.
9. A more thorough discussion of conflicts can be found in the CouchDB documentation.
10. For our prototype, we will ignore attachments, and focus on storing JSON documents only.
11. We will also ignore secondary indexes (both CouchDB views and the newer Mango query server).
12. With ‘remove’ I mean replacing it with a tombstone revision. ‘True’ deletion is possible in CouchDB, but not recommended because it messes up replication. As such, this prototype will not implement that operation. Neither does PouchDB, by the way.
13. Incidentally, that also means that if you remove a document which has conflicts, it will not disappear. Instead, the conflict with the highest revision will be promoted to be the new ‘winner’.
14. Why a list of branches and not a real tree structure? I’m glad you asked. The short answer is that a real tree makes a lot of the algorithms described here more complex. It was my initial approach, but I never got revision pruning to work that way. The code at that time can be found here. It’s possible, I assume, but it’s hard. To the point where CouchDB and PouchDB both convert their internal tree structure into something similar to the ‘list of branches’ representation used here and back again when doing revision pruning. A tree would definitely waste less memory, though.
15. In O(log n), to be exact. But if you know about Big O notation, I probably don’t have to explain binary search in the first place.
16. With a time complexity of O(n).
17. With a time complexity of O(1).
18. If you’re wondering why I’m adding the word ‘sync’ everywhere, it’ll be explained in due time.
19. Curious how other implementations handle indices? I know I was. The on-disk (B+ tree) file format of couchstore can be found here. It’s similar to CouchDB’s, I believe. A discussion of PouchDB’s latest database schema can be found here.
20. In the next few sections, every function that has ‘self’ as its first argument is a method of the SyncInMemoryDatabase class.
21. For the format of the _revisions field, see https://docs.couchdb.org/en/stable/api/document/common.html#getting-a-list-of-revisions.
22. Actually, this is not the API CouchDB presents you with by default. Instead, it’s the one you get when you use the ‘new_edits=false’ option. The ‘normal’ one tries to prevent you from inserting conflicts unnecessarily, and generates new revision numbers and hashes for you. But by now, you know enough about CouchDB’s internals that you don’t need it to hold your hand. There are no fundamental limitations that prevent you from adding such behaviour, though.
23. Thanks to SortedDict’s irange method.
24. It so happens that writing the snippets for this blog post led me to discover a (now fixed) bug.