12.5. Clustered Purge
The primary purpose of clustered purge is to clean databases that have multipledeleted tombstones or single documents that contain large numbers of conflicts.But it can also be used to purge any document (deleted or non-deleted) with anynumber of revisions.
Clustered purge is designed to maintain eventual consistency and preventunnecessary invalidation of secondary indexes. For this, every database keepstrack of a certain number of historical purges requested in the database, aswell as its current purge_seq
. Internal replications and secondary indexesprocess database’s purges and periodically update their corresponding purgecheckpoint documents to report purge_seq
processed by them. To ensureeventual consistency, the database will remove stored historical purge requestsonly after they have been processed by internal replication jobs and secondaryindexes.
12.5.1. Internal Structures
To enable internal replication of purge information between nodes and secondaryindexes, two internal purge trees were added to a database file to trackhistorical purges.
- purge_tree: UUID -> {PurgeSeq, DocId, Revs}
- purge_seq_tree: PurgeSeq -> {UUID, DocId, Revs}
Each interactive request to _purge API
, creates an ordered set of pairs onincreasing purge_seq
and purge_request, where purge_request is a tuple thatcontains docid and list of revisions. For each purge_request uuid is generated.A purge request is added to internal purge trees:a tuple {UUID -> {PurgeSeq, DocId, Revs}}
is added to purge_tree
,a tuple is {PurgeSeq -> {UUID, DocId, Revs}}
added to purge_seq_tree
.
12.5.2. Compaction of Purges
During the compaction of the database the oldest purge requests are to beremoved to store only purged_infos_limit
number of purges in the database.But in order to keep the database consistent with indexes and other replicas,we can only remove purge requests that have already been processed by indexesand internal replications jobs. Thus, occasionally purge trees may storemore than purged_infos_limit
purges. If the number of stored purges in thedatabase exceeds purged_infos_limit
by a certain threshold, a warning isproduced in logs signaling a problem of synchronization of database’s purgeswith indexes and other replicas.
12.5.3. Local Purge Checkpoint Documents
Indexes and internal replications of the database with purges create andperiodically update local checkpoint purge documents:_local/purge-$type-$hash
. These documents report the last purge_seq
processed by them and the timestamp of the last processing. An example of alocal checkpoint purge document:
- {
- "_id": "_local/purge-mrview-86cacdfbaf6968d4ebbc324dd3723fe7",
- "type": "mrview",
- "purge_seq": 10,
- "updated_on": 1540541874,
- "ddoc_id": "_design/foo",
- "signature": "5d10247925f826ae3e00966ec24b7bf6"
- }
The below image shows possible local checkpoint documents that a database mayhave.
Local Purge Checkpoint Documents
12.5.4. Internal Replication
Purge requests are replayed across all nodes in an eventually consistent manner.Internal replication of purges consists of two steps:
Pull replication. Internal replication first starts by pulling purges fromtarget and applying them on source to make sure we don’t reintroduce to targetsource’s docs/revs that have been already purged on target. In this step, we usepurge checkpoint documents stored on target to keep track of the last target’s
purge_seq
processed by the source. We find purge requests occurred afterthispurge_seq
, and replay them on source. This step is done by updatingthe target’s checkpoint purge documents with the latest processpurge_seq
and timestamp.Push replication. Then internal replication proceeds as usual with an extrastep inserted to push source’s purge requests to target. In this step, we uselocal internal replication checkpoint documents, that are updated both on targetand source.
Under normal conditions, an interactive purge request is already sent to everynode containing a database shard’s replica, and applied on every replica.Internal replication of purges between nodes is just an extra step to ensureconsistency between replicas, where all purge requests on one node are replayedon another node. In order not to replay the same purge request on a replica,each interactive purge request is tagged with a unique uuid
. Internalreplication filters out purge requests with UUIDs that already exist in thereplica’s purge_tree
, and applies only purge requests with UUIDs that don’texist in the purge_tree
. This is the reason why we needed to have twointernal purge trees: 1) purge_tree
: {UUID -> {PurgeSeq, DocId, Revs}}
allows to quickly find purge requests with UUIDs that already exist in thereplica; 2) purge_seq_tree
: {PurgeSeq -> {UUID, DocId, Revs}}
allows toiterate from a given purge_seq
to collect all purge requests happened afterthis purge_seq
.
12.5.5. Indexes
Each purge request will bump up update_seq
of the database, so that eachsecondary index is also updated in order to apply the purge requests to maintainconsistency within the main database.
12.5.6. Config Settings
These settings can be updated in the default.ini or local.ini:
Field | Description | Default |
---|---|---|
max_document_id_number | Allowed maximum number of documents in onepurge request | 100 |
max_revisions_number | Allowed maximum number of accumulatedrevisions in one purge request | 1000 |
allowed_purge_seq_lag | Beside purged_infos_limit, allowedadditional buffer to store purge requests | 100 |
index_lag_warn_seconds | Allowed durations when index is notupdated for local purge checkpoint document | 86400 |
During a database compaction, we check all checkpoint purge docs. A client (anindex or internal replication job) is allowed to have the last reportedpurge_seq
to be smaller than the current database shard’s purge_seq
bythe value of (purged_infos_limit + allowed_purge_seq_lag)
. If the client’spurge_seq
is even smaller, and the client has not checkpointed withinindex_lag_warn_seconds
, it prevents compaction of purge trees and we have toissue the following log warning for this client:
- Purge checkpoint '_local/purge-mrview-9152d15c12011288629bcffba7693fd4’
- not updated in 86400 seconds in
- <<"shards/00000000-1fffffff/testdb12.1491979089">>
If this type of log warning occurs, check the client to see why the processingof purge requests is stalled in it.
There is a mapping relationship between a design document of indexes and localcheckpoint docs. If a design document of indexes is updated or deleted, thecorresponding local checkpoint document should be also automatically deleted.But in an unexpected case, when a design doc was updated/deleted, but itscheckpoint document still exists in a database, the following warning will beissued:
- "Invalid purge doc '<<"_design/bar">>' on database
- <<"shards/00000000-1fffffff/testdb12.1491979089">>
- with purge_seq '50'"
If this type of log warning occurs, remove the local purge doc from a database.