Fetch snapshot by time #262

alecgibson · 2018-11-30T17:09:40Z

This change adds the ability to fetch a snapshot by time. The motivation
for this is that fetching a document by time is quite a "natural" way
to think about document history, and allows us to - for example - fetch
multiple documents as they were at a given time, without having to
look up their exact version numbers first.

We add a new `Connection.fetchSnapshotByTimestamp` method, which
follows a very similar route to `Connection.fetchSnapshot`, and
re-uses as much code as possible:

- both methods use a subclass of `SnapshotRequest`
- both methods have their requests handled by the same machinery in
  `Connection`
- both methods in the `Backend` have ops applied by a common method,
  but use their own methods for calls to middleware
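As a rough illustration of the motivating use case (fetching several documents as they were at one moment), here is a minimal sketch. The helper `fetchSnapshotsAtTime` is hypothetical and not part of this change, and it assumes the new method is called as `connection.fetchSnapshotByTimestamp(collection, id, timestamp, callback)`, mirroring `Connection.fetchSnapshot`.

```javascript
// Hypothetical helper, not part of this PR: fetch several documents as they
// were at a single point in time.
// Assumption: connection.fetchSnapshotByTimestamp(collection, id, timestamp, callback)
// calls back with (error, snapshot).
function fetchSnapshotsAtTime(connection, collection, ids, timestamp, callback) {
  var snapshots = {};
  var remaining = ids.length;
  var failed = false;

  // Nothing to fetch: call back straight away with an empty result.
  if (remaining === 0) return callback(null, snapshots);

  ids.forEach(function (id) {
    connection.fetchSnapshotByTimestamp(collection, id, timestamp, function (error, snapshot) {
      if (failed) return; // an earlier request already reported an error
      if (error) {
        failed = true;
        return callback(error);
      }
      snapshots[id] = snapshot;
      // Call back once every document has been fetched.
      if (--remaining === 0) callback(null, snapshots);
    });
  });
}
```

Note that no version numbers are needed by the caller: the same timestamp is passed for every document ID.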

In order to make this feature possible at scale, this change also adds
two new methods to the MilestoneDB interface:

- `getMilestoneSnapshotAtOrBeforeTime`
- `getMilestoneSnapshotAtOrAfterTime`

These methods are used to fetch the milestone snapshots on either side of
the requested timestamp, which means we only need to fetch the ops between
the two of them to reach the desired timestamp.
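To make the idea concrete, here is a minimal sketch of how the two milestone snapshots could bound the range of ops to fetch. The helper `opRangeBetweenMilestones` is hypothetical, and the sketch assumes each milestone snapshot carries its version in a `v` field and that either lookup may find nothing.

```javascript
// Hypothetical helper, not the actual Backend code.
// `before` stands for the result of getMilestoneSnapshotAtOrBeforeTime and
// `after` for the result of getMilestoneSnapshotAtOrAfterTime.
// Assumption: each is either a snapshot with a version `v`, or null/undefined.
function opRangeBetweenMilestones(before, after) {
  return {
    // With no earlier milestone we have to start from the first op.
    from: before ? before.v : 0,
    // With no later milestone there is no upper bound (null = up to latest).
    to: after ? after.v : null
  };
}
```

For example, with milestones at versions 10 and 20 on either side of the timestamp, only ops 10 to 20 need fetching rather than the document's whole history.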

If a milestone database is not being used, fetching a snapshot by
timestamp is still possible, but it will fetch all the ops for a
document and keep applying them from v0 until the timestamp is
reached, which is not particularly scalable.
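The fallback can be sketched as follows. This is an illustration under stated assumptions, not the code in this change: it assumes ops arrive sorted by version and that each op records its creation time in `op.m.ts`. Both function names and the `applyOp` parameter are hypothetical.

```javascript
// Hypothetical sketch: keep only the ops created at or before `timestamp`.
// Assumptions: `ops` is sorted by version, and each op has metadata `m.ts`.
function filterOpsUpToTimestamp(ops, timestamp) {
  var filtered = [];
  for (var i = 0; i < ops.length; i++) {
    var op = ops[i];
    // Stop at the first op that is newer than the requested time.
    if (op.m && op.m.ts > timestamp) break;
    filtered.push(op);
  }
  return filtered;
}

// Hypothetical sketch: fold the filtered ops into the starting data.
// `applyOp(data, op)` stands in for whatever the OT type does to apply an op.
function buildDataFromOps(startingData, ops, applyOp) {
  return ops.reduce(function (data, op) {
    return applyOp(data, op);
  }, startingData);
}
```

The cost is proportional to the number of ops in the document's history, which is why a milestone database is needed to make this scale.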

coveralls · 2018-11-30T17:16:41Z

Coverage increased (+0.2%) to 95.848% when pulling 8e46271 on snapshot-by-timestamp into 95c81b8 on master.

ericyhwang

Here are the notes I took during the PR review meeting today.

I haven't looked at the tests yet, but the actual code itself looks great. Glad you could find a clean abstraction boundary.

lib/backend.js

lib/client/connection.js

This change removes or renames `shouldBreak` calls. In `Backend`, for clarity we instead pre-filter ops, and just pass around the ops we want to be applied to a snapshot. In the `MemoryMilestoneDB`, these functions are extracted and renamed to more descriptive break condition names.

The function for building a snapshot from ops is useful, and has no dependencies on `Backend`. This change moves it into the `ot` module, where it will be a bit more discoverable and can be reused.

alecgibson · 2019-01-23T17:52:27Z

@nateps / @ericyhwang as discussed, I've moved the snapshot building function into ot.js.

ericyhwang · 2019-01-30T17:13:18Z

LGTM from me and Nate

This change adds [new milestone database methods][1] for fetching milestones by timestamp. In order to make this performant, this change also adds a new index to the `m.mtime` (modified timestamp) field.

[1]: share/sharedb#262

alecgibson force-pushed the snapshot-by-timestamp branch from af55acd to d8bbbe2 on November 30, 2018 17:11

alecgibson force-pushed the snapshot-by-timestamp branch 4 times, most recently from bd8763f to 513f6b6 on December 3, 2018 15:08

alecgibson force-pushed the snapshot-by-timestamp branch from 513f6b6 to 4a7a178 on December 3, 2018 15:25

ericyhwang reviewed Dec 19, 2018

alecgibson force-pushed the snapshot-by-timestamp branch from dc9b8ca to 579b22c on January 23, 2019 17:15

Move snapshot building function into ot

8e46271

ericyhwang merged commit f507ca1 into master Jan 30, 2019

alecgibson deleted the snapshot-by-timestamp branch January 30, 2019 17:27

alecgibson mentioned this pull request Jan 30, 2019

Add timestamp methods share/sharedb-milestone-mongo#2

Merged
