Automatically fetch runs from archives #60

as2388 · 2020-01-10T17:49:04Z

Automatically fetches runs from archives when necessary. Works as a drop-in replacement for previous versions of Rapid Pro Tools in the pipelines.

Main caveat is that it does some unnecessary work, e.g.:

Will fetch all archives once for each flow, so expensive when starting up.
Fetches archives known to be empty.
Pipelines request all runs since the modification date of the most recent run in that flow when running incrementally. This means for the oldest runs, time will be wasted searching in more recent archives, because the most recent update will fall in the archive period, which will hurt the performance of incremental fetching.

But better to get something that works than pre-optimise.

Takes <1 hour to fetch all data for WorldBank (7 flows from 2 different TextIt instances).

…a flow from Rapid Pro's archives

…tion

rapid_pro_tools/rapid_pro_client.py

IsaackMwenda

Thanks @as2388 .

…d runs in the archive + live exports.

as2388 added 4 commits January 10, 2020 15:50

Add _get_archived_runs_for_flow_id, which retrieves all the runs for …

8bc8fd3

…a flow from Rapid Pro's archives

Update get_message_for_flow_id to fetch from both archives and produc…

f3b4686

…tion

Update documentation for the get_runs functions to reflect archiving

8a6a08b

Improve get_archive log text

f1fbe8f

as2388 requested review from lukechurch and IsaackMwenda January 10, 2020 17:49

IsaackMwenda reviewed Jan 13, 2020

View reviewed changes

rapid_pro_tools/rapid_pro_client.py Outdated Show resolved Hide resolved

IsaackMwenda approved these changes Jan 13, 2020

View reviewed changes

Print the id of the first duplicate run when failing due to duplicate…

b7e4825

…d runs in the archive + live exports.

lukechurch approved these changes Jan 16, 2020

View reviewed changes

as2388 merged commit f9c1286 into master Jan 17, 2020

as2388 mentioned this pull request Jan 20, 2020

Add 'ignore_archives' flag to functions that fetch runs #64

Merged

Provide feedback