iterate_stream uses deprecated write_object instead of write_entity #33

stchris · 2024-02-27T07:56:53Z

iterate_stream uses the (deprecated) write_object:

Line 30 in e1c5718

def iterate_stream(dataset, file, entity_id=None):

instead of the newer write_entity. The latter uses orjson and might show a significant speed boost, so some before/after benchmarking would be useful here as well.
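One practical consequence of the switch is the stream type: the benchmark later in this thread passes a StringIO to write_object and a BytesIO to write_entity, which suggests the former writes text and the latter writes bytes. The sketch below illustrates that difference with stdlib-only stand-ins. `write_text_line` and `write_bytes_line` are hypothetical helpers, not the followthemoney functions, and stdlib `json` stands in for orjson here.

```python
import json
from io import BytesIO, StringIO


def write_text_line(stream, data):
    # Hypothetical stand-in for a text-based writer: one JSON document per line, as str.
    stream.write(json.dumps(data, sort_keys=True) + "\n")


def write_bytes_line(stream, data):
    # Hypothetical stand-in for a bytes-based writer: one JSON document per line, as bytes.
    stream.write(json.dumps(data).encode("utf-8") + b"\n")


record = {"id": "test", "schema": "Person"}

text_out = StringIO()
write_text_line(text_out, record)

bytes_out = BytesIO()
write_bytes_line(bytes_out, record)

# Passing a text stream to the bytes writer fails, so callers must open files in binary mode.
try:
    write_bytes_line(StringIO(), record)
    mixed_ok = True
except TypeError:
    mixed_ok = False
```

If this assumption holds, any file handle that iterate_stream receives would need to be opened in binary mode once it calls write_entity.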


stchris · 2024-03-05T17:55:14Z

I ran the following benchmark:

from io import StringIO, BytesIO

from followthemoney import model
from followthemoney.cli.util import write_object, write_entity

import pyperf


ENTITY = {
    "id": "test",
    "schema": "Person",
    "properties": {
        "name": ["Ralph Tester"],
        "birthDate": ["1972-05-01"],
        "idNumber": ["9177171", "8e839023"],
        "topics": ["role.spy"],
    },
}


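def bench_write_object(obj):
    # A fresh text buffer is created on every call, so its cost is part of the timing.
    write_object(StringIO(), obj)


def bench_write_entity(obj):
    # write_entity is given a binary buffer here, also created fresh on every call.
    write_entity(BytesIO(), obj)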


runner = pyperf.Runner()
obj = model.get_proxy(ENTITY)
runner.bench_func("write_object", bench_write_object, obj)
runner.bench_func("write_entity", bench_write_entity, obj)

and it yielded a significant improvement, with write_entity roughly 3x faster per call:

write_object: Mean +- std dev: 2.76 us +- 0.02 us
write_entity: Mean +- std dev: 924 ns +- 8 ns
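To put those two means in perspective, here is a small calculation of the speedup ratio and the projected time saved over a large stream. It uses only the means reported above; the one-million-entity stream size is an arbitrary assumption for illustration, and the projection ignores I/O and everything else iterate_stream does.

```python
# Means reported by pyperf above, converted to nanoseconds.
write_object_ns = 2760.0  # 2.76 us
write_entity_ns = 924.0   # 924 ns

# Ratio of per-call cost.
speedup = write_object_ns / write_entity_ns

# Hypothetical stream size, chosen only for illustration.
entity_count = 1_000_000
saved_seconds = (write_object_ns - write_entity_ns) * entity_count / 1e9

print(f"speedup: {speedup:.2f}x")
print(f"saved over {entity_count} entities: {saved_seconds:.3f} s")
```

On these numbers the serialization step alone would save a little under two seconds per million entities.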

stchris mentioned this issue Mar 5, 2024

Use write_entity instead of deprecated write_object #34

Merged

catileptic added a commit that referenced this issue Mar 26, 2024

Complete fix for #33

4530b65

stchris closed this as completed Apr 9, 2024
