Skip to content

Pyarrow and asyncio #3151

@SaeedAr

Description

@SaeedAr

I'm trying to use pyarrow in an asynchronous python code-base for processing stream of batches of data.
I found a sample here that basically wraps pyarrow RecordBatchStreamWriter with asyncio sleeps to make an asynchronous behavior. Is this the best possible way to use pyarrow streams asynchronously?

I also found some experimental support for asyncio in pyarrow.plasma here which again by async wait. As I understood IO write/read functions in pyarrow are all synchronous and blocking. Is there any plan for asyncio support in pyarrow?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions