Fix serialized big data issue #38

Merged
merged 1 commit into from
Oct 24, 2024

Conversation

sharpener6
Collaborator

  • A single object can now be larger than 500MB. The previous limit came from a Cap'n Proto limitation; according to the author, representing the payload as List(List(Data)) and then concatenating the pieces should support byte payloads larger than 500MB. pycapnp does not support memoryview yet, so splitting and concatenating will certainly create multiple copies, but this should be done right after the serialization and deserialization stages, with the memory released afterwards.

  • Later, when we move from pyzmq to pynng, the memory copy issue should be eased, because pynng will expose more raw operations, and we can save threads.
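
The split-and-concatenate idea from the first bullet can be sketched in plain Python. This is only an illustration under stated assumptions, not the code merged in this PR: `split_nested`, `join_nested`, `chunk_size`, and `chunks_per_group` are hypothetical names, and the sketch works on ordinary `bytes` objects rather than on pycapnp message fields.

```python
from typing import List


def split_nested(data: bytes, chunk_size: int, chunks_per_group: int) -> List[List[bytes]]:
    """Split data into a two-level list of byte chunks (hypothetical helper).

    Mirrors the List(List(Data)) shape: each inner chunk holds at most
    chunk_size bytes, each outer group holds at most chunks_per_group chunks.
    """
    if chunk_size <= 0 or chunks_per_group <= 0:
        raise ValueError("chunk_size and chunks_per_group must be positive")

    # Slicing a memoryview does not copy; bytes(...) makes one copy per chunk,
    # which stands in for the copy made when a library cannot take a memoryview.
    view = memoryview(data)
    chunks = [bytes(view[i:i + chunk_size]) for i in range(0, len(view), chunk_size)]

    # Group the flat chunk list into the outer list.
    return [chunks[i:i + chunks_per_group] for i in range(0, len(chunks), chunks_per_group)]


def join_nested(groups: List[List[bytes]]) -> bytes:
    """Concatenate the two-level chunk list back into one bytes object."""
    # A single join allocates the result once instead of growing it repeatedly.
    return b"".join(chunk for group in groups for chunk in group)


if __name__ == "__main__":
    payload = bytes(range(10))
    nested = split_nested(payload, chunk_size=3, chunks_per_group=2)
    assert join_nested(nested) == payload
```

The chunk list and the joined result are both temporary copies, so dropping references to them once the message has been written or read lets the memory be reclaimed.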

Signed-off-by: Sharpner6 <[email protected]>
@sharpener6 sharpener6 requested a review from rafa-be October 24, 2024 01:40
@sharpener6
Collaborator Author

@rafa-be, please review this. If approved, please merge and create a release. Thanks.

scaler/io/utility.py (review thread resolved)
@rafa-be rafa-be merged commit 4d7b85d into Citi:main Oct 24, 2024
5 checks passed