Distributed Series 🎉 #522

Merged
williamma12 merged 34 commits into modin-project:master from devin-petersohn:features/distributed_series on Apr 14, 2019

Conversation

@devin-petersohn
Collaborator

@devin-petersohn devin-petersohn commented Mar 29, 2019

What do these changes do?

Adds distributed Series compatibility to Modin. Currently a work in progress.

New addition can be tested with:

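import modin.pandas as pd
import numpy as np

# 100,000 rows x 1,000 columns of random integers in [0, 100)
frame_data = np.random.randint(0, 100, size=(10**5, 1000))
df = pd.DataFrame(frame_data).add_prefix("col")
# Selecting a column yields a Series; adding two of them exercises the new code
df["col1"] + df["col2"]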

There are still several pieces of functionality to add, in addition to the methods that currently default to pandas.
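
To illustrate what "defaulting to pandas" means here, the following is a minimal sketch of the fallback pattern, not the code in this PR. It uses plain Python lists instead of pandas or Ray, and every name in it (`PartitionedSeries`, `to_local`, `default_to_local`) is hypothetical. The idea it shows: operations with a distributed implementation run partition by partition, while anything else gathers the data into one local object, runs there, and is re-partitioned if the result is still list-like.

```python
import warnings


class PartitionedSeries:
    """Hypothetical stand-in for a distributed Series (not Modin's class)."""

    def __init__(self, values, num_partitions=4):
        values = list(values)
        # Ceiling division so every value lands in some partition.
        step = max(1, -(-len(values) // num_partitions))
        self.partitions = [values[i:i + step] for i in range(0, len(values), step)]

    def to_local(self):
        """Gather every partition into one in-memory list."""
        return [v for part in self.partitions for v in part]

    def __add__(self, other):
        # "Distributed" path: work partition by partition.
        # Assumes both operands have the same length and partitioning.
        out = PartitionedSeries([])
        out.partitions = [
            [a + b for a, b in zip(p, q)]
            for p, q in zip(self.partitions, other.partitions)
        ]
        return out

    def default_to_local(self, func, *args, **kwargs):
        """Fallback path: gather, apply func locally, re-partition list results."""
        warnings.warn(f"Defaulting to local execution for {func.__name__}", UserWarning)
        result = func(self.to_local(), *args, **kwargs)
        if isinstance(result, list):
            return PartitionedSeries(result, num_partitions=max(1, len(self.partitions)))
        return result


s1 = PartitionedSeries(range(10))
s2 = PartitionedSeries(range(10, 20))
elementwise_sum = s1 + s2                                # partition-wise path
total = s1.default_to_local(sum)                         # fallback, scalar result
ordered = s1.default_to_local(sorted, reverse=True)      # fallback, re-partitioned
```

The fallback trades parallelism for coverage: every method works, but the ones that go through `default_to_local` pay the cost of collecting all partitions first, which is why the warning is worth emitting.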

Related issue number

  • passes flake8 modin
  • passes black --check modin
  • tests added and passing

@AmplabJenkins

Merged build finished. Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Modin-PRB/667/
Test FAILed.

@codecov

codecov bot commented Mar 29, 2019

Codecov Report

Merging #522 into master will decrease coverage by 0.01%.
The diff coverage is 92.44%.


@@            Coverage Diff             @@
##           master     #522      +/-   ##
==========================================
- Coverage   90.22%   90.21%   -0.02%     
==========================================
  Files          34       37       +3     
  Lines        4849     5406     +557     
==========================================
+ Hits         4375     4877     +502     
- Misses        474      529      +55
| Impacted Files | Coverage Δ | |
|---|---|---|
| modin/pandas/base.py | 95.03% <ø> (ø) | |
| modin/pandas/dataframe.py | 88.96% <ø> (-4.46%) | ⬇️ |
| modin/data_management/utils.py | 83.72% <100%> (-0.73%) | ⬇️ |
| modin/pandas/general.py | 97.72% <100%> (-2.28%) | ⬇️ |
| modin/pandas/indexing.py | 81.28% <100%> (+0.51%) | ⬆️ |
| modin/pandas/utils.py | 95.83% <100%> (ø) | ⬆️ |
| modin/pandas/__init__.py | 85.1% <100%> (+0.32%) | ⬆️ |
| modin/backends/base/query_compiler.py | 100% <100%> (ø) | ⬆️ |
| modin/engines/ray/pandas_on_ray/frame/partition.py | 73.58% <66.66%> (-1.42%) | ⬇️ |
| ...din/engines/ray/generic/frame/partition_manager.py | 91.66% <80%> (-8.34%) | ⬇️ |
| ... and 13 more | | |

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 62c2996...0f7736b. Read the comment docs.

@AmplabJenkins

Merged build finished. Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Modin-PRB/668/
Test FAILed.

@AmplabJenkins

Merged build finished. Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Modin-PRB/669/
Test FAILed.

Collaborator

@williamma12 williamma12 left a comment

Looked over it quickly

@devin-petersohn changed the title from "[WIP] Distributed Series 🎉" to "Distributed Series 🎉" on Apr 2, 2019
Update implementation

Fix repr

Update mapreduce functionality

Update

Add more methods

work in progress updates from home

Fix test files

Add several methods to Series

Fix comment

Remove duplicate code

Add some defaulting to pandas

Push today's progress

Fix some test failures

Fix failures

Fix indexing

Fix python2 compat

Address comments, cleanup

Lint

Cleanup

Fix test error for Ray, propagate errors

Lint
Collaborator Author

@devin-petersohn devin-petersohn left a comment

Other things to add:

  • Documentation section about Series
  • Finish tests

Collaborator

@williamma12 williamma12 left a comment

Left some quick comments on some documentation, but otherwise it looks good!

Collaborator

@williamma12 williamma12 left a comment

Thanks for the much-needed addition, @devin-petersohn!

3 participants