Implement expanding.groupby.count in Series and Frame #991

HyukjinKwon · 2019-11-01T09:16:45Z

This PR implements `expanding().count()` on grouped Series and DataFrame objects, i.e. `groupby(...).expanding().count()`:

>>> import databricks.koalas as ks
>>> kser = ks.Series([2, 3, float("nan"), 10])
>>> kser.groupby(kser).expanding().count()
0
3.0   1    1.0
2.0   0    1.0
10.0  3    1.0
Name: 0, dtype: float64

>>> df = kser.to_frame()
>>> df.groupby(df['0']).expanding().count()
          0
0
3.0  1  1.0
2.0  0  1.0
10.0 3  1.0

Relates to #977
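For readers who want the semantics spelled out, here is a minimal, dependency-free sketch of a grouped expanding count. It is not the Koalas implementation (which runs on Spark window functions); the helper name `grouped_expanding_count` is hypothetical. It assumes, as the output above suggests, that rows whose group key is NaN are dropped and that NaN values do not increase the count.

```python
import math
from collections import defaultdict


def _is_nan(x):
    # True only for float NaN; other types are never treated as missing here.
    return isinstance(x, float) and math.isnan(x)


def grouped_expanding_count(keys, values):
    """Return {(group_key, row_position): running non-NaN count within the group}.

    Hypothetical helper for illustration only.
    """
    running = defaultdict(int)
    out = {}
    for pos, (key, value) in enumerate(zip(keys, values)):
        if _is_nan(key):
            continue  # assumption: a NaN key belongs to no group
        if not _is_nan(value):
            running[key] += 1  # only non-missing values are counted
        out[(key, pos)] = float(running[key])
    return out


# Same data as the example above: the series is grouped by its own values.
data = [2, 3, float("nan"), 10]
print(grouped_expanding_count(data, data))
# → {(2, 0): 1.0, (3, 1): 1.0, (10, 3): 1.0}

# With repeated keys the count grows row by row inside each group.
print(grouped_expanding_count(["a", "a", "b", "a"], [1.0, float("nan"), 2.0, 3.0]))
# → {('a', 0): 1.0, ('a', 1): 1.0, ('b', 2): 1.0, ('a', 3): 2.0}
```

Each group in the PR example holds a single row, which is why every count there is 1.0.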

codecov-io · 2019-11-01T09:51:17Z

Codecov Report

Merging #991 into master will decrease coverage by 1.44%.
The diff coverage is 97.95%.

@@            Coverage Diff             @@
##           master     #991      +/-   ##
==========================================
- Coverage   94.79%   93.34%   -1.45%     
==========================================
  Files          34       34              
  Lines        6568     6601      +33     
==========================================
- Hits         6226     6162      -64     
- Misses        342      439      +97
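The summary line reports a 1.44% decrease while the table shows -1.45%. A small sketch of the arithmetic, assuming coverage is computed as hits / lines and that the table percentages are truncated (not rounded) to two decimals; that truncation is inferred from the numbers above, not from Codecov documentation:

```python
def truncated_pct(hits, lines):
    # Percentage truncated to two decimals, using integer arithmetic
    # to avoid floating-point surprises. Truncation is an assumption.
    return (hits * 10000 // lines) / 100


master_hits, master_lines = 6226, 6568
pr_hits, pr_lines = 6162, 6601

master_pct = truncated_pct(master_hits, master_lines)  # 94.79
pr_pct = truncated_pct(pr_hits, pr_lines)              # 93.34

# Difference of the two displayed (truncated) values, as in the table.
table_delta = round(pr_pct - master_pct, 2)            # -1.45

# Difference of the exact ratios, rounded afterwards, as in the summary line.
exact_delta = round(100 * (pr_hits / pr_lines - master_hits / master_lines), 2)  # -1.44

print(master_pct, pr_pct, table_delta, exact_delta)
```

Under these assumptions both figures are consistent with the same hit and line counts; they differ only in where the rounding happens.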

| Impacted Files | Coverage Δ | |
|---|---|---|
| databricks/koalas/missing/window.py | `100% <ø> (ø)` | ⬆️ |
| databricks/koalas/groupby.py | `91.39% <100%> (ø)` | ⬆️ |
| databricks/koalas/window.py | `93.57% <97.87%> (+2.66%)` | ⬆️ |
| databricks/koalas/usage_logging/__init__.py | `24.54% <0%> (-72.73%)` | ⬇️ |
| databricks/koalas/usage_logging/usage_logger.py | `50% <0%> (-50%)` | ⬇️ |
| databricks/koalas/__init__.py | `80.85% <0%> (-6.39%)` | ⬇️ |
| databricks/conftest.py | `93.61% <0%> (-4.26%)` | ⬇️ |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update f302c6d...3d4113d.

HyukjinKwon · 2019-11-05T10:45:04Z

This should be ready for a look.

cc @itholic too since you're working on rolling.

softagram-bot · 2019-11-05T10:45:45Z

Softagram Impact Report for pull/991 (head commit: `3d4113d`)

HyukjinKwon · 2019-11-06T02:15:57Z

I'm merging this to move forward. I'm still working on this file, so please let me know if you have any comments.

itholic · 2019-11-06T02:25:42Z

@HyukjinKwon Great! Okay, I'll start on Rolling.GroupBy now.

ueshin · 2019-11-06T20:10:28Z

databricks/koalas/window.py

+
+        internal = _InternalFrame(sdf=sdf,
+                                  data_columns=[c._internal.data_columns[0] for c in applied],
+                                  index_map=new_index_map)


Don't we need to preserve column_index?

HyukjinKwon · 2019-11-07

Let me take a look and fix it.

HyukjinKwon changed the title ~~[WIP]~~ [WIP] Implement expanding.groupby.count in Series and Frame Nov 1, 2019

HyukjinKwon force-pushed the groupby-expanding branch from 06150ed to 34f952e Compare November 5, 2019 02:50

Implement expanding.groupby.count in Series and Frame

3d4113d

HyukjinKwon force-pushed the groupby-expanding branch from b1dfcdd to 3d4113d Compare November 5, 2019 10:44

HyukjinKwon changed the title ~~[WIP] Implement expanding.groupby.count in Series and Frame~~ Implement expanding.groupby.count in Series and Frame Nov 5, 2019

HyukjinKwon requested a review from ueshin November 6, 2019 00:36

HyukjinKwon merged commit 5fc4b61 into databricks:master Nov 6, 2019

HyukjinKwon deleted the groupby-expanding branch November 6, 2019 02:20

ueshin reviewed Nov 6, 2019

HyukjinKwon replied Nov 7, 2019
