Setting index name / names for Series #1079

itholic · 2019-11-26T04:53:55Z

Unlike pandas, koalas.Series can't set the index name like below.

>>> pser = pd.Series([1, 2, 3, 4, 5])
>>> pser.index.name = 'koalas'
>>> pser.index.name
'koalas'

>>> kser = ks.Series([1, 2, 3, 4, 5])
>>> kser.index.name = 'koalas'
>>> kser.index.name

For MultiIndex also

>>> midx = pd.MultiIndex([['lama', 'cow', 'falcon'],
...                       ['speed', 'weight', 'length']],
...                      [[0, 0, 0, 1, 1, 1, 2, 2, 2],
...                       [0, 1, 2, 0, 1, 2, 0, 1, 2]])
>>> pser = pd.Series([45, 200, 1.2, 30, 250, 1.5, 320, 1, 0.3], index=midx)
>>> pser.index.names
FrozenList([None, None])
>>> pser.index.names = ['hello', 'koalas']
>>> pser.index.names
FrozenList(['hello', 'koalas'])

>>> midx = pd.MultiIndex([['lama', 'cow', 'falcon'],
...                       ['speed', 'weight', 'length']],
...                      [[0, 0, 0, 1, 1, 1, 2, 2, 2],
...                       [0, 1, 2, 0, 1, 2, 0, 1, 2]])
>>> kser = ks.Series([45, 200, 1.2, 30, 250, 1.5, 320, 1, 0.3], index=midx)
>>> kser.index.names
[None, None]
>>> kser.index.names = ['hello', 'koalas']
>>> kser.index.names
[None, None]

So, this PR suggests that make ours possible also.

>>> kser = ks.Series([1, 2, 3, 4, 5])
>>> kser.index.name = 'koalas'
>>> kser.index.name
'koalas'

>>> midx = pd.MultiIndex([['lama', 'cow', 'falcon'],
...                       ['speed', 'weight', 'length']],
...                      [[0, 0, 0, 1, 1, 1, 2, 2, 2],
...                       [0, 1, 2, 0, 1, 2, 0, 1, 2]])
>>> kser = ks.Series([45, 200, 1.2, 30, 250, 1.5, 320, 1, 0.3], index=midx)
>>> kser.index.names
[None, None]
>>> kser.index.names = ['hello', 'koalas']
>>> kser.index.names
['hello', 'koalas']

codecov-io · 2019-11-26T05:29:58Z

Codecov Report

Merging #1079 into master will not change coverage.
The diff coverage is 100%.

@@           Coverage Diff           @@
##           master    #1079   +/-   ##
=======================================
  Coverage   95.14%   95.14%           
=======================================
  Files          35       35           
  Lines        6958     6958           
=======================================
  Hits         6620     6620           
  Misses        338      338

Impacted Files	Coverage Δ
databricks/koalas/series.py	`96.5% <100%> (ø)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2bd7adc...8b6204c. Read the comment docs.

HyukjinKwon · 2019-12-02T06:24:43Z

databricks/koalas/series.py

@@ -958,13 +958,11 @@ def rename(self, index: Union[str, Tuple[str, ...]] = None, **kwargs):
    def index(self):
        """The index (axis labels) Column of the Series.

-        Currently not supported when the DataFrame has no index.
-


databricks/koalas/series.py

databricks/koalas/base.py

HyukjinKwon

Looks good otherwise.

databricks/koalas/base.py

softagram-bot · 2019-12-06T01:38:56Z

Softagram Impact Report for pull/1079 (head commit: `8b6204c`)

⚠️ Copy paste found

ℹ️ test_series.py: Copy paste fragment inside the same file on lines 302, 319:

                     pd.Series([True, False], name='x'),
                     pd.Series([0, 1], name='x'),
                     pd.Series([1, 2,...(truncated 330 chars)

ℹ️ test_series.py: Copy paste fragment inside the same file on lines 746, 887:

        midx = pd.MultiIndex([['lama', 'cow', 'falcon'],
                              ['speed', 'weight', 'length']],
                             [[0, 0, 0, 1, 1, 1, 2, 2, 2]...(truncated 280 chars)

ℹ️ test_series.py: Copy paste fragment inside the same file on lines 747, 888, 924:

                              ['speed', 'weight', 'length']],
                             [[0, 0, 0, 1, 1, 1, 2, 2, 2],
                      ...(truncated 256 chars)

ℹ️ test_series.py: Copy paste fragment inside the same file on lines 718, 921:


        # For MultiIndex
        midx = pd.MultiIndex([['lama', 'cow', 'falcon'],
                              ['speed', 'weight', 'length']],
                             [[...(truncated 167 chars)

ℹ️ test_series.py: Copy paste fragment inside the same file on lines 721, 747, 888:

                              ['speed', 'weight', 'length']],
                             [[0, 0, 0, 1, 1, 1, 2, 2, 2],
                      ...(truncated 117 chars)

ℹ️ test_series.py: Copy paste fragment inside the same file on lines 170, 180:

        pdf = pd.DataFrame({
            'left':  [True, False, True, False, np.nan, np.nan, True, False, np.nan],
            'right': [True, False, False, True, True, False, n...(truncated 119 chars)

ℹ️ test_series.py: Copy paste fragment inside the same file on lines 749, 790, 890, 926:

                              [0, 1, 2, 0, 1, 2, 0, 1, 2]])
        kser = ks.Series([45, 200, 1.2, 30, 250, 1.5, 320, 1, 0.3],
             ...(truncated 137 chars)

ℹ️ test_series.py: Copy paste fragment inside the same file on lines 914, 934:


        self.assert_eq(kser.pct_change(periods=-1),
                       pser.pct_change(periods=-1), almost=True)
        self.assert_eq(kser.pct_change(periods=-10...(truncated 213 chars)

ℹ️ test_series.py: Copy paste fragment inside the same file on lines 639, 664:


        index = pd.MultiIndex.from_arrays([
            ['a', 'a', 'b', 'b'], ['c', 'd', 'e', 'f']], names=('first', 'se...(truncated 151 chars)

ℹ️ series.py: Copy paste fragment on line 1248 shared with ../frame.py:


    def to_latex(self, buf=None, columns=None, col_space=None, header=True, index=True,
                 na_rep='NaN',...(truncated 256 chars)

ℹ️ series.py: Copy paste fragment inside the same file on lines 3105, 3213:

        results = sdf.select([scol] + index_scols).take(1)
        if len(results) == 0:
           ...(truncated 409 chars)

ℹ️ series.py: Copy paste fragment inside the same file on lines 4124, 4346:

        sdf = self._internal.sdf \
            .select(cols) \
            .where(reduce(lambda x, y: x & y, rows))

        if len(self._inter...(truncated 255 chars)

Now that you are on the file, it would be easier to pay back some tech. debt.

⭐ Change Overview

(Open in Softagram Desktop for full details)

💡 Insights

Co-change Alert: You modified series.py. Often frame.py (databricks/koalas) is modified at the same time.

📄 Full report

Permalink: Full report for pull/1079

Impact Report explained. Give feedback on this report to [email protected]

itholic added 2 commits November 26, 2019 13:44

setting index name for Series

81f58d0

fix docs

6fe5599

itholic changed the title ~~setting index name for Series~~ Setting index name / names for Series Nov 26, 2019

HyukjinKwon reviewed Dec 2, 2019

View reviewed changes

databricks/koalas/series.py Show resolved Hide resolved

itholic added 2 commits December 2, 2019 15:43

resolve conflicts

8e7003f

fix

6a9ef2b

HyukjinKwon reviewed Dec 2, 2019

View reviewed changes

databricks/koalas/base.py Outdated Show resolved Hide resolved

HyukjinKwon approved these changes Dec 2, 2019

View reviewed changes

fix

32e7f88

HyukjinKwon reviewed Dec 2, 2019

View reviewed changes

databricks/koalas/base.py Outdated Show resolved Hide resolved

itholic added 2 commits December 2, 2019 19:53

remove unused import

c356081

resolve conflicts

8b6204c

HyukjinKwon approved these changes Dec 6, 2019

View reviewed changes

HyukjinKwon merged commit 7844193 into databricks:master Dec 6, 2019

itholic deleted the fix_index_name branch December 10, 2019 15:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Setting index name / names for Series #1079

Setting index name / names for Series #1079

itholic commented Nov 26, 2019

codecov-io commented Nov 26, 2019 •

edited

Loading

HyukjinKwon Dec 2, 2019

HyukjinKwon left a comment

softagram-bot commented Dec 6, 2019

Setting index name / names for Series #1079

Setting index name / names for Series #1079

Conversation

itholic commented Nov 26, 2019

codecov-io commented Nov 26, 2019 • edited Loading

Codecov Report

HyukjinKwon Dec 2, 2019

Choose a reason for hiding this comment

HyukjinKwon left a comment

Choose a reason for hiding this comment

softagram-bot commented Dec 6, 2019

Softagram Impact Report for pull/1079 (head commit: 8b6204c)

⚠️ Copy paste found

⭐ Change Overview

💡 Insights

📄 Full report

codecov-io commented Nov 26, 2019 •

edited

Loading

Softagram Impact Report for pull/1079 (head commit: `8b6204c`)