Skip to content

Conversation

@beobest2
Copy link
Contributor

What changes were proposed in this pull request?

In the auto-generated documentation on pandas API support list, there are cases where the link of the function property provided in the document is not connected, so it needs to be corrected.

The current 'supported API generation' function dynamically compares the modules of PySpark.pandas and pandas to find the difference.
At this time, the inherited class is also aggregated, and the link is not generated correctly (such as CategoricalIndex.all() is used internally by inheriting Index.all().) because it does not match the pattern of each API document.

So, I modified it in such a way that it is created by excluding methods that exist in the parent class.

Why are the changes needed?

To link to the correct API document.

Does this PR introduce any user-facing change?

Yes, the "Supported pandas APIs" page has changed as below.
Screen Shot 2022-06-16 at 7 54 05 PM

How was this patch tested?

Manually check the links in the documents & the existing doc build should be passed.

…erated pandas API support list documentation
@beobest2
Copy link
Contributor Author

beobest2 commented Jun 17, 2022

@HyukjinKwon Modified to automatically generate documentation for only newly declared or overridden functions in its own class.

Most broken links have been fixed.
However, some functions are not linked. The cases are divided as follows.

For Cases A, B, and C, it seems that we can add documents in pyspark.pandas.
In case D, it seems necessary to separately check the list declared in the code but not supported by pyspark.pandas, or to remove the declaration in the code.

  • Case A: There is only one pyspark.pandas document for the same function with a different name, (pandas document exists)
    ex> divide (=div), multiply(=mul), subtract(=sub)
DataFrame
- divide
- multiply
- subtract

Series
- divde
- multiply
- subtract
  • Case B: pyspark.pandas document does not exist (pandas docs exist)
Index
- get_level_values
- holds_integer
- is_type_compatible

MultiIndex
- get_level_values

Expanding
- kurt
- skew
- std
- var

Rolling
- kurt
- skew
- std
- var
  • Case C: Documentation does not exist (even in pandas)
MultiIndex
- drop_duplicates

GroupBy
- expanding
- pad
- rolling
Index
- sort

SeriesGroupBy
- agg
- aggregate

@HyukjinKwon
Copy link
Member

cc @Yikun @xinrong-databricks @itholic FYI

Copy link
Member

@Yikun Yikun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

from inspect import getmembers, isfunction

class Parent:
    x = 0
    def __init__(self):
        pass
    def parent_function():
        print("parent")

class Child(Parent):
    x = 1
    def __init__(self):
        pass
    def child_function(self):
        print("child")

p = Parent()
c = Child()

# Get all child + parent functions
[i for i in dict([m for m in getmembers(Child, isfunction)])]

# Get all child functions
[i for i in Child.__dict__]

Here is a simple Test to help review, this change exclude parent only functions. so LGTM to me.

@beobest2 If you could print the diff list before and after this PR may also help to review. : )

@beobest2
Copy link
Contributor Author

The difference was calculated and written as follows. Please refer to it when reviewing.

  • Count before performing this PR. : 1203
  • Count after performing this PR. : 573

--- DIFF ---

pyspark.pandas.CategoricalIndex.all
pyspark.pandas.CategoricalIndex.any
pyspark.pandas.CategoricalIndex.append
pyspark.pandas.CategoricalIndex.argmax
pyspark.pandas.CategoricalIndex.argmin
pyspark.pandas.CategoricalIndex.asof
pyspark.pandas.CategoricalIndex.asof_locs
pyspark.pandas.CategoricalIndex.copy
pyspark.pandas.CategoricalIndex.delete
pyspark.pandas.CategoricalIndex.difference
pyspark.pandas.CategoricalIndex.drop
pyspark.pandas.CategoricalIndex.drop_duplicates
pyspark.pandas.CategoricalIndex.droplevel
pyspark.pandas.CategoricalIndex.dropna
pyspark.pandas.CategoricalIndex.duplicated
pyspark.pandas.CategoricalIndex.factorize
pyspark.pandas.CategoricalIndex.fillna
pyspark.pandas.CategoricalIndex.format
pyspark.pandas.CategoricalIndex.get_indexer
pyspark.pandas.CategoricalIndex.get_indexer_for
pyspark.pandas.CategoricalIndex.get_indexer_non_unique
pyspark.pandas.CategoricalIndex.get_level_values
pyspark.pandas.CategoricalIndex.get_loc
pyspark.pandas.CategoricalIndex.get_slice_bound
pyspark.pandas.CategoricalIndex.get_value
pyspark.pandas.CategoricalIndex.groupby
pyspark.pandas.CategoricalIndex.holds_integer
pyspark.pandas.CategoricalIndex.identical
pyspark.pandas.CategoricalIndex.insert
pyspark.pandas.CategoricalIndex.intersection
pyspark.pandas.CategoricalIndex.is_
pyspark.pandas.CategoricalIndex.is_boolean
pyspark.pandas.CategoricalIndex.is_categorical
pyspark.pandas.CategoricalIndex.is_floating
pyspark.pandas.CategoricalIndex.is_integer
pyspark.pandas.CategoricalIndex.is_interval
pyspark.pandas.CategoricalIndex.is_mixed
pyspark.pandas.CategoricalIndex.is_numeric
pyspark.pandas.CategoricalIndex.is_object
pyspark.pandas.CategoricalIndex.is_type_compatible
pyspark.pandas.CategoricalIndex.isin
pyspark.pandas.CategoricalIndex.isna
pyspark.pandas.CategoricalIndex.isnull
pyspark.pandas.CategoricalIndex.item
pyspark.pandas.CategoricalIndex.join
pyspark.pandas.CategoricalIndex.memory_usage
pyspark.pandas.CategoricalIndex.notna
pyspark.pandas.CategoricalIndex.notnull
pyspark.pandas.CategoricalIndex.nunique
pyspark.pandas.CategoricalIndex.putmask
pyspark.pandas.CategoricalIndex.ravel
pyspark.pandas.CategoricalIndex.rename
pyspark.pandas.CategoricalIndex.repeat
pyspark.pandas.CategoricalIndex.set_names
pyspark.pandas.CategoricalIndex.set_value
pyspark.pandas.CategoricalIndex.shift
pyspark.pandas.CategoricalIndex.slice_indexer
pyspark.pandas.CategoricalIndex.slice_locs
pyspark.pandas.CategoricalIndex.sort
pyspark.pandas.CategoricalIndex.sort_values
pyspark.pandas.CategoricalIndex.sortlevel
pyspark.pandas.CategoricalIndex.symmetric_difference
pyspark.pandas.CategoricalIndex.take
pyspark.pandas.CategoricalIndex.to_flat_index
pyspark.pandas.CategoricalIndex.to_frame
pyspark.pandas.CategoricalIndex.to_list
pyspark.pandas.CategoricalIndex.to_native_types
pyspark.pandas.CategoricalIndex.to_numpy
pyspark.pandas.CategoricalIndex.to_series
pyspark.pandas.CategoricalIndex.transpose
pyspark.pandas.CategoricalIndex.union
pyspark.pandas.CategoricalIndex.unique
pyspark.pandas.CategoricalIndex.value_counts
pyspark.pandas.CategoricalIndex.view
pyspark.pandas.CategoricalIndex.where
pyspark.pandas.DataFrame.abs
pyspark.pandas.DataFrame.add_prefix
pyspark.pandas.DataFrame.add_suffix
pyspark.pandas.DataFrame.asof
pyspark.pandas.DataFrame.astype
pyspark.pandas.DataFrame.at_time
pyspark.pandas.DataFrame.backfill
pyspark.pandas.DataFrame.between_time
pyspark.pandas.DataFrame.bool
pyspark.pandas.DataFrame.convert_dtypes
pyspark.pandas.DataFrame.copy
pyspark.pandas.DataFrame.describe
pyspark.pandas.DataFrame.droplevel
pyspark.pandas.DataFrame.equals
pyspark.pandas.DataFrame.ewm
pyspark.pandas.DataFrame.expanding
pyspark.pandas.DataFrame.filter
pyspark.pandas.DataFrame.first
pyspark.pandas.DataFrame.first_valid_index
pyspark.pandas.DataFrame.get
pyspark.pandas.DataFrame.head
pyspark.pandas.DataFrame.infer_objects
pyspark.pandas.DataFrame.keys
pyspark.pandas.DataFrame.last
pyspark.pandas.DataFrame.last_valid_index
pyspark.pandas.DataFrame.pad
pyspark.pandas.DataFrame.pct_change
pyspark.pandas.DataFrame.pipe
pyspark.pandas.DataFrame.rank
pyspark.pandas.DataFrame.reindex_like
pyspark.pandas.DataFrame.rename_axis
pyspark.pandas.DataFrame.rolling
pyspark.pandas.DataFrame.sample
pyspark.pandas.DataFrame.set_flags
pyspark.pandas.DataFrame.slice_shift
pyspark.pandas.DataFrame.squeeze
pyspark.pandas.DataFrame.swapaxes
pyspark.pandas.DataFrame.tail
pyspark.pandas.DataFrame.take
pyspark.pandas.DataFrame.to_clipboard
pyspark.pandas.DataFrame.to_csv
pyspark.pandas.DataFrame.to_excel
pyspark.pandas.DataFrame.to_hdf
pyspark.pandas.DataFrame.to_json
pyspark.pandas.DataFrame.to_latex
pyspark.pandas.DataFrame.to_pickle
pyspark.pandas.DataFrame.to_sql
pyspark.pandas.DataFrame.to_xarray
pyspark.pandas.DataFrame.truncate
pyspark.pandas.DataFrame.tshift
pyspark.pandas.DataFrame.tz_convert
pyspark.pandas.DataFrame.tz_localize
pyspark.pandas.DataFrame.xs
pyspark.pandas.DatetimeIndex.all
pyspark.pandas.DatetimeIndex.any
pyspark.pandas.DatetimeIndex.append
pyspark.pandas.DatetimeIndex.argmax
pyspark.pandas.DatetimeIndex.argmin
pyspark.pandas.DatetimeIndex.argsort
pyspark.pandas.DatetimeIndex.asof
pyspark.pandas.DatetimeIndex.asof_locs
pyspark.pandas.DatetimeIndex.astype
pyspark.pandas.DatetimeIndex.copy
pyspark.pandas.DatetimeIndex.delete
pyspark.pandas.DatetimeIndex.difference
pyspark.pandas.DatetimeIndex.drop
pyspark.pandas.DatetimeIndex.drop_duplicates
pyspark.pandas.DatetimeIndex.droplevel
pyspark.pandas.DatetimeIndex.dropna
pyspark.pandas.DatetimeIndex.duplicated
pyspark.pandas.DatetimeIndex.equals
pyspark.pandas.DatetimeIndex.factorize
pyspark.pandas.DatetimeIndex.fillna
pyspark.pandas.DatetimeIndex.format
pyspark.pandas.DatetimeIndex.get_indexer
pyspark.pandas.DatetimeIndex.get_indexer_for
pyspark.pandas.DatetimeIndex.get_indexer_non_unique
pyspark.pandas.DatetimeIndex.get_level_values
pyspark.pandas.DatetimeIndex.get_value
pyspark.pandas.DatetimeIndex.groupby
pyspark.pandas.DatetimeIndex.holds_integer
pyspark.pandas.DatetimeIndex.identical
pyspark.pandas.DatetimeIndex.insert
pyspark.pandas.DatetimeIndex.intersection
pyspark.pandas.DatetimeIndex.is_
pyspark.pandas.DatetimeIndex.is_boolean
pyspark.pandas.DatetimeIndex.is_categorical
pyspark.pandas.DatetimeIndex.is_floating
pyspark.pandas.DatetimeIndex.is_integer
pyspark.pandas.DatetimeIndex.is_interval
pyspark.pandas.DatetimeIndex.is_mixed
pyspark.pandas.DatetimeIndex.is_numeric
pyspark.pandas.DatetimeIndex.is_object
pyspark.pandas.DatetimeIndex.is_type_compatible
pyspark.pandas.DatetimeIndex.isin
pyspark.pandas.DatetimeIndex.isna
pyspark.pandas.DatetimeIndex.isnull
pyspark.pandas.DatetimeIndex.item
pyspark.pandas.DatetimeIndex.join
pyspark.pandas.DatetimeIndex.map
pyspark.pandas.DatetimeIndex.max
pyspark.pandas.DatetimeIndex.mean
pyspark.pandas.DatetimeIndex.memory_usage
pyspark.pandas.DatetimeIndex.min
pyspark.pandas.DatetimeIndex.notna
pyspark.pandas.DatetimeIndex.notnull
pyspark.pandas.DatetimeIndex.nunique
pyspark.pandas.DatetimeIndex.putmask
pyspark.pandas.DatetimeIndex.ravel
pyspark.pandas.DatetimeIndex.reindex
pyspark.pandas.DatetimeIndex.rename
pyspark.pandas.DatetimeIndex.repeat
pyspark.pandas.DatetimeIndex.searchsorted
pyspark.pandas.DatetimeIndex.set_names
pyspark.pandas.DatetimeIndex.set_value
pyspark.pandas.DatetimeIndex.shift
pyspark.pandas.DatetimeIndex.slice_locs
pyspark.pandas.DatetimeIndex.sort
pyspark.pandas.DatetimeIndex.sort_values
pyspark.pandas.DatetimeIndex.sortlevel
pyspark.pandas.DatetimeIndex.symmetric_difference
pyspark.pandas.DatetimeIndex.take
pyspark.pandas.DatetimeIndex.to_flat_index
pyspark.pandas.DatetimeIndex.to_frame
pyspark.pandas.DatetimeIndex.to_list
pyspark.pandas.DatetimeIndex.to_native_types
pyspark.pandas.DatetimeIndex.to_numpy
pyspark.pandas.DatetimeIndex.tolist
pyspark.pandas.DatetimeIndex.transpose
pyspark.pandas.DatetimeIndex.union
pyspark.pandas.DatetimeIndex.unique
pyspark.pandas.DatetimeIndex.value_counts
pyspark.pandas.DatetimeIndex.view
pyspark.pandas.DatetimeIndex.where
pyspark.pandas.Float64Index.all
pyspark.pandas.Float64Index.any
pyspark.pandas.Float64Index.append
pyspark.pandas.Float64Index.argmax
pyspark.pandas.Float64Index.argmin
pyspark.pandas.Float64Index.argsort
pyspark.pandas.Float64Index.asof
pyspark.pandas.Float64Index.asof_locs
pyspark.pandas.Float64Index.astype
pyspark.pandas.Float64Index.copy
pyspark.pandas.Float64Index.delete
pyspark.pandas.Float64Index.difference
pyspark.pandas.Float64Index.drop
pyspark.pandas.Float64Index.drop_duplicates
pyspark.pandas.Float64Index.droplevel
pyspark.pandas.Float64Index.dropna
pyspark.pandas.Float64Index.duplicated
pyspark.pandas.Float64Index.equals
pyspark.pandas.Float64Index.factorize
pyspark.pandas.Float64Index.fillna
pyspark.pandas.Float64Index.format
pyspark.pandas.Float64Index.get_indexer
pyspark.pandas.Float64Index.get_indexer_for
pyspark.pandas.Float64Index.get_indexer_non_unique
pyspark.pandas.Float64Index.get_level_values
pyspark.pandas.Float64Index.get_loc
pyspark.pandas.Float64Index.get_slice_bound
pyspark.pandas.Float64Index.get_value
pyspark.pandas.Float64Index.groupby
pyspark.pandas.Float64Index.holds_integer
pyspark.pandas.Float64Index.identical
pyspark.pandas.Float64Index.insert
pyspark.pandas.Float64Index.intersection
pyspark.pandas.Float64Index.is_
pyspark.pandas.Float64Index.is_boolean
pyspark.pandas.Float64Index.is_categorical
pyspark.pandas.Float64Index.is_floating
pyspark.pandas.Float64Index.is_integer
pyspark.pandas.Float64Index.is_interval
pyspark.pandas.Float64Index.is_mixed
pyspark.pandas.Float64Index.is_numeric
pyspark.pandas.Float64Index.is_object
pyspark.pandas.Float64Index.is_type_compatible
pyspark.pandas.Float64Index.isin
pyspark.pandas.Float64Index.isna
pyspark.pandas.Float64Index.isnull
pyspark.pandas.Float64Index.item
pyspark.pandas.Float64Index.join
pyspark.pandas.Float64Index.map
pyspark.pandas.Float64Index.max
pyspark.pandas.Float64Index.memory_usage
pyspark.pandas.Float64Index.min
pyspark.pandas.Float64Index.notna
pyspark.pandas.Float64Index.notnull
pyspark.pandas.Float64Index.nunique
pyspark.pandas.Float64Index.putmask
pyspark.pandas.Float64Index.ravel
pyspark.pandas.Float64Index.reindex
pyspark.pandas.Float64Index.rename
pyspark.pandas.Float64Index.repeat
pyspark.pandas.Float64Index.searchsorted
pyspark.pandas.Float64Index.set_names
pyspark.pandas.Float64Index.set_value
pyspark.pandas.Float64Index.shift
pyspark.pandas.Float64Index.slice_indexer
pyspark.pandas.Float64Index.slice_locs
pyspark.pandas.Float64Index.sort
pyspark.pandas.Float64Index.sort_values
pyspark.pandas.Float64Index.sortlevel
pyspark.pandas.Float64Index.symmetric_difference
pyspark.pandas.Float64Index.take
pyspark.pandas.Float64Index.to_flat_index
pyspark.pandas.Float64Index.to_frame
pyspark.pandas.Float64Index.to_list
pyspark.pandas.Float64Index.to_native_types
pyspark.pandas.Float64Index.to_numpy
pyspark.pandas.Float64Index.to_series
pyspark.pandas.Float64Index.tolist
pyspark.pandas.Float64Index.transpose
pyspark.pandas.Float64Index.union
pyspark.pandas.Float64Index.unique
pyspark.pandas.Float64Index.value_counts
pyspark.pandas.Float64Index.view
pyspark.pandas.Float64Index.where
pyspark.pandas.Index.factorize
pyspark.pandas.Index.item
pyspark.pandas.Index.nunique
pyspark.pandas.Index.searchsorted
pyspark.pandas.Index.to_list
pyspark.pandas.Index.to_numpy
pyspark.pandas.Index.tolist
pyspark.pandas.Index.transpose
pyspark.pandas.Index.value_counts
pyspark.pandas.Int64Index.all
pyspark.pandas.Int64Index.any
pyspark.pandas.Int64Index.append
pyspark.pandas.Int64Index.argmax
pyspark.pandas.Int64Index.argmin
pyspark.pandas.Int64Index.argsort
pyspark.pandas.Int64Index.asof
pyspark.pandas.Int64Index.asof_locs
pyspark.pandas.Int64Index.astype
pyspark.pandas.Int64Index.copy
pyspark.pandas.Int64Index.delete
pyspark.pandas.Int64Index.difference
pyspark.pandas.Int64Index.drop
pyspark.pandas.Int64Index.drop_duplicates
pyspark.pandas.Int64Index.droplevel
pyspark.pandas.Int64Index.dropna
pyspark.pandas.Int64Index.duplicated
pyspark.pandas.Int64Index.equals
pyspark.pandas.Int64Index.factorize
pyspark.pandas.Int64Index.fillna
pyspark.pandas.Int64Index.format
pyspark.pandas.Int64Index.get_indexer
pyspark.pandas.Int64Index.get_indexer_for
pyspark.pandas.Int64Index.get_indexer_non_unique
pyspark.pandas.Int64Index.get_level_values
pyspark.pandas.Int64Index.get_loc
pyspark.pandas.Int64Index.get_slice_bound
pyspark.pandas.Int64Index.get_value
pyspark.pandas.Int64Index.groupby
pyspark.pandas.Int64Index.holds_integer
pyspark.pandas.Int64Index.identical
pyspark.pandas.Int64Index.insert
pyspark.pandas.Int64Index.intersection
pyspark.pandas.Int64Index.is_
pyspark.pandas.Int64Index.is_boolean
pyspark.pandas.Int64Index.is_categorical
pyspark.pandas.Int64Index.is_floating
pyspark.pandas.Int64Index.is_integer
pyspark.pandas.Int64Index.is_interval
pyspark.pandas.Int64Index.is_mixed
pyspark.pandas.Int64Index.is_numeric
pyspark.pandas.Int64Index.is_object
pyspark.pandas.Int64Index.is_type_compatible
pyspark.pandas.Int64Index.isin
pyspark.pandas.Int64Index.isna
pyspark.pandas.Int64Index.isnull
pyspark.pandas.Int64Index.item
pyspark.pandas.Int64Index.join
pyspark.pandas.Int64Index.map
pyspark.pandas.Int64Index.max
pyspark.pandas.Int64Index.memory_usage
pyspark.pandas.Int64Index.min
pyspark.pandas.Int64Index.notna
pyspark.pandas.Int64Index.notnull
pyspark.pandas.Int64Index.nunique
pyspark.pandas.Int64Index.putmask
pyspark.pandas.Int64Index.ravel
pyspark.pandas.Int64Index.reindex
pyspark.pandas.Int64Index.rename
pyspark.pandas.Int64Index.repeat
pyspark.pandas.Int64Index.searchsorted
pyspark.pandas.Int64Index.set_names
pyspark.pandas.Int64Index.set_value
pyspark.pandas.Int64Index.shift
pyspark.pandas.Int64Index.slice_indexer
pyspark.pandas.Int64Index.slice_locs
pyspark.pandas.Int64Index.sort
pyspark.pandas.Int64Index.sort_values
pyspark.pandas.Int64Index.sortlevel
pyspark.pandas.Int64Index.symmetric_difference
pyspark.pandas.Int64Index.take
pyspark.pandas.Int64Index.to_flat_index
pyspark.pandas.Int64Index.to_frame
pyspark.pandas.Int64Index.to_list
pyspark.pandas.Int64Index.to_native_types
pyspark.pandas.Int64Index.to_numpy
pyspark.pandas.Int64Index.to_series
pyspark.pandas.Int64Index.tolist
pyspark.pandas.Int64Index.transpose
pyspark.pandas.Int64Index.union
pyspark.pandas.Int64Index.unique
pyspark.pandas.Int64Index.value_counts
pyspark.pandas.Int64Index.view
pyspark.pandas.Int64Index.where
pyspark.pandas.MultiIndex.all
pyspark.pandas.MultiIndex.any
pyspark.pandas.MultiIndex.argmax
pyspark.pandas.MultiIndex.argmin
pyspark.pandas.MultiIndex.asof
pyspark.pandas.MultiIndex.asof_locs
pyspark.pandas.MultiIndex.difference
pyspark.pandas.MultiIndex.droplevel
pyspark.pandas.MultiIndex.factorize
pyspark.pandas.MultiIndex.get_indexer
pyspark.pandas.MultiIndex.get_indexer_for
pyspark.pandas.MultiIndex.get_indexer_non_unique
pyspark.pandas.MultiIndex.get_value
pyspark.pandas.MultiIndex.groupby
pyspark.pandas.MultiIndex.holds_integer
pyspark.pandas.MultiIndex.identical
pyspark.pandas.MultiIndex.intersection
pyspark.pandas.MultiIndex.is_
pyspark.pandas.MultiIndex.is_boolean
pyspark.pandas.MultiIndex.is_categorical
pyspark.pandas.MultiIndex.is_floating
pyspark.pandas.MultiIndex.is_integer
pyspark.pandas.MultiIndex.is_interval
pyspark.pandas.MultiIndex.is_mixed
pyspark.pandas.MultiIndex.is_numeric
pyspark.pandas.MultiIndex.is_object
pyspark.pandas.MultiIndex.is_type_compatible
pyspark.pandas.MultiIndex.isna
pyspark.pandas.MultiIndex.isnull
pyspark.pandas.MultiIndex.item
pyspark.pandas.MultiIndex.join
pyspark.pandas.MultiIndex.map
pyspark.pandas.MultiIndex.max
pyspark.pandas.MultiIndex.min
pyspark.pandas.MultiIndex.notna
pyspark.pandas.MultiIndex.notnull
pyspark.pandas.MultiIndex.nunique
pyspark.pandas.MultiIndex.putmask
pyspark.pandas.MultiIndex.ravel
pyspark.pandas.MultiIndex.reindex
pyspark.pandas.MultiIndex.searchsorted
pyspark.pandas.MultiIndex.set_value
pyspark.pandas.MultiIndex.shift
pyspark.pandas.MultiIndex.slice_indexer
pyspark.pandas.MultiIndex.sort
pyspark.pandas.MultiIndex.sort_values
pyspark.pandas.MultiIndex.symmetric_difference
pyspark.pandas.MultiIndex.to_list
pyspark.pandas.MultiIndex.to_native_types
pyspark.pandas.MultiIndex.to_numpy
pyspark.pandas.MultiIndex.to_series
pyspark.pandas.MultiIndex.tolist
pyspark.pandas.MultiIndex.transpose
pyspark.pandas.MultiIndex.union
pyspark.pandas.MultiIndex.value_counts
pyspark.pandas.MultiIndex.where
pyspark.pandas.Series.abs
pyspark.pandas.Series.add_prefix
pyspark.pandas.Series.add_suffix
pyspark.pandas.Series.argmax
pyspark.pandas.Series.argmin
pyspark.pandas.Series.asof
pyspark.pandas.Series.astype
pyspark.pandas.Series.at_time
pyspark.pandas.Series.backfill
pyspark.pandas.Series.between_time
pyspark.pandas.Series.bool
pyspark.pandas.Series.convert_dtypes
pyspark.pandas.Series.copy
pyspark.pandas.Series.describe
pyspark.pandas.Series.droplevel
pyspark.pandas.Series.equals
pyspark.pandas.Series.ewm
pyspark.pandas.Series.expanding
pyspark.pandas.Series.factorize
pyspark.pandas.Series.filter
pyspark.pandas.Series.first
pyspark.pandas.Series.first_valid_index
pyspark.pandas.Series.get
pyspark.pandas.Series.head
pyspark.pandas.Series.infer_objects
pyspark.pandas.Series.item
pyspark.pandas.Series.last
pyspark.pandas.Series.last_valid_index
pyspark.pandas.Series.nunique
pyspark.pandas.Series.pad
pyspark.pandas.Series.pct_change
pyspark.pandas.Series.pipe
pyspark.pandas.Series.rank
pyspark.pandas.Series.reindex_like
pyspark.pandas.Series.rename_axis
pyspark.pandas.Series.rolling
pyspark.pandas.Series.sample
pyspark.pandas.Series.set_flags
pyspark.pandas.Series.slice_shift
pyspark.pandas.Series.squeeze
pyspark.pandas.Series.swapaxes
pyspark.pandas.Series.tail
pyspark.pandas.Series.to_clipboard
pyspark.pandas.Series.to_csv
pyspark.pandas.Series.to_excel
pyspark.pandas.Series.to_hdf
pyspark.pandas.Series.to_json
pyspark.pandas.Series.to_latex
pyspark.pandas.Series.to_list
pyspark.pandas.Series.to_numpy
pyspark.pandas.Series.to_pickle
pyspark.pandas.Series.to_sql
pyspark.pandas.Series.to_xarray
pyspark.pandas.Series.tolist
pyspark.pandas.Series.transpose
pyspark.pandas.Series.truncate
pyspark.pandas.Series.tshift
pyspark.pandas.Series.tz_convert
pyspark.pandas.Series.tz_localize
pyspark.pandas.Series.value_counts
pyspark.pandas.Series.xs
pyspark.pandas.groupby.DataFrameGroupBy.all
pyspark.pandas.groupby.DataFrameGroupBy.any
pyspark.pandas.groupby.DataFrameGroupBy.apply
pyspark.pandas.groupby.DataFrameGroupBy.backfill
pyspark.pandas.groupby.DataFrameGroupBy.bfill
pyspark.pandas.groupby.DataFrameGroupBy.count
pyspark.pandas.groupby.DataFrameGroupBy.cumcount
pyspark.pandas.groupby.DataFrameGroupBy.cummax
pyspark.pandas.groupby.DataFrameGroupBy.cummin
pyspark.pandas.groupby.DataFrameGroupBy.cumprod
pyspark.pandas.groupby.DataFrameGroupBy.cumsum
pyspark.pandas.groupby.DataFrameGroupBy.describe
pyspark.pandas.groupby.DataFrameGroupBy.ewm
pyspark.pandas.groupby.DataFrameGroupBy.expanding
pyspark.pandas.groupby.DataFrameGroupBy.ffill
pyspark.pandas.groupby.DataFrameGroupBy.first
pyspark.pandas.groupby.DataFrameGroupBy.get_group
pyspark.pandas.groupby.DataFrameGroupBy.head
pyspark.pandas.groupby.DataFrameGroupBy.last
pyspark.pandas.groupby.DataFrameGroupBy.max
pyspark.pandas.groupby.DataFrameGroupBy.mean
pyspark.pandas.groupby.DataFrameGroupBy.median
pyspark.pandas.groupby.DataFrameGroupBy.min
pyspark.pandas.groupby.DataFrameGroupBy.ngroup
pyspark.pandas.groupby.DataFrameGroupBy.nth
pyspark.pandas.groupby.DataFrameGroupBy.ohlc
pyspark.pandas.groupby.DataFrameGroupBy.pad
pyspark.pandas.groupby.DataFrameGroupBy.pct_change
pyspark.pandas.groupby.DataFrameGroupBy.pipe
pyspark.pandas.groupby.DataFrameGroupBy.prod
pyspark.pandas.groupby.DataFrameGroupBy.quantile
pyspark.pandas.groupby.DataFrameGroupBy.rank
pyspark.pandas.groupby.DataFrameGroupBy.resample
pyspark.pandas.groupby.DataFrameGroupBy.rolling
pyspark.pandas.groupby.DataFrameGroupBy.sample
pyspark.pandas.groupby.DataFrameGroupBy.sem
pyspark.pandas.groupby.DataFrameGroupBy.shift
pyspark.pandas.groupby.DataFrameGroupBy.size
pyspark.pandas.groupby.DataFrameGroupBy.std
pyspark.pandas.groupby.DataFrameGroupBy.sum
pyspark.pandas.groupby.DataFrameGroupBy.tail
pyspark.pandas.groupby.DataFrameGroupBy.var
pyspark.pandas.groupby.GroupBy.agg
pyspark.pandas.groupby.GroupBy.aggregate
pyspark.pandas.groupby.GroupBy.get_group
pyspark.pandas.groupby.GroupBy.pipe
pyspark.pandas.groupby.SeriesGroupBy.all
pyspark.pandas.groupby.SeriesGroupBy.any
pyspark.pandas.groupby.SeriesGroupBy.backfill
pyspark.pandas.groupby.SeriesGroupBy.bfill
pyspark.pandas.groupby.SeriesGroupBy.count
pyspark.pandas.groupby.SeriesGroupBy.cumcount
pyspark.pandas.groupby.SeriesGroupBy.cummax
pyspark.pandas.groupby.SeriesGroupBy.cummin
pyspark.pandas.groupby.SeriesGroupBy.cumprod
pyspark.pandas.groupby.SeriesGroupBy.cumsum
pyspark.pandas.groupby.SeriesGroupBy.ewm
pyspark.pandas.groupby.SeriesGroupBy.expanding
pyspark.pandas.groupby.SeriesGroupBy.ffill
pyspark.pandas.groupby.SeriesGroupBy.first
pyspark.pandas.groupby.SeriesGroupBy.get_group
pyspark.pandas.groupby.SeriesGroupBy.head
pyspark.pandas.groupby.SeriesGroupBy.last
pyspark.pandas.groupby.SeriesGroupBy.max
pyspark.pandas.groupby.SeriesGroupBy.mean
pyspark.pandas.groupby.SeriesGroupBy.median
pyspark.pandas.groupby.SeriesGroupBy.min
pyspark.pandas.groupby.SeriesGroupBy.ngroup
pyspark.pandas.groupby.SeriesGroupBy.nth
pyspark.pandas.groupby.SeriesGroupBy.ohlc
pyspark.pandas.groupby.SeriesGroupBy.pad
pyspark.pandas.groupby.SeriesGroupBy.pct_change
pyspark.pandas.groupby.SeriesGroupBy.pipe
pyspark.pandas.groupby.SeriesGroupBy.prod
pyspark.pandas.groupby.SeriesGroupBy.quantile
pyspark.pandas.groupby.SeriesGroupBy.rank
pyspark.pandas.groupby.SeriesGroupBy.resample
pyspark.pandas.groupby.SeriesGroupBy.rolling
pyspark.pandas.groupby.SeriesGroupBy.sample
pyspark.pandas.groupby.SeriesGroupBy.sem
pyspark.pandas.groupby.SeriesGroupBy.shift
pyspark.pandas.groupby.SeriesGroupBy.size
pyspark.pandas.groupby.SeriesGroupBy.std
pyspark.pandas.groupby.SeriesGroupBy.sum
pyspark.pandas.groupby.SeriesGroupBy.tail
pyspark.pandas.groupby.SeriesGroupBy.var
pyspark.pandas.window.Expanding.validate
pyspark.pandas.window.ExpandingGroupby.agg
pyspark.pandas.window.ExpandingGroupby.aggregate
pyspark.pandas.window.ExpandingGroupby.apply
pyspark.pandas.window.ExpandingGroupby.corr
pyspark.pandas.window.ExpandingGroupby.count
pyspark.pandas.window.ExpandingGroupby.cov
pyspark.pandas.window.ExpandingGroupby.kurt
pyspark.pandas.window.ExpandingGroupby.max
pyspark.pandas.window.ExpandingGroupby.mean
pyspark.pandas.window.ExpandingGroupby.median
pyspark.pandas.window.ExpandingGroupby.min
pyspark.pandas.window.ExpandingGroupby.quantile
pyspark.pandas.window.ExpandingGroupby.rank
pyspark.pandas.window.ExpandingGroupby.sem
pyspark.pandas.window.ExpandingGroupby.skew
pyspark.pandas.window.ExpandingGroupby.std
pyspark.pandas.window.ExpandingGroupby.sum
pyspark.pandas.window.ExpandingGroupby.validate
pyspark.pandas.window.ExpandingGroupby.var
pyspark.pandas.window.Rolling.validate
pyspark.pandas.window.RollingGroupby.agg
pyspark.pandas.window.RollingGroupby.aggregate
pyspark.pandas.window.RollingGroupby.apply
pyspark.pandas.window.RollingGroupby.corr
pyspark.pandas.window.RollingGroupby.count
pyspark.pandas.window.RollingGroupby.cov
pyspark.pandas.window.RollingGroupby.kurt
pyspark.pandas.window.RollingGroupby.max
pyspark.pandas.window.RollingGroupby.mean
pyspark.pandas.window.RollingGroupby.median
pyspark.pandas.window.RollingGroupby.min
pyspark.pandas.window.RollingGroupby.quantile
pyspark.pandas.window.RollingGroupby.rank
pyspark.pandas.window.RollingGroupby.sem
pyspark.pandas.window.RollingGroupby.skew
pyspark.pandas.window.RollingGroupby.std
pyspark.pandas.window.RollingGroupby.sum
pyspark.pandas.window.RollingGroupby.validate
pyspark.pandas.window.RollingGroupby.var
pyspark.pandas.window.Window.validate

@AmplabJenkins
Copy link

Can one of the admins verify this patch?

Copy link
Member

@HyukjinKwon HyukjinKwon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for working on this @beobest2 and @Yikun for reviewing.

@HyukjinKwon
Copy link
Member

Merged to master.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants