ARROW-1971: [Python] Add pandas serialization to the default #1462

devin-petersohn · 2018-01-06T21:42:12Z

Moving pandas register into default register.

Registering pandas in default Formatting

robertnishihara · 2018-01-06T22:21:24Z

@wesm Is there any reason to use _register_pandas_arrow_handlers over _register_custom_pandas_handlers? Why not always use _register_custom_pandas_handlers? Is there a difference in generality?

wesm · 2018-01-06T22:47:06Z

I'd be perfectly fine with the faster pandas serialization code path. I guess the one thing separating the pandas_serialization_context from the default one would be the handling of NumPy object arrays?

robertnishihara · 2018-01-06T22:53:10Z

Is that related to the choice of _register_pandas_arrow_handlers versus _register_custom_pandas_handlers? Both of those can be used with both of the custom numpy object array serializers, right?

EDIT: I think I see what you mean, yes, if we get rid of _register_pandas_arrow_handlers, then the only difference would be the handling of numpy object arrays.

This seems like the obvious thing to do unless there is some drawback with _register_custom_pandas_handlers.

wesm

+1, thanks @devin-petersohn!

robertnishihara · 2018-01-10T23:47:40Z

Yes, thanks @devin-petersohn! This PR looks good to me.

@wesm, is speed the only difference between the two implementations? Just curious if there are edge cases.

wesm · 2018-01-11T00:02:13Z

I think the new version will have marginally better support for "weird pandas data". If you run into issues please open issues so we can investigate

devin-petersohn changed the title ~~[ARROW-1971] Add pandas serialization to the default~~ [ARROW-1971] [Python] Add pandas serialization to the default Jan 6, 2018

devin-petersohn changed the title ~~[ARROW-1971] [Python] Add pandas serialization to the default~~ ARROW-1971: [Python] Add pandas serialization to the default Jan 6, 2018

devin-petersohn force-pushed the jira/1971_pandas_serialization branch from 83425c9 to e0d4de6 Compare January 6, 2018 22:03

Moving pandas register into default register

2ed3137

Registering pandas in default Formatting

devin-petersohn force-pushed the jira/1971_pandas_serialization branch from e0d4de6 to 2ed3137 Compare January 6, 2018 22:10

Removing slower codepath

b3dfd5b

wesm approved these changes Jan 10, 2018

View reviewed changes

wesm closed this in b49e8f3 Jan 10, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ARROW-1971: [Python] Add pandas serialization to the default #1462

ARROW-1971: [Python] Add pandas serialization to the default #1462

Uh oh!

devin-petersohn commented Jan 6, 2018

Uh oh!

robertnishihara commented Jan 6, 2018

Uh oh!

wesm commented Jan 6, 2018

Uh oh!

robertnishihara commented Jan 6, 2018 •

edited

Loading

Uh oh!

wesm left a comment

Uh oh!

robertnishihara commented Jan 10, 2018

Uh oh!

wesm commented Jan 11, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ARROW-1971: [Python] Add pandas serialization to the default #1462

ARROW-1971: [Python] Add pandas serialization to the default #1462

Uh oh!

Conversation

devin-petersohn commented Jan 6, 2018

Uh oh!

robertnishihara commented Jan 6, 2018

Uh oh!

wesm commented Jan 6, 2018

Uh oh!

robertnishihara commented Jan 6, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wesm left a comment

Choose a reason for hiding this comment

Uh oh!

robertnishihara commented Jan 10, 2018

Uh oh!

wesm commented Jan 11, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

robertnishihara commented Jan 6, 2018 •

edited

Loading