ENH: Enable unary math operations for pandas, sqlite by cpcloud · Pull Request #1071 · ibis-project/ibis

cpcloud · 2017-07-19T21:56:11Z

Implement decimal for pandas
Add SQLite unary ops
Fix operations in postgres that require numeric

cpcloud · 2017-07-21T15:38:09Z

@wesm can you take a look here? bit of a rabbit hole with decimals, otherwise just adding unary ops to series.

wesm

Minor comments, but otherwise LGTM

wesm · 2017-07-24T01:09:20Z

ibis/pandas/client.py

+        if column_name in schema:
+            ibis_type = dt.validate_type(schema[column_name])
+        elif dtype == np.object_:
+            inferred_dtype = infer_dtype(df[column_name].dropna())


Yikes. I guess we should make a NaN-friendly type inference function someplace (seems like an oversight in infer_dtype originally)

can u post an issue in pandas tracker about this

done: pandas-dev/pandas#17059

I PR'd it :) pandas-dev/pandas#17066.

wesm · 2017-07-24T01:12:04Z

ibis/pandas/execution.py

+def execute_series_unary_op(op, data, scope=None):
+    function = getattr(np, type(op).__name__.lower())
+    if data.dtype == np.dtype(np.object_):
+        return data.apply(functools.partial(execute_node, op, scope=scope))


Is Series.apply different from Series.map (in behavior or performance)?

Don't think so, @jreback any idea here?

So, it looks like Series.map accepts a dict, to support a simple CASE-statement-like operation, as well as callables, whereas apply only deals with callables. For callables, both methods use the same underlying function lib.map_infer to call the passed in callable in a Cython loop. Here we could use either since we're only dealing with callables.

wesm · 2017-07-24T01:13:42Z

ibis/pandas/execution.py

+def execute_series_log_with_base(op, data, base, scope=None):
+    if data.dtype == np.dtype(np.object_):
+        func = np.vectorize(functools.partial(execute_node, op, scope=scope))
+        return pd.Series(func(data, base), index=data.index, name=data.name)


Perhaps this bit could be factored out into a helper function since it's repeated a couple times (in case it's useful in future execution rules)

Yeah, good idea!

wesm · 2017-07-24T01:16:04Z

ibis/sql/postgres/compiler.py

+
+def _floor_divide(t, expr):
+    left, right = map(t.translate, expr.op().args)
+    return sa.func.floor(left / right)


Does integer division in postgres yield doubles?

No, it yields integers. This is implemented so that it works regardless of the type of left and right.

Got it, I was just curious =)

Implement decimal for pandas Add SQLite unary ops Fix operations in postgres that require numeric

cpcloud · 2017-07-27T17:06:08Z

Merging on green.

cpcloud force-pushed the unary-ops branch 3 times, most recently from 0dc84c3 to 13a0422 Compare July 20, 2017 00:30

cpcloud self-assigned this Jul 20, 2017

cpcloud added the feature Features or general enhancements label Jul 20, 2017

cpcloud added this to the 0.11.3 milestone Jul 20, 2017

cpcloud force-pushed the unary-ops branch 5 times, most recently from d6c9085 to 3322844 Compare July 21, 2017 02:39

cpcloud requested a review from wesm July 21, 2017 15:37

cpcloud force-pushed the unary-ops branch from 3322844 to e059f7f Compare July 22, 2017 18:22

wesm reviewed Jul 24, 2017

View reviewed changes

cpcloud added 2 commits July 27, 2017 13:01

ENH: Enable unary math operations for pandas, sqlite

0d863d1

Implement decimal for pandas Add SQLite unary ops Fix operations in postgres that require numeric

REF: Factor vectorize object function

d94b0c6

cpcloud force-pushed the unary-ops branch from e46867f to d94b0c6 Compare July 27, 2017 17:01

BUG: Pass args and kwargs

57ff2b1

cpcloud closed this in 9882b5a Jul 27, 2017

cpcloud deleted the unary-ops branch July 27, 2017 19:48

gerrymanoim mentioned this pull request Dec 18, 2020

Dask backend execution #2557

Merged

Conversation

cpcloud commented Jul 19, 2017

Uh oh!

cpcloud commented Jul 21, 2017

Uh oh!

wesm left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cpcloud commented Jul 27, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants