Add a typed 'substr' column method #263

pgabara · 2018-03-06T20:10:18Z

No description provided.

pgabara · 2018-03-06T20:11:42Z

Connects to #164

codecov-io · 2018-03-06T20:42:57Z

Codecov Report

Merging #263 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master     #263      +/-   ##
==========================================
+ Coverage   96.21%   96.22%   +<.01%     
==========================================
  Files          52       52              
  Lines         924      926       +2     
  Branches        9       11       +2     
==========================================
+ Hits          889      891       +2     
  Misses         35       35

Impacted Files	Coverage Δ
dataset/src/main/scala/frameless/TypedColumn.scala	`100% <100%> (ø)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 992248c...25145af. Read the comment docs.

OlivierBlanvillain · 2018-03-06T21:23:04Z

dataset/src/main/scala/frameless/TypedColumn.scala

+  /**
+    * An expression that returns a substring
+    * {{{
+    *   df.select(df('a).substr(0, 5))


Question: in Spark, is it possible to we mix literals and columns? (.substr(0, df('a)))

nope, there are only substr(startPos: Int, len: Int) and substr(startPos: Column, len: Column)

OlivierBlanvillain · 2018-03-06T21:28:42Z

dataset/src/main/scala/frameless/TypedColumn.scala

+    * @param startPos expression for the starting position
+    * @param len expression for the length of the substring
+    */
+  def substr[TT, W](startPos: ThisType[TT, Int], len: ThisType[TT, Int])


I think this is the first columns method we have that involves 3 columns. The way you wrote it here, with TT used in both startPos and len, you are forcing these two columns to come from the same dataset. Something like the following wouldn't typecheck:

ds1.joins(ds2)(ds1('a) === ds1('a).sustr(ds1('b), ds2('c))

I know it's a contrive example, but to make the above working you could need something like the following:

def substr[TT1, TT2, W1, W2](startPos: ThisType[TT1, Int], len: ThisType[TT2, Int]) (implicit i0: U =:= String, i1: With.Aux[T, TT1, W1], i2: With.Aux[W1, TT2, W2] ) = ...

make sense, I will fix it. thanks

OlivierBlanvillain · 2018-03-07T17:14:23Z

LGTM, thanks!

Add a typed 'substr' column method

ff26405

OlivierBlanvillain reviewed Mar 6, 2018

View reviewed changes

OlivierBlanvillain mentioned this pull request Mar 6, 2018

Missing Columns method #164

Open

Fix method types definition

25145af

OlivierBlanvillain merged commit 7be4d63 into typelevel:master Mar 7, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add a typed 'substr' column method #263

Add a typed 'substr' column method #263

Uh oh!

pgabara commented Mar 6, 2018

Uh oh!

pgabara commented Mar 6, 2018

Uh oh!

codecov-io commented Mar 6, 2018 •

edited

Loading

Uh oh!

OlivierBlanvillain Mar 6, 2018

Uh oh!

pgabara Mar 7, 2018 •

edited

Loading

Uh oh!

OlivierBlanvillain Mar 6, 2018

Uh oh!

pgabara Mar 7, 2018

Uh oh!

OlivierBlanvillain commented Mar 7, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add a typed 'substr' column method #263

Add a typed 'substr' column method #263

Uh oh!

Conversation

pgabara commented Mar 6, 2018

Uh oh!

pgabara commented Mar 6, 2018

Uh oh!

codecov-io commented Mar 6, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

OlivierBlanvillain Mar 6, 2018

Choose a reason for hiding this comment

Uh oh!

pgabara Mar 7, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

OlivierBlanvillain Mar 6, 2018

Choose a reason for hiding this comment

Uh oh!

pgabara Mar 7, 2018

Choose a reason for hiding this comment

Uh oh!

OlivierBlanvillain commented Mar 7, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov-io commented Mar 6, 2018 •

edited

Loading

pgabara Mar 7, 2018 •

edited

Loading