Skip to content

Conversation

@cloud-fan
Copy link
Contributor

What changes were proposed in this pull request?

Follow the convention and rename the metrics numRows to numOutputRows

Why are the changes needed?

FilterExec, HashAggregateExec, etc. all use numOutputRows

Does this PR introduce any user-facing change?

no

How was this patch tested?

existing tests

val dataSize = metrics("dataSize").value
val numRows = metrics("numRows").value
Statistics(dataSize, Some(numRows))
val rowCount = metrics("numOutputRows").value
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

match Statistics.rowCount

val beforeCollect = System.nanoTime()
// Use executeCollect/executeCollectIterator to avoid conversion to Scala types
val (numRows, input) = child.executeCollectIterator()
longMetric("numOutputRows") += numRows
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

always set the metrics even if failure happens.

@cloud-fan
Copy link
Contributor Author

cc @wangyum

Copy link
Member

@wangyum wangyum left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I also considered using numOutputRows at the time. I didn’t use it because ShuffleExchange didn’t use numOutputRows either.

@cloud-fan
Copy link
Contributor Author

thanks for the review! GA passed, merging to master.

@cloud-fan cloud-fan closed this in f3ad32f Oct 14, 2020
@dongjoon-hyun
Copy link
Member

Thank you, @cloud-fan .

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants