Skip to content

Discrete entropy#13203

Closed
atavory wants to merge 0 commit intoprestodb:masterfrom
atavory:discrete_entropy
Closed

Discrete entropy#13203
atavory wants to merge 0 commit intoprestodb:masterfrom
atavory:discrete_entropy

Conversation

@atavory
Copy link
Contributor

@atavory atavory commented Aug 10, 2019

Add sample discrete-entropy UDF.

Together with #13574, this forms the basis for mutual information classification udfs #13163.

@atavory atavory force-pushed the discrete_entropy branch 3 times, most recently from 2a95f7e to 704d387 Compare August 11, 2019 06:11
@atavory atavory force-pushed the discrete_entropy branch 2 times, most recently from a5d890e to 9e3cf4f Compare August 23, 2019 09:54
@atavory atavory force-pushed the discrete_entropy branch 2 times, most recently from 6129678 to 4f4894a Compare September 21, 2019 11:57
@atavory atavory force-pushed the discrete_entropy branch 8 times, most recently from 0091364 to 194d03f Compare October 21, 2019 10:51
@atavory atavory changed the title [WIP] Discrete entropy Discrete entropy Oct 21, 2019
@atavory atavory force-pushed the discrete_entropy branch 3 times, most recently from 0068b91 to 900147c Compare October 23, 2019 14:28
@atavory
Copy link
Contributor Author

atavory commented Nov 9, 2019

@rschlussel Could you please review this too? Have rebased it on #13574. Man thanks!

@ajaygeorge
Copy link
Contributor

@atavory looks like there are some merge conflicts. Can you rebase with master

Copy link
Contributor

@shixuan-fan shixuan-fan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Skimmed till DiscreteEntropyAggregation


The thresholds are defined as a sequence whose :math:`j`-th entry is the :math:`j`-th smallest threshold.


Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Unrelated change?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It is actually related to this diff - double space between the start of the new section in the rst file.

.. math ::

H(x) = - \int x \log_2\left(f(x)\right) dx,
h(x) = - \int x \log_2\left(f(x)\right) dx,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not entirely sure if this is intended.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

With discrete entropy (in this diff), it seems better to be precise and use lowercase "h" for differential entropy.

# See problematic frame for where to report the bug.
#

--------------- T H R E A D ---------------
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This file seems to be unintended changes?

@Override
public long getEstimatedSize()
{
if (strategy == null) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have not read the whole PR but why do we remove this null check? strategy is set in setStrategy so this check seems legit.

@@ -25,35 +31,200 @@
public interface DifferentialEntropyStateStrategy
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Skimmed through this class assuming they are just code moves.

histogram.add(sample, weight);
}

// Tmp Ami @Override
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: we could clean it up :D

@shixuan-fan
Copy link
Contributor

Why is this closed 😂Is it by accident?

@rschlussel
Copy link
Contributor

spoke to @atavory this morning. He accidentally lost his commit while rebasing. He's gonna restore it and put it back up.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants