@jordy25519

What changes were proposed in this pull request?

Implemented a Python call to topicDistributions for pyspark.mllib.clustering.LDAModel.

How was this patch tested?

Ran ./dev/run-tests, all passing
Manually verified.
Used function parameter types, return types etc. from existing API calls so all behaviour is consistent with existing behaviour.


@AmplabJenkins

Can one of the admins verify this patch?

Contributor

@holdenk holdenk left a comment

@supremekai Is this something you are still interested in working on?

JavaPairRDD.fromRDD(topicDistributions.asInstanceOf[RDD[(java.lang.Long, Vector)]])
}

override def topicDistributions(documents: RDD[(Long, Vector)]): RDD[(Long, Vector)] = {
Contributor

Is this what we want here? Defining it on the parent when half of the children don't implement it might be confusing to some users.

Author

@holdenk I'm keen to work on this. I definitely agree, but I'm not sure how else to approach this without implementing the logic for distributed LDA models.
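
To make the concern in this thread concrete, here is a minimal pure-Python sketch with no Spark involved. The class names `BaseLDAModel`, `LocalModel` and `DistributedModel`, and the uniform placeholder logic, are hypothetical stand-ins rather than the actual Spark classes or this patch's code. It only illustrates what a caller might see when a method is declared on a parent class but implemented by just one of its children.

```python
class BaseLDAModel:
    """Hypothetical parent class; stands in for an abstract LDA model."""

    def topic_distributions(self, documents):
        # Declared on the parent, so every subclass appears to support it.
        raise NotImplementedError(
            f"{type(self).__name__} does not implement topic_distributions"
        )


class LocalModel(BaseLDAModel):
    """Hypothetical child that actually implements the method."""

    def __init__(self, num_topics):
        self.num_topics = num_topics

    def topic_distributions(self, documents):
        # Placeholder logic (an assumption for illustration only):
        # give every document a uniform distribution over topics.
        uniform = [1.0 / self.num_topics] * self.num_topics
        return [(doc_id, list(uniform)) for doc_id, _ in documents]


class DistributedModel(BaseLDAModel):
    """Hypothetical child that inherits the method without implementing it."""


# Documents as (id, term-count vector) pairs, mirroring the (Long, Vector) shape.
docs = [(0, [1.0, 0.0, 2.0]), (1, [0.0, 3.0, 1.0])]

local_result = LocalModel(num_topics=2).topic_distributions(docs)

try:
    DistributedModel().topic_distributions(docs)
    distributed_error = None
except NotImplementedError as exc:
    # The method is discoverable on the class but fails only at call time.
    distributed_error = str(exc)
```

In this sketch the method shows up on `DistributedModel` (for example in `dir()` or autocompletion) yet fails only when called, which is the kind of surprise the review comment is worried about. Declaring the method only on the subclass that supports it would avoid that, at the cost of a less uniform interface.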

@jkbradley
Member

@supremekai Thanks for the PR! I'm sorry about the inactivity on this. However, now that it has been added to the DataFrame-based API (in pyspark.ml), we will not be adding it to the RDD-based API. Could you please close this issue?

@srowen srowen mentioned this pull request Feb 2, 2017
@asfgit asfgit closed this in 20b4ca1 Feb 3, 2017
zifeif2 pushed a commit to zifeif2/spark that referenced this pull request Nov 22, 2025
Closes apache#15736
Closes apache#16309
Closes apache#16485
Closes apache#16502
Closes apache#16196
Closes apache#16498
Closes apache#12380
Closes apache#16764

Closes apache#14394
Closes apache#14204
Closes apache#14027
Closes apache#13690
Closes apache#16279

Author: Sean Owen

Closes apache#16778 from srowen/CloseStalePRs.

4 participants