Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Release #3

Open
tvo opened this issue Oct 30, 2012 · 2 comments
Open

Release #3

tvo opened this issue Oct 30, 2012 · 2 comments

Comments

@tvo
Copy link

tvo commented Oct 30, 2012

Hey,

Any chance you could push out a new release that includes the performance optimizations?

TIA

-Tobi

@jpmckinney
Copy link

See https://github.com/opennorth/tf-idf-similarity for a gem with a number of improvements, both in performance and in tf*idf implementation. reddavis' gem normalizes the frequency of a term in a document to the number of terms in that document (which, as far as I can tell, never occurs in the academic literature) and has no normalization component. My gem uses the same formula as Lucene and other major implementations.

@arn-e
Copy link

arn-e commented Feb 5, 2013

Came here to make the same request. Thanks for the link to your project, jpmckinney.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants