-
Notifications
You must be signed in to change notification settings - Fork 340
File-based Similarity in the Comparison View #1516
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
We should discuss how the EOF Token should be handled. Currently it is excluded in the percent calculation but not in the tooltip, which may be confusing for the user. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great, except for the one thing I commented on.
core/src/main/java/de/jplag/reporting/reportobject/ReportObjectFactory.java
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Regarding the UI, I would argue for not displaying decimal numbers to minimize visual clutter. Also, I would argue for a dark gray instead of black text color.
After merging develop with the immediat zip wirting, the time was increased from ~13 seconds to ~16 seconds for 64 copies of JPlag (comparison time about 3m 40s). This also alligned with a time mesuarement for the the counting loop which accounted for 3 seconds. We should discuss whether this is fine or whether it should be moved. We should factor in how difficult it would be to optimize that. since parsing of submissions is not done in parrallel there would be no perfomance improvement when moving this into the submission class and iterating over the list of tokens returned form the |
…le-file-similarity
I would say performance > readability here.
Doing it in each language is not something I am in favor of. |
…le-file-similarity
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just one small thing.
|
|
Displays the percentage of tokens of a file that are part of a match.
Also displays the total number of tokens in each submission.
The submission file index now holds data about the token count of each file
relates to #1109