bug: Kung Fu false positive #67

krasnoperov · 2024-07-13T19:04:22Z

Expected behavior

matcher.hasMatch('Kung-Fu') returns false

Actual behavior

matcher.hasMatch('Kung-Fu') returns true

Minimal reproducible example

import assert from 'node:assert'

import {
  englishDataset,
  englishRecommendedTransformers,
  RegExpMatcher,
} from 'obscenity'

const matcher = new RegExpMatcher({
  ...englishDataset.build(),
  ...englishRecommendedTransformers,
})

assert.equal(matcher.hasMatch('Kung-Fu'), false)
assert.equal(matcher.hasMatch('Kung Fu'), false)
assert.equal(matcher.hasMatch('Kung Fu Panda'), false)

// This one actually works
assert.equal(matcher.hasMatch('KungFu'), false)

Steps to reproduce

Run the code above
It fails with an assertion error

Additional context

No response

Node.js version

v20.15.0

Obscenity version

0.2.1

Priority

Low
Medium
High

Terms

I agree to follow the project's Code of Conduct.
I have searched existing issues for similar reports.

jo3-l · 2024-07-16T06:41:49Z

The default dataset contains the pattern |fu|, which (correctly, but undesirably) matches on the -Fu in Kung-Fu. There are two potential ways we could fix this issue:

1. Remove the |fu| pattern entirely, or
2. Whitelist Kung-Fu and leave the |fu| pattern untouched.

I am leaning toward 2) at the moment: the |fu| pattern seems useful in general, and I cannot think of any egregious false positives other than the one you reported. What do you think?
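
To make the trade-off concrete, here is a toy sketch of the whitelist idea in plain JavaScript. It is not Obscenity's implementation: the regular expression is only a stand-in for the |fu| pattern, and `findSpans` and `hasMatch` are hypothetical helpers invented for this illustration. It assumes a blacklist match is ignored when it lies entirely inside an occurrence of a whitelisted term.

```javascript
// Toy model only; this does not use or mirror Obscenity's internals.

// Stand-in for the |fu| pattern: "fu" with a word boundary on both sides.
const blacklistPattern = /\bfu\b/gi

// Assumed whitelist entries for this illustration.
const whitelistedTerms = ['kung-fu', 'kung fu']

// Return [start, end) spans for every occurrence of `term` in `text`,
// compared case-insensitively (ASCII input assumed).
function findSpans(text, term) {
  const spans = []
  const haystack = text.toLowerCase()
  const needle = term.toLowerCase()
  let index = haystack.indexOf(needle)
  while (index !== -1) {
    spans.push([index, index + needle.length])
    index = haystack.indexOf(needle, index + 1)
  }
  return spans
}

// Report a match only if some blacklist hit is not fully covered
// by a whitelisted span.
function hasMatch(text, whitelist = whitelistedTerms) {
  const shielded = whitelist.flatMap((term) => findSpans(text, term))
  for (const match of text.matchAll(blacklistPattern)) {
    const start = match.index
    const end = start + match[0].length
    const covered = shielded.some(([s, e]) => s <= start && end <= e)
    if (!covered) return true
  }
  return false
}
```

In this sketch, `hasMatch('Kung-Fu', [])` reproduces the false positive because the hyphen counts as a word boundary, while `hasMatch('Kung-Fu')` with the default whitelist does not report a match. `KungFu` never matches in either case, since there is no boundary before the `F`.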

krasnoperov · 2024-07-16T13:45:31Z

I think that whitelisting Kung-Fu is a good option here. Also, it is possible to handle any future false positives by adding them to the whitelist as they arise.

jo3-l · 2024-07-17T03:52:36Z

I released v0.3.1 with the fix (please ignore v0.2.2 and v0.3.0, both of which were problematic due to my botching some release automation—sorry for the noise!). Thanks again for the report.

krasnoperov added the "bug" label on Jul 13, 2024

jo3-l closed this as completed in d60b4f4 Jul 17, 2024
