Emoji is not supported when a profanity is found next to that character #71

rion18 · 2024-08-02T02:29:12Z

Expected behavior

Using obscenity to censor a string containing an emoji, like this one: 🤣bummer, and a dataset that contains the word bummer.

Using this strategy,

const CENSOR_STRATEGY = (censorContext) => ''.repeat(censorContext.matchLength);

for removing the profanities,

The expected output would be 🤣.

Actual behavior

Instead, the output is this: 🤣b. It matches the word bummer correctly, BUT when the matcher tries to find the matches, there's an error in the index.

Minimal reproducible example

const {
  englishDataset,
  parseRawPattern,
  DataSet,
  RegExpMatcher,
} = require('obscenity');

const data = new DataSet()
    .addAll(englishDataset)
    .addPhrase(phrase => 
      phrase
        .setMetadata({ originalWord: 'bummer' })
        .addPattern(parseRawPattern('bummer'))
    ).build();

const matcher = new RegExpMatcher({
    ...profanityDataset, // no transformers
  });

const stringBummer = '🤣bummer';
if (matcher.hasMatch(stringBummer)) {
  const matches = matcher.getAllMatches(stringBummer, true);
  return textCensor.applyTo(stringBummer, matches);
}
return stringBummer;

Steps to reproduce

No response

Additional context

No response

Node.js version

18.17.1

Obscenity version

0.3.1

Priority

Low
Medium
High

Terms

I agree to follow the project's Code of Conduct.
I have searched existing issues for similar reports.

The text was updated successfully, but these errors were encountered:

jo3-l · 2024-08-02T19:48:20Z

Thanks for the short repro. I think I know what the issue is and will take a stab at fixing it today.

jo3-l · 2024-08-02T22:52:00Z

Fix released in v0.4.0.

rion18 added the bug Something isn't working label Aug 2, 2024

jo3-l closed this as completed in 3a49579 Aug 2, 2024

jo3-l mentioned this issue Aug 6, 2024

Request: Symbols that can represent multiple letters #73

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Emoji is not supported when a profanity is found next to that character #71

Emoji is not supported when a profanity is found next to that character #71

rion18 commented Aug 2, 2024 •

edited

Loading

jo3-l commented Aug 2, 2024

jo3-l commented Aug 2, 2024

Emoji is not supported when a profanity is found next to that character #71

Emoji is not supported when a profanity is found next to that character #71

Comments

rion18 commented Aug 2, 2024 • edited Loading

Expected behavior

Actual behavior

Minimal reproducible example

Steps to reproduce

Additional context

Node.js version

Obscenity version

Priority

Terms

jo3-l commented Aug 2, 2024

jo3-l commented Aug 2, 2024

rion18 commented Aug 2, 2024 •

edited

Loading