Normalizer seems overzealous when stripping whitespace between elements #149

paulchaplin · 2014-06-18T13:02:28Z

My browser (Chrome 35) doesn't insert &nbsp; between e.g. foo bar when I've selected and formatted each word, so the normaliser eats the space when running setHTML(), giving the run-together "foobar" rather than the expected "foo bar".

I locally changed what equates to line 117 in src/normalizer.coffee from:

    html = html.replace(/\>\s+\</g, '><')

to:

    html = html.replace(/\>\s+\</g, '> <')

to normalise to a single space and solve my problem, although depending on what else that might be required for, it could make the line redundant due to normal HTML whitespace-collapsing.
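
To make the difference concrete, here is a small standalone JavaScript sketch that applies both replacements to the same markup. It is not Quill's actual code: the function names stripBetweenTags and collapseBetweenTags are made up for illustration, and only the two regexes come from the snippets above.

```javascript
// Hypothetical helpers for illustration only; they mirror the two regex
// replacements quoted above, not Quill's real normalizer.

// Current behaviour: whitespace between '>' and the next '<' is removed.
function stripBetweenTags(html) {
  return html.replace(/>\s+</g, '><');
}

// Proposed behaviour: the same whitespace is collapsed to a single space.
function collapseBetweenTags(html) {
  return html.replace(/>\s+</g, '> <');
}

const input = '<b>foo</b> <i>bar</i>';
console.log(stripBetweenTags(input));    // <b>foo</b><i>bar</i>
console.log(collapseBetweenTags(input)); // <b>foo</b> <i>bar</i>
```

Note that the collapsing variant also leaves a single space where indentation sat between block-level tags (for example between a div and the p nested inside it), so whether it is acceptable probably depends on how the editor's CSS treats that space.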

Is there another (better) way to solve this, and prevent genuine spaces from being removed, without manually inserting &nbsp;?

Kilian · 2014-06-19T10:41:27Z

+1

jhchen · 2014-06-26T06:11:54Z

Just to clarify, this is limited to the setHTML call only, correct? If you have two words "one two" and bold one and italicize two through the UI, the space remains (at least this is the behavior I'm seeing on my Chrome).

paulchaplin · 2014-06-26T08:44:50Z

That's correct -- there's no InsertOp entry for the space (since it's pre-stripped), so the HTML is set without it, and getHTML naturally then returns the editor's contents without it.

bjnsn · 2014-10-16T15:50:18Z

+1

bjnsn · 2014-10-16T16:10:11Z

If I understand the purpose of the whitespace stripping, a more complex RegExp that avoids stripping space between inline elements might work.

Instead of this (in Normalizer.stripWhitespace):

    html = html.replace(/>\s+</g, '><');

Use this:

    html = html.replace(/>\s+<(?!\s*(?:b|big|i|small|tt|abbr|acronym|cite|code|dfn|em|kbd|strong|samp|var|a|bdo|br|img|map|object|q|script|span|sub|sup|button|input|label|select|textarea)\b)/g, '><');

In that example, there is a negative lookahead which skips any cases that match these tags noted by MDN: https://developer.mozilla.org/en-US/docs/Web/HTML/Inline_elemente#Elements
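
A runnable sketch of the same lookahead idea, for anyone who wants to try it outside Quill. The names INLINE_TAGS and stripWhitespaceKeepInline are hypothetical, and the tag list is simply the one quoted above. The pattern ends the lookahead with `\b` so that a short name such as b cannot match the start of a longer one such as blockquote.

```javascript
// Illustration only, not Quill's implementation. Tag names are taken from
// the list in this comment.
const INLINE_TAGS = [
  'a', 'abbr', 'acronym', 'b', 'bdo', 'big', 'br', 'button', 'cite', 'code',
  'dfn', 'em', 'i', 'img', 'input', 'kbd', 'label', 'map', 'object', 'q',
  'samp', 'script', 'select', 'small', 'span', 'strong', 'sub', 'sup',
  'textarea', 'tt', 'var',
];

// Match whitespace between '>' and '<' unless the '<' opens an inline tag.
const BETWEEN_TAGS = new RegExp(
  '>\\s+<(?!\\s*(?:' + INLINE_TAGS.join('|') + ')\\b)',
  'g'
);

function stripWhitespaceKeepInline(html) {
  return html.replace(BETWEEN_TAGS, '><');
}

// Space before an inline tag survives:
console.log(stripWhitespaceKeepInline('<b>foo</b> <i>bar</i>'));
// <b>foo</b> <i>bar</i>

// Indentation around block tags is still removed:
console.log(stripWhitespaceKeepInline('<div>\n  <p>one</p>\n</div>'));
// <div><p>one</p></div>
```

One limitation of this sketch: it only inspects the tag that follows the whitespace, so whitespace sitting directly before a closing tag (inline or not) is still stripped.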

thomsbg · 2015-11-24T01:38:13Z

I think this issue needs to be re-opened. Unfortunately, simply patching getHTML() to insert &nbsp; as appropriate is not sufficient to support copy / paste. See the following repro steps:

1. Visit http://quilljs.com/
2. Select the space between the words Quill and Rich in the 1st line
3. Use the toolbar or the keyboard to apply the bold format
4. Select all text, cut and then paste over the selected contents
5. Notice that the space between Quill and Rich has disappeared

While these repro steps may seem a bit convoluted (who would apply formatting to just whitespace?), in practice this happens often while editing rich text, especially when the authorship module is enabled (which frequently wraps runs of pure whitespace in   tags).

One could probably patch the paste-manager module to handle this case correctly, but I must admit I am confused why stripWhitespace is needed at all... it seems better to have Document#setHTML not require &nbsp; between inline tags to work correctly.

Perhaps Normalizer.stripWhitespace could replace runs of whitespace with a single space, rather than the empty string? Or simply not be used at all.

I need to fix this issue on my fork before 1.0 is released. Which path forward do you anticipate would be most compatible with the 1.0 branch?

jhchen · 2015-11-24T05:17:08Z

Normally HTML dictates that consecutive whitespace be collapsed into one space. To explicitly display more than one whitespace character you can use HTML entities such as &nbsp; for spaces. But all browsers insert a space, not an HTML entity, when you hit the spacebar on a contenteditable element. This could be replaced immediately by Quill, but it's always tricky to replace content around where the cursor is, so ideally the editor could just work with the native browser behavior. The CSS white-space property allows this and also has the benefit of showing tab characters properly, and this is currently the path Quill chooses to take.

The only issue introduced by this solution is when HTML is written for human readability (including tabs and newlines for nested HTML tags) it's surprising to discover those characters would actually show up. Also copying and pasting the example docs code to try out Quill (which does include nicely indented HTML) is a use case I want to support without the aforementioned surprise. So the quick solution was to get rid of whitespace between tags which is proving to not be such a robust solution.
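
A rough way to see the trade-off described above is to model what happens to whitespace under two values of the CSS white-space property. This is a deliberately simplified sketch, not browser or Quill code: the function name visibleWhitespace is made up, and real layout has further rules (for example at the start and end of lines) that are ignored here.

```javascript
// Simplified model of whitespace handling for a run of text under two
// values of the CSS white-space property. Illustration only.
function visibleWhitespace(text, whiteSpace) {
  if (whiteSpace === 'pre-wrap') {
    return text; // spaces, tabs and newlines are all preserved
  }
  // 'normal': runs of spaces, tabs and line breaks collapse to one space
  return text.replace(/[ \t\r\n\f]+/g, ' ');
}

const typed = 'two  spaces'; // two plain spaces, as typed with the spacebar
const indented = '\n    ';   // indentation between tags in hand-written HTML

console.log(JSON.stringify(visibleWhitespace(typed, 'normal')));      // "two spaces"
console.log(JSON.stringify(visibleWhitespace(typed, 'pre-wrap')));    // "two  spaces"
console.log(JSON.stringify(visibleWhitespace(indented, 'pre-wrap'))); // "\n    "
```

The second line is the behavior that lets the editor keep what the user typed; the third is the surprise with indented markup, since the newline and indentation would show up in the document unless something strips them first.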

I'm willing to believe a more complex regex could be the solution (which I'll explore for 1.0) but if it's the case that you're saving/loading Quill's content through the API and not with nicely indented HTML in the markup then the best and easiest solution is probably to get rid of stripWhitespace in the editor.

fracz · 2016-05-03T19:36:58Z

You can override this behaviour in your application. No need to modify the sources.

    normalizer = Quill.require('normalizer')
    # Keep a reference to the original implementation
    stripWhitespace = normalizer.stripWhitespace
    normalizer.stripWhitespace = (html) ->
      stripped = stripWhitespace(html)
      # Put a single space back between adjacent tags
      stripped.replace(/></g, '> <')
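
For readers not using CoffeeScript, here is the same wrapping technique in plain JavaScript, run against a stand-in object rather than the real normalizer. The stand-in's stripWhitespace is an assumption modelled on the regex quoted earlier in this thread; the real function may do more.

```javascript
// Stand-in for the object returned by Quill.require('normalizer').
// ASSUMPTION: it only strips whitespace between tags, as quoted earlier.
const normalizer = {
  stripWhitespace(html) {
    return html.replace(/>\s+</g, '><');
  },
};

// Wrap the original: keep its behaviour, then put one space between
// every pair of adjacent tags.
const originalStrip = normalizer.stripWhitespace;
normalizer.stripWhitespace = function (html) {
  const stripped = originalStrip.call(normalizer, html);
  return stripped.replace(/></g, '> <');
};

console.log(normalizer.stripWhitespace('<b>foo</b>   <i>bar</i>'));
// <b>foo</b> <i>bar</i>
console.log(normalizer.stripWhitespace('<div><b>foo</b></div>'));
// <div> <b>foo</b> </div>
```

The second call shows a side effect worth knowing about: because the original whitespace is already gone by the time the wrapper runs, a space is added between every pair of adjacent tags, including ones that had none in the source.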

bjnsn pushed a commit to bjnsn/quill that referenced this issue Oct 17, 2014

Fix for whitespace within inline tags being removed. See: slab#149

37379a4

bjnsn pushed a commit to bjnsn/quill that referenced this issue Oct 17, 2014

Fix for whitespace within inline tags being removed. See: slab#149

0daf22f

jhchen mentioned this issue Oct 20, 2014

Fix for whitespace within inline tags being removed. See: https://github... #230

Closed

bjnsn mentioned this issue Oct 21, 2014

Fix over zealous whitespace removal #233

Closed

jhchen closed this as completed in 23e3a65 Nov 6, 2014

thomsbg mentioned this issue Dec 4, 2014

Add a remove button to the link tooltip #256

Merged

ripper17 mentioned this issue Feb 19, 2016

Whitespaces between tags (Firefox, InternetExplorer) #585

Closed
