Faster execution of indexed STARTING WITH with UNICODE collation #6872

asfernandes · 2021-06-25T14:57:33Z

Initial execution time of indexed STARTING WITH lookup with UNICODE collation is very slow.

Test cases with timings using the debug build:

-- UTF8 without collation

recreate table t1 (c1 varchar(10) character set utf8)!
create index t1_idx on t1 (c1)!

execute block
as
    declare n integer = 0;
    declare v type of column t1.c1;
begin
    while (n < 100000)
    do
    begin
        select 1 from t1 where c1 starting with 'x' into v;
        n = n + 1;
    end
end!

-- Elapsed time = 0.422 sec

-- WIN1252 collation WIN_PTBR

recreate table t1 (c1 varchar(10) character set win1252 collate win_ptbr)!
create index t1_idx on t1 (c1)!

execute block
as
    declare n integer = 0;
    declare v type of column t1.c1;
begin
    while (n < 100000)
    do
    begin
        select 1 from t1 where c1 starting with 'x' into v;
        n = n + 1;
    end
end!

-- Elapsed time = 0.440 sec

-- UTF8 collation UNICODE

recreate table t1 (c1 varchar(10) character set utf8 collate unicode)!
create index t1_idx on t1 (c1)!

execute block
as
    declare n integer = 0;
    declare v type of column t1.c1;
begin
    while (n < 100000)
    do
    begin
        select 1 from t1 where c1 starting with 'x' into v;
        n = n + 1;
    end
end!

-- Elapsed time = 6.498 sec

The text was updated successfully, but these errors were encountered:

… UNICODE collation.

hvlad · 2021-07-29T22:31:06Z

Adriano, could you look at https://groups.google.com/g/firebird-support/c/VCXnWp0IZVw ?
It looks like another incarnation of this issue.

asfernandes · 2021-07-30T13:34:54Z

Adriano, could you look at https://groups.google.com/g/firebird-support/c/VCXnWp0IZVw ?
It looks like another incarnation of this issue.

This does not happen only with UTF8/UNICODE.

The problem with some characters is that they are part (start of) contractions that generate sort keys to order them in different place.

So the last character of a key that is the start of a contraction must be excluded from the key, otherwise the lookup will not work.

This issue is about how to verify that contractions faster than before.

javihonza · 2021-08-02T10:47:44Z

Hi,
unfortunately this fix did not solve the speed problem
https://groups.google.com/g/firebird-support/c/VCXnWp0IZVw

asfernandes · 2021-08-02T17:59:27Z

Hi,
unfortunately this fix did not solve the speed problem
https://groups.google.com/g/firebird-support/c/VCXnWp0IZVw

I'm verifying a way to improve this.

asfernandes · 2021-08-02T20:52:39Z

Hi,
unfortunately this fix did not solve the speed problem
https://groups.google.com/g/firebird-support/c/VCXnWp0IZVw

Created #6915 to track this problem.

… UNICODE collation.

asfernandes added affect-version: 3.0.7 affect-version: 4.0.0 affect-version: 5.0 Initial labels Jun 25, 2021

asfernandes self-assigned this Jun 25, 2021

asfernandes changed the title ~~Indexed STARTING WITH execution very slow with UNICODE collation~~ Indexed STARTING WITH execution is very slow with UNICODE collation Jun 25, 2021

asfernandes added the type: improvement label Jun 25, 2021

asfernandes added a commit that referenced this issue Jun 25, 2021

Improvement #6872 - Indexed STARTING WITH execution is very slow with…

d680aed

… UNICODE collation.

asfernandes added the fix-version: 5.0 Beta 1 label Jun 25, 2021

asfernandes closed this as completed Jun 25, 2021

pavel-zotov added the qa: done successfully label Jun 26, 2021

asfernandes added a commit that referenced this issue Jun 29, 2021

Improvement #6872 - Indexed STARTING WITH execution is very slow with…

b9d5ac0

… UNICODE collation.

asfernandes added the fix-version: 4.0.1 label Jun 29, 2021

asfernandes added a commit that referenced this issue Feb 16, 2022

Improvement #6872 - Indexed STARTING WITH execution is very slow with…

ca35fcd

… UNICODE collation.

asfernandes added a commit that referenced this issue Mar 16, 2022

Improvement #6872 - Indexed STARTING WITH execution is very slow with…

de49ddb

… UNICODE collation.

asfernandes added the fix-version: 3.0.10 label Mar 16, 2022

dyemanov changed the title ~~Indexed STARTING WITH execution is very slow with UNICODE collation~~ Faster execution of indexed STARTING WITH with UNICODE collation May 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Faster execution of indexed STARTING WITH with UNICODE collation #6872

Faster execution of indexed STARTING WITH with UNICODE collation #6872

asfernandes commented Jun 25, 2021 •

edited

Loading

hvlad commented Jul 29, 2021

asfernandes commented Jul 30, 2021

javihonza commented Aug 2, 2021

asfernandes commented Aug 2, 2021

asfernandes commented Aug 2, 2021

Faster execution of indexed STARTING WITH with UNICODE collation #6872

Faster execution of indexed STARTING WITH with UNICODE collation #6872

Comments

asfernandes commented Jun 25, 2021 • edited Loading

hvlad commented Jul 29, 2021

asfernandes commented Jul 30, 2021

javihonza commented Aug 2, 2021

asfernandes commented Aug 2, 2021

asfernandes commented Aug 2, 2021

asfernandes commented Jun 25, 2021 •

edited

Loading