Skip to content

Elasticsearch connector improvements#14811

Merged
zhenxiao merged 8 commits intoprestodb:masterfrom
zhenxiao:elk-numeric
Jul 10, 2020
Merged

Elasticsearch connector improvements#14811
zhenxiao merged 8 commits intoprestodb:masterfrom
zhenxiao:elk-numeric

Conversation

@zhenxiao
Copy link
Collaborator

@zhenxiao zhenxiao commented Jul 8, 2020

== RELEASE NOTES ==

Elasticsearch Changes
* Support Elasticsearch numeric keyword
* Support composite publish_address in Elasticsearch
* Add configurations to improve concurrency in Elasticsearch

zhenxiao and others added 8 commits July 7, 2020 16:31
Cherry-pick of trinodb/trino@101d554

Co-authored-by:  Chun Han Hsiao <chhsiao90@gmail.com>
Starting with version 7.x, the publish_address field can contain
addresses of the following form:

    cname/ip:port
    ip:port

If the CNAME is present, the connector will now use the CNAME and port.
Otherwise, it will use the IP and port.

This change also validates that the address matches one of the above formats.

Cherry-pick of trinodb/trino@68c9392

Co-authored-by: Martin Traverso <mtraverso@gmail.com>
In that case, the connector only needs to produce pages with a row count. We leverage Elasticsearch's count API.

Cherry-pick of trinodb/trino@176d229

Co-authored-by: Martin Traverso <mtraverso@gmail.com>
Elastic Search connector exploited column properties map in
ColumnMetadata to store extra information needed for execution. Namely:
 - original colum name (not lower cased)
 - if column can be used for pushdown

Those were not registered in ColumnPropertyManager. That resulted in
failures when SHOW CREATE TABLE was executed for Elastic Search table.

We fix the issue by not storing extra properties in ColumnMetadata at
all. Instead we build Elatics Search ColumnHandle directly from Index
Metadata.

Cherry-pick of trinodb/trino@d5982fa

Co-authored-by: Łukasz Osipiuk <lukasz@osipiuk.net>
Cherry-pick of trinodb/trino@69804f4

Co-authored-by: Łukasz Osipiuk <lukasz@osipiuk.net>
Cherry-pick of trinodb/trino@d312077

Co-authored-by: Alex Albu <alex.albu@starburstdata.com>
@zhenxiao zhenxiao requested a review from highker July 8, 2020 20:07
Copy link

@highker highker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@zhenxiao zhenxiao merged commit 13ea2ea into prestodb:master Jul 10, 2020
@caithagoras caithagoras mentioned this pull request Jul 28, 2020
13 tasks
@zhenxiao zhenxiao deleted the elk-numeric branch August 2, 2020 05:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants