Skip to content

Add release notes for 0.239#14908

Merged
caithagoras merged 1 commit intoprestodb:masterfrom
mayankgarg1990:release-notes-0.239
Aug 7, 2020
Merged

Add release notes for 0.239#14908
caithagoras merged 1 commit intoprestodb:masterfrom
mayankgarg1990:release-notes-0.239

Conversation

@caithagoras
Copy link
Contributor

@caithagoras caithagoras commented Jul 28, 2020

Missing Release Notes

Leiqing Cai

Nikhil Collooru

Ravion

  • 5d16273 Added documentation for Presto on Spark

Rebecca Schlussel

Tim Meehan

Weidong Duan

Wenlei Xie

Zhenxiao Luo

Zhi Wen

tgorthi

Extracted Release Notes

  • Aggregation ORDER BY & DISTINCT spilling #14527 (Author: Saksham Sachdev): Aggregation ORDER BY & DISTINCT spilling
    • Add local disk spilling support for aggregation functions with ORDER BY or DISTINCT syntax.
  • Add allowed roles for HTTP endpoints #14632 (Author: Zhi Wen): Add allowed roles for HTTP endpoints
    • Specify allowed roles for HTTP endpoints.
  • Support common sub-expression optimization in CursorProcessorCompiler #14696 (Author: frank.hu): Support common sub-expression optimization in CursorProcessorCompiler
    • Add optimization for cursor projection & filter by extract and compute common subexpressions among all projections & filter first. This optimization can be turned off by session property optimize_common_sub_expressions.
  • Adding Pinot SQL endpoint support #14704 (Author: Xiang Fu): Adding Pinot SQL endpoint support
    • Add Pinot SQL endpoint support.
  • Fix confusion matrix computation for classification function #14740 (Author: fornaix): Fix confusion matrix computation for classification function
    • Fix :func:classification_miss_rate and :func:classification_fall_out functions (:pr:14740).
  • Add invoker security mode for views #14743 (Author: Sanket Dige): Add invoker security mode for views
    • Add support for 2 security modes for views. The default DEFINER security mode is the same as the previous behavior. Tables referenced in the view are accessed using the permissions of the view owner (the creator or.
    • Definer** of the view) rather than the user executing the query. In the INVOKER security mode, tables referenced in the view are accessed using the permissions of the query user (the invoker of the view).
  • Revert "Reliable Resource Groups with versioning" #14752 (Author: Swapnil Tailor): Revert "Reliable Resource Groups with versioning"
    • Reverting Reliable Resource Group Versioning.
  • Implement PrestoDatabaseMetaData getClientInfoProperties #14753 (Author: Adam J. Shook): Implement PrestoDatabaseMetaData getClientInfoProperties
    • Implemented DatabaseMetaData.getClientInfoProperties API.
  • Fix thread listing under heavy workloads (thread churn) #14754 (Author: Partha Kanuparthy): Fix thread listing under heavy workloads (thread churn)
    • Fix NPE in /v1/thread end point.
  • Fix thread snapshot UI due to variable initialization #14775 (Author: Partha Kanuparthy): Fix thread snapshot UI due to variable initialization
    • The worker page's thread snapshot UI does not work (no stack trace displayed on click) when there is active query load (tested under Chrome). This patch fixes an uninitialized variable in client JS that was causing this UI behavior.
  • Fix query completion events for non-dispatched queries #14784 (Author: Tim Meehan): Fix query completion events for non-dispatched queries
    • Fix missing query completion events for queries which fail prior to dispatching.
  • Partition filtering warning #14800 (Author: Ariel Weisberg): Partition filtering warning
    • Add support for warning on unfiltered partition keys using partition-keys-to-warn-on-no-filtering system property.
  • Elasticsearch connector improvements #14811 (Author: Zhenxiao Luo): Elasticsearch connector improvements
    • Support Elasticsearch numeric keyword.
    • Support composite publish_address in Elasticsearch.
    • Add configurations to improve concurrency in Elasticsearch.
  • Add support of caching for glue metastore #14820 (Author: Adli Mousa): Add support of caching for glue metastore
    • Add support for caching the Glue metastore.
  • Move DistinctLimitNode to spi #14831 (Author: Xiang Fu): Move DistinctLimitNode to spi
    • Move DistinctLimitNode to presto-spi module for connectors to push down.
  • Implement ORDER BY spilling #14836 (Author: Saksham Sachdev): Implement ORDER BY spilling
    • Add local disk spilling support for ORDER BY syntax.
  • Add aggregation function SET_UNION #14842 (Author: prithvip): Add aggregation function SET_UNION
    • Add aggregation function SET_UNION.
  • Support evaluating min/max only metadata query #14845 (Author: Shixuan Fan): Support evaluating min/max only metadata query
    • Add support to optimize min/max only metadata query. This is controlled by existing config optimizer.optimize-metadata-queries and session property optimize_metadata_queries. Note that enabling this config/session property might change query result if there are metadata that refers to empty data, e.g. empty hive partition.
  • Honor ignore_stats_calculator_failures in getTableStatistics #14849 (Author: Rebecca Schlussel): Honor ignore_stats_calculator_failures in getTableStatistics
    • Fix an issue where the property ignore_stats_calculator_failures would not be honored for certain queries that had filters pushed down to the table scan.
  • Upgrade JTS to 1.17.0 #14857 (Author: James Gill): Upgrade JTS to 1.17.0
    • Update JTS to 1.17.0. This changes the implementation of ST_Buffer: the output might change by a small (1e-10) amount.
  • Pushdown DistinctLimitNode in Pinot Connector #14863 (Author: Xiang Fu): Pushdown DistinctLimitNode in Pinot Connector
    • Pushdown DistinctLimitNode to Pinot Query in SQL mode.
  • Fix infinite loop in non-legacy SqlQueryScheduler #14879 (Author: Rebecca Schlussel): Fix infinite loop in non-legacy SqlQueryScheduler
    • Fix potential infinite loop when the setting use_legacy_scheduler is set to false.
  • Enable async page transport by default #14880 (Author: Vic Zhang): Enable async page transport by default
    • Enable async page transport with non-blocking IO by default. This can be disabled by setting exchange.async-page-transport-enabled configuration property to false.
  • Unify grouped execution properties #14886 (Author: Rebecca Schlussel): Unify grouped execution properties
    • Enable dynamic-schedule-for-grouped-execution by default. In future releases, we will remove this property, and grouped execution will always use dynamic scheduling.
    • Enable grouped-execution-for-aggregation and experimental.grouped-execution-for-eligible-table-scans by default.
    • Introduce new configuration property grouped-execution-enabled and session property grouped_execution to turn grouped execution on or off. This property is true by default. If set to false, it is equivalent to setting all of grouped-execution-for-aggregation, grouped-execution-for-join, and experimental.grouped-execution-for-eligible-table-scans to false. In future releases we will remove these other properties and only have a single switch for enabling and disabling grouped execution.
  • Revert "Support grouped execution for eligible table scans" #14900 (Author: Rebecca Schlussel): Revert "Support grouped execution for eligible table scans"
    • Remove experimental feature to perform grouped execution for eligible table scans and its associated configuration property experimental.grouped-execution-for-elligible-table-scans and session property grouped_execution_for_eligible_table_scans.

All Commits

  • 6d5cdaa Increase default size for http timeout executors (Vic Zhang)
  • c963c79 Enable async page transport by default (Vic Zhang)
  • ef4b537 Support evaluating min/max only metadata query (Shixuan Fan)
  • b1fc202 Remove unused field in MetadataQueryOptimizer (Shixuan Fan)
  • 259f461 Allow EnforceSingleRowNode for predicate extractor (Shixuan Fan)
  • 5944e15 Push expression translation above MetadataQueryOptimizer (Shixuan Fan)
  • 03e80fc Revert "Support grouped execution for eligible table scans" (Rebecca Schlussel)
  • ee0a3af Fetch results eagerly from coordinator (Tim Meehan)
  • 7cb1d7a Unify grouped execution properties (Rebecca Schlussel)
  • c8152d8 Enable grouped execution by default (Rebecca Schlussel)
  • f38828f Don't try grouped execution for forceSingleNode (Rebecca Schlussel)
  • 82cd9bc Minor refactor to RemoteSourceFactory (Wenlei Xie)
  • 57c24f1 Fix flaky testRuntimeOptimizedStagesCorrectness (Peizhen Guo)
  • 9ae3a21 Clean up KdbTree node intersection logic (James Gill)
  • 7e3d217 Upgrade JTS to 1.17.0 (James Gill)
  • 23082e8 Fix infinite loop in scheduler for finished query (Rebecca Schlussel)
  • 6f01b1e Remove extra new line (Rebecca Schlussel)
  • 81f6349 Handle NaN in Parquet statistics (Zhenxiao Luo)
  • 4c2b8c2 Add aggregation function SET_UNION (prithvip)
  • 392ec92 Close spiller on Operator#close for ORDER BY spilling (Saksham Sachdev)
  • e418fe7 Extract order by queries tests to separate class (Saksham Sachdev)
  • 4722c75 Convert revocable memory to user memory on OrderBy finish (Saksham Sachdev)
  • 1a56a5e Use OrderingCompiler in OrderBy spilling (Saksham Sachdev)
  • 9d11bdb Add Spill To Disk for ORDER BY (Saksham Sachdev)
  • 37f6c26 Use WorkProcessor in OrderByOperator (Saksham Sachdev)
  • 3b7f846 Extract DummySpillerFactory from TestHashAggregationOperator (Saksham Sachdev)
  • 4d8ead5 Produce more than single page in testHashAggregation (Saksham Sachdev)
  • 93255a2 Allow memory revoke only during operator finish phase (Saksham Sachdev)
  • 9b50094 Revoke memory after initial output page has been produced in tests (Saksham Sachdev)
  • 6a55eb4 Adding DistinctLimit pushdown support in Pinot Connector (Xiang Fu)
  • 8045dbe Collect statistics of files committed by OrcFileWriter (Nikhil Collooru)
  • 89632d3 Improve Alluxio cache documentation (Bin Fan)
  • 9831c83 support caching for glue metastore (Adli Mousa)
  • 6354727 Add getSplitSizeInBytes in SPI ConnectorSplit (Vic Zhang)
  • d35acde Add properties for Presto-on-Spark split distribution (Vic Zhang)
  • 58585b2 Add test checking QueryCompletedEvent (Peizhen Guo)
  • ae5770a Add runtimeOptimizedStages in QueryCompletedEvent (Peizhen Guo)
  • 2cb629e Make maxConcurrency optional in ConcurrentExecutionPhase (Leiqing Cai)
  • 4514b99 Refactor PhaseExecutor (Leiqing Cai)
  • a4e7a78 Allow session properties to be specified with queries (Leiqing Cai)
  • f8cfad6 Refactor BenchmarkSuite (Leiqing Cai)
  • 515ccf9 Fix PhaseSpecification (Leiqing Cai)
  • c61baf4 Make benchmark runner extensible (Leiqing Cai)
  • af8058a Make BenchmarkSuiteSupplier extensible (Leiqing Cai)
  • b743588 Specify allowed roles for HTTP endpoints (Zhi Wen)
  • f28613e Move authentication support from Presto to Airlift (Zhi Wen)
  • 3469944 Refactor userAgent handle in Presto-on-Spark (Wenlei Xie)
  • 9d2efe6 Allow export Spark queue in Presto-on-Spark query event (Wenlei Xie)
  • 38ee80c Allow customizing clientInfo and userAgent in Presto-on-Spark (Wenlei Xie)
  • 4f1b626 Upgrade airlift to 0.193 and discovery-server to 1.32 (Zhi Wen)
  • bad48ac Pass user info into PrestoSparkRunner (Wenlei Xie)
  • 7d9d8dc Change TableStatistics constructor to private (Peizhen Guo)
  • 1ef8df5 Remove DistributionStat for PeakRunningTasks (Vic Zhang)
  • 6e98e41 Bugfixing: escape pinot query string before set it into json request (Xiang Fu)
  • 9499923 Fix subfield pruning to be enabled by featureConfig (Mayank Garg)
  • bfc5ea8 Avoid creating tiny splits at the end of block boundaries (James Petty)
  • 91b058e Add invoker security mode for views (Sanket Dige)
  • d7949f7 Remove unused PushdownFilterResult (James Sun)
  • 2d2693f Fix failure during optimization of LIKE expression (Rebecca Schlussel)
  • 9e012a2 Don't ignore stats failures in tests (Rebecca Schlussel)
  • aa049cc Honor ignore_stats_calculator_failures in getTableStatistics (Rebecca Schlussel)
  • 589325b Support Pinot new SQL endpoint for broker queries. (Xiang Fu)
  • 573b8d1 Fix configuration for Alluxio metastore module (David Zhu)
  • f5868d4 Explicitly not supporting external functions in lambda and join filter (Rongrong Zhong)
  • b8f4c99 Add rule to rewrite filter with remote function to project (Rongrong Zhong)
  • 4aa2ded Plan projection with remote functions (Rongrong Zhong)
  • 2608642 Add Locality to ProjectNode (Rongrong Zhong)
  • 0890b3d Invoke CBO at SqlQueryScheduler for Join Swapping (Peizhen Guo)
  • d86aeee Add a session property for runtime optimizer (Peizhen Guo)
  • 3cf8069 Support return type coercion in CREATE FUNCTION (tgorthi)
  • 19228af Warn on configured unfiltered partitions (Ariel Weisberg)
  • 816e977 Support for checking partition key filter coverage (Ariel Weisberg)
  • 7cc8bd3 Rename PlanSanityChecker to PlanChecker (Ariel Weisberg)
  • 4f2f854 Introduce CacheQuotaRequirementProvider (Ke Wang)
  • a63cb8a Allow configurable parser option in Presto-on-Spark (Wenlei Xie)
  • b14f68a Do not prune duplicate ConstantExpression (Rongrong Zhong)
  • 2869400 Fix bad verify check in DwrfEncryptionInfo (Rebecca Schlussel)
  • 670ecb5 Allow query info to be stored in a file upon query finish (Andrii Rosa)
  • 6e98dd1 Add ability to register event listener in Presto on Spark (Andrii Rosa)
  • 3f69d1f Implement QueryMonitor callbacks for Presto on Spark (Andrii Rosa)
  • 2f0575c Collect TaskInfo for Presto on Spark tasks (Andrii Rosa)
  • 323592f Prepare to collect TaskInfo from Presto on Spark tasks (Andrii Rosa)
  • 41483c3 Apply compression to Presto on Spark splits (Andrii Rosa)
  • 829b667 Fix inconsistent session value from both server and client side (Weidong Duan)
  • 20a1726 Fix query completion events prior to execution (Tim Meehan)
  • 3177a90 Skip isSplittable reflection for RCFileInputFormat (Rohit Jain)
  • 058bda4 Include scheduling stats of bucketed table to NodeSelectionStats (Ke Wang)
  • dece3cd Fix bytecode generation for SQL functions (prithvip)
  • bd300f7 Move DistinctLimitNode to spi (Xiang Fu)
  • 728b8b0 Fix document in Presto-on-Spark (Wenlei Xie)
  • 3943983 fix druid incorrect real data value for DruidBrokerPageSource (Weidong Duan)
  • 5d16273 Added documentation for Presto on Spark (Ravion)
  • 84b7b0a Allow DictionaryBlock for MinMaxBy aggregation (Vic Zhang)
  • 13ea2ea Add documentation for Elasticsearch connector concurrency configs (Zhenxiao Luo)
  • 0e4e646 Improve concurrency for Elasticsearch calls (Zhenxiao Luo)
  • 93f7843 Enable testShowCreateTable for Elasticsearch connector (Zhenxiao Luo)
  • 277da41 Remove fake properties from Elastic Search column metadata (Zhenxiao Luo)
  • 15ff958 Optimize Elasticsearch queries that fetch no columns (Zhenxiao Luo)
  • d58624d Fix ElasticsearchClient package name (Zhenxiao Luo)
  • 2cafb69 Support composite publish_address in Elasticsearch (Zhenxiao Luo)
  • 25b636f Support elasticsearch numeric keyword (Zhenxiao Luo)
  • e182413 Fix reading encrypted streams at offset 0 (Rebecca Schlussel)
  • 6143044 Avoid fetching encryption information when no columns requested (Mayank Garg)
  • 86a3cb0 Implement additional serialization methods for Spark shuffle (Andrii Rosa)
  • 56f9022 Fix file stats for encryption groups (Rebecca Schlussel)
  • 443d105 Fix encryption with dictionary encodings (Rebecca Schlussel)
  • 863a534 Fix dwrf decryption for empty files (Rebecca Schlussel)
  • a54efd9 Minor cleanup in TestDecryption (Rebecca Schlussel)
  • 4a3d46c Use the same RowExpressionCompiler in CursorProcessorCompiler (Rongrong Zhong)
  • b8b3059 Attach user and query ID to JDBC SQL strings (Naveen Cherukuri)
  • e22ebb0 Add documentation for driver.max-page-partitioning-buffer-count (Ying Su)
  • bbda157 Run PruneRedundantProjectionAssignments after RowExpression change (Rongrong Zhong)
  • f2b3910 Make TestElasticsearchIntegrationSmokeTest single threaded (Zhenxiao Luo)
  • 92c3d3c Change getSupportedColumnStatistics for temp table (Peizhen Guo)
  • 7fdb787 Fix flaky testAndInFilter test (Rebecca Schlussel)
  • b9cde3a Add support for column names with spaces to SubfieldTokenizer (Masha Basmanova)
  • 57428cc Add a test for Presto-on-Spark (Wenlei Xie)
  • 2f5d368 adding dist code (Partha Kanuparthy)
  • 22b9d9c Fix thread snapshot UI due to variable initialization (Partha Kanuparthy)
  • 7b71284 Revert "Fix thread snapshot UI due to variable initialization" (Partha Kanuparthy)
  • 9796999 Fix thread snapshot UI due to variable initialization (Partha Kanuparthy)
  • 709d229 fix for thread listing under heavy workloads (Partha Kanuparthy)
  • ee71bf5 Allow configure SQL invoked function implementation type (Rongrong Zhong)
  • 54f7e41 Add support for DWRF encryption without compression (Rebecca Schlussel)
  • a543417 Pass DwrfWriterEncryption into OrcWriter (Rebecca Schlussel)
  • c561c50 Write support for DWRF encryption at rest (Mayank Garg)
  • 9f89256 Add TestingHiveEventListener (James Sun)
  • 64e1f90 Fix query completion events for non-dispatched queries (Tim Meehan)
  • a101186 Bump zookeeper from 3.4.13 to 3.4.14 (Rebecca Schlussel)
  • dc2a10a Set sequenceId for dwrf columnEncodings (Rebecca Schlussel)
  • 1f5a29c Fix stats for dwrf encryption (Rebecca Schlussel)
  • 6b51cbf Refactor buffer max capacity calculation (Ying Su)
  • 91cb2ec Allow additional error margin for estimatedMaxCapacity (Ying Su)
  • cfcf392 Always make space for nullsBuffer and hashTablesBuffer (Ying Su)
  • 724e1f9 Add tests for max buffer capacity estimation (Ying Su)
  • c5b8979 Fix estimatedSerializedSizeInBytes for RLE and Dictionary Blocks (Ying Su)
  • 184c000 Fix getLogicalSizeInBytes() for Blocks (Ying Su)
  • 33bca28 Remove childrenEstimatedSerializedSizeInBytes from DecodedBlockNode (Ying Su)
  • ea35ddf Fix serialized size estimation in BlockEncodingBuffers (Ying Su)
  • 3f7dca0 Implement ORDER BY and DISTINCT spilling for aggregation (Saksham Sachdev)
  • 53e23a1 Clean up create geometry calls (James A. Gill)
  • c899dc4 Remove unneeded Esri NaN writing (James A. Gill)
  • 4803c01 Refactor case statement to use enums (James Gill)
  • 4cad3f8 Parquet: Handle missing struct columns as part of the nested column pruning (Venki Korukanti)
  • 5aef7a4 Implement PrestoDatabaseMetaData getClientInfoProperties (Adam J. Shook)
  • 83d37f7 Remove workaround for ConcurrentMap.compute (Rebecca Schlussel)
  • 1bc3803 Remove unused methods (Rebecca Schlussel)
  • e72fe86 Use byte[] instead of slice for EncryptionLibrary (Rebecca Schlussel)
  • da1ccfa Fix reading and writing of column encodings for DWRF (Rebecca Schlussel)
  • 168a0e9 Fix encryption/decryption of encrypted metadata (Rebecca Schlussel)
  • 583ee11 Decrypt separately from compression (Rebecca Schlussel)
  • 21f4cd2 Generate DataEncryptionKeys in OrcWriter (Rebecca Schlussel)
  • d72661f Support common sub-expression optimization in CursorProcessorCompiler (frank.hu)
  • d60030f Revert "Reliable Resource Groups with versioning" (Swapnil Tailor)
  • f88def2 Fix artificially high queued query metrics (Jeremy DeGroot)
  • 9e77bf2 Turn off failure detector by default in tests (Rebecca Schlussel)
  • d80fc51 Fix confusion matrix computation for classification function (fornaix)
  • e345254 Make TestRaptorIntegrationSmokeTestMySql single threaded (James Sun)
  • 02e0460 Add documentation for a deployment example (Adam J. Shook)
  • aabe214 Report field name when decoding Elasticsearch value fails (Zhenxiao Luo)
  • 31e6bc0 Add support for nested types in Elasticsearch (Zhenxiao Luo)
  • d136b07 Fix error message typo in Elasticsearch decoders (Zhenxiao Luo)
  • 997ed9f Handle empty object field in Elasticsearch (Zhenxiao Luo)
  • b7f6c00 Add Elasticsearch array support using definitions in the _meta field (Zhenxiao Luo)
  • d389051 Add option to ignore Elasticsearch publish_address (Zhenxiao Luo)
  • acd5590 Iterate over Elasticsearch documents in index order (Zhenxiao Luo)
  • 05f5f46 Refresh Elasticsearch index before test queries (Zhenxiao Luo)
  • e2772b0 Add support for querying Elasticsearch aliases (Zhenxiao Luo)
  • 1785a94 Use filter clause for Elasticsearch queries (Zhenxiao Luo)
  • 38d8f40 Fix predicate pushdown for Elasticsearch (Zhenxiao Luo)
  • b825bdd Handle mixed-case columns in Elasticsearch (Zhenxiao Luo)
  • d5ab359 Add support for datetimes exposed as numbers in Elasticsearch (Zhenxiao Luo)
  • 54879c5 Add support for AWS IAM authorization to Elasticsearch connector (Zhenxiao Luo)
  • 4e42579 Move ElasticsearchClient to client package (Zhenxiao Luo)
  • 2824ab2 Support nodes with no public http address (Zhenxiao Luo)
  • 03c26f4 Use absolute paths for Elasticsearch requests (Zhenxiao Luo)
  • 83aea2f Add system.nodes table to Elasticsearch (Zhenxiao Luo)
  • 5da00ca Move IndexMetadata, NodesResponse, SearchShardsResponse, Shard, ElasticsearchNode into Elasticsearch client directory (Zhenxiao Luo)
  • dd0e2e6 Refresh Elasticsearch nodes periodically (Zhenxiao Luo)
  • ba21188 Make Elasticsearch retry timeout configurable (Zhenxiao Luo)
  • 639897b Add support for Elasticsearch query string syntax (Zhenxiao Luo)
  • 7f56aab Adjust Elasticsearch timeout defaults (Zhenxiao Luo)
  • d914af4 Load tables dynamically in Elasticsearch (Zhenxiao Luo)
  • ca40434 bulk load elasticsearch test data (Zhenxiao Luo)
  • 0cbc76f Update Elasticsearch connector to use Elasticsearch Http client (Zhenxiao Luo)
  • 24d807a Move metadata specific methods to ElasticsearchMetadata (Zhenxiao Luo)
  • 017661c Embed index and type in Elasticsearch connector handle (Zhenxiao Luo)
  • 03c8f38 Use shard primary host in Elasticsearch connector to save extra hop in Elasticsearch (Zhenxiao Luo)
  • f8bded1 Encapsulate logic for fetching shards (Zhenxiao Luo)
  • 00d45bd inline method for Elasticsearch getSearchShards() (Zhenxiao Luo)
  • cc6be46 Remove indexExactMatch from Elasticsearch connector (Zhenxiao Luo)
  • 42fd681 Simplify logic for Elasticsearch connector getSearchShards (Zhenxiao Luo)
  • bd4a17f Remove unused parameter in Elasticsearch getSearchShards() (Zhenxiao Luo)
  • 3f7665a clear Elasticsearch scroll when connector query close (Zhenxiao Luo)
  • dd5bc6c Make ElasticsearchQueryBuilder stateless (Zhenxiao Luo)
  • 25b1792 Cache Elasticsearch clients (Zhenxiao Luo)
  • c5c5ce5 Make Elasticsearch config catalog wide (Zhenxiao Luo)
  • f92194a Rename ElasticSearchConnectorConfig to ElasticSearchConfig (Zhenxiao Luo)
  • 8e34f48 Fix ObjectMapper typo in ElasticsearchClient (Zhenxiao Luo)
  • 83931c6 Revert "Fix ObjectMapper typo in ElasticsearchClient" (Zhenxiao Luo)
  • fa5f78b Revert "Rename ElasticSearchConnectorConfig to ElasticSearchConfig" (Zhenxiao Luo)
  • 8367c65 Revert "Make Elasticsearch config catalog wide" (Zhenxiao Luo)
  • ca6a8bf Revert "Cache Elasticsearch clients" (Zhenxiao Luo)
  • 911db66 Revert "Make ElasticsearchQueryBuilder stateless" (Zhenxiao Luo)
  • 60e7b59 Revert "clear Elasticsearch scroll when connector query close" (Zhenxiao Luo)
  • c8bb762 Revert "Remove unused parameter in Elasticsearch getSearchShards()" (Zhenxiao Luo)
  • 091933f Revert "Simplify logic for Elasticsearch connector getSearchShards" (Zhenxiao Luo)
  • 890782e Revert "Remove indexExactMatch from Elasticsearch connector" (Zhenxiao Luo)
  • 31e4ab9 Revert "inline method for Elasticsearch getSearchShards()" (Zhenxiao Luo)
  • 738be0b Revert "Encapsulate logic for fetching shards" (Zhenxiao Luo)
  • 326dc71 Revert "Use shard primary host in Elasticsearch connector to save extra hop in Elasticsearch" (Zhenxiao Luo)
  • 6eba421 Revert "Embed index and type in Elasticsearch connector handle" (Zhenxiao Luo)
  • 54f1cbb Revert "Move metadata specific methods to ElasticsearchMetadata" (Zhenxiao Luo)
  • 59154cf Revert "Update Elasticsearch connector to use Elasticsearch Http client" (Zhenxiao Luo)
  • 8186abe Revert "bulk load elasticsearch test data" (Zhenxiao Luo)
  • 2777136 Revert "Load tables dynamically in Elasticsearch" (Zhenxiao Luo)
  • 9ed2baf Revert "Adjust Elasticsearch timeout defaults" (Zhenxiao Luo)
  • d51a968 Revert "Add support for Elasticsearch query string syntax" (Zhenxiao Luo)
  • 561a4b4 Revert "Make Elasticsearch retry timeout configurable" (Zhenxiao Luo)
  • 5510d4d Revert "Refresh Elasticsearch nodes periodically" (Zhenxiao Luo)
  • 6ac8dbe Revert "Move IndexMetadata, NodesResponse, SearchShardsResponse, Shard, ElasticsearchNode into Elasticsearch client directory" (Zhenxiao Luo)
  • 7df099e Revert "Add system.nodes table to Elasticsearch" (Zhenxiao Luo)
  • f7521f2 Revert "Use absolute paths for Elasticsearch requests" (Zhenxiao Luo)
  • 323b60a Revert "Support nodes with no public http address" (Zhenxiao Luo)
  • 2a9081f Revert "Move ElasticsearchClient to client package" (Zhenxiao Luo)
  • 7e1e8ce Revert "Add support for AWS IAM authorization to Elasticsearch connector" (Zhenxiao Luo)
  • 5b8c54c Revert "Add support for datetimes exposed as numbers in Elasticsearch" (Zhenxiao Luo)
  • 76d8dc6 Revert "Handle mixed-case columns in Elasticsearch" (Zhenxiao Luo)
  • c32856d Revert "Fix predicate pushdown for Elasticsearch" (Zhenxiao Luo)
  • b3705cc Revert "Use filter clause for Elasticsearch queries" (Zhenxiao Luo)
  • 6be1765 Revert "Add support for querying Elasticsearch aliases" (Zhenxiao Luo)
  • 5e9e097 Revert "Refresh Elasticsearch index before test queries" (Zhenxiao Luo)
  • df2e257 Revert "Iterate over Elasticsearch documents in index order" (Zhenxiao Luo)
  • 8138809 Revert "Add option to ignore Elasticsearch publish_address" (Zhenxiao Luo)
  • ac103a5 Revert "Add Elasticsearch array support using definitions in the _meta field" (Zhenxiao Luo)
  • 4e726f1 Revert "Handle empty object field in Elasticsearch" (Zhenxiao Luo)
  • 51fdf3a Revert "Fix error message typo in Elasticsearch decoders" (Zhenxiao Luo)
  • 5c2233c Revert "Add support for nested types in Elasticsearch" (Zhenxiao Luo)
  • 6b88754 Revert "Report field name when decoding Elasticsearch value fails" (Zhenxiao Luo)
  • ca60019 Pass encryption info from hive to ORC reader/writer (Rebecca Schlussel)
  • 38358b8 Add more testing for encryption/decryption (Rebecca Schlussel)
  • 00d0011 Add DWRF encryption support to ORC writer (Rebecca Schlussel)
  • d4a1822 Add encryption support to ORC ColumnWriters (Rebecca Schlussel)
  • 9ca3b95 Add DWRF encryption support to ORC reader (Rebecca Schlussel)
  • 81a6b0d Move creation of includedOrcColumns to RecordReader (Rebecca Schlussel)
  • f51c5c5 Make columnEncodings into a map (Rebecca Schlussel)
  • 37b5701 Add metadata support for encrypted DWRF files (Rebecca Schlussel)
  • c6415c4 Remove unused field (Rebecca Schlussel)
  • f1bfd8c Fix copywrite for TestByteArrayUtils (Rebecca Schlussel)
  • 7213ae3 Enable statistics aggregation for temporary table. (Peizhen Guo)
  • 1fbfb40 Disable encryption tests temporarily (Mayank Garg)

@mayankgarg1990
Copy link

Can you comment with the missing release notes for the PRs that you authored/merged - @caithagoras , @highker , @rschlussel , @mbasmanova , @zhenxiao , @arhimondr

* Add local disk spilling support for aggregation functions with `ORDER BY` or `DISTINCT` syntax.
* Add optimization for cursor projection & filter by extract and compute common subexpressions among all projections & filter first. This optimization can be turned off by session property ``optimize_common_sub_expressions``.
* Add support for 2 security modes for views. The default `DEFINER` security mode is the same as the previous behavior. Tables referenced in the view are accessed using the permissions of the view owner (the **creator** or.
* Add support for warning on unfiltered partition keys using `partition-keys-to-warn-on-no-filtering` system property.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I updated the description with the release note for this PR for #14800 but I can't seem to open a PR against your repo to update it here.

@rschlussel
Copy link
Contributor

I'm not sure if this needs a release note, but it is a dependency change. Also, maybe it should be under the accumulo and kafka release notes since that is where the dependency is actually used.

General Changes

  • Update zookeeper version from 3.4.13 to 3.4.14

* Remove experimental feature to perform grouped execution for eligible table scans and its associated configuration property ``experimental.grouped-execution-for-elligible-table-scans`` and session property ``grouped_execution_for_eligible_table_scans``.
* Definer** of the view) rather than the user executing the query. In the `INVOKER` security mode, tables referenced in the view are accessed using the permissions of the query user (the **invoker** of the view).
* Enable ``dynamic-schedule-for-grouped-execution`` by default. In future releases, we will remove this property, and grouped execution will always use dynamic scheduling.
* Enable ``grouped-execution-for-aggregation`` and ``experimental.grouped-execution-for-eligible-table-scans`` by default.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sorry for the confusion. The commit for Remove experimental feature to perform grouped execution for eligible table scans ... was merged after this one, so this note should just say

Enable ``grouped-execution-for-aggregation`` by default.

* Enable ``dynamic-schedule-for-grouped-execution`` by default. In future releases, we will remove this property, and grouped execution will always use dynamic scheduling.
* Enable ``grouped-execution-for-aggregation`` and ``experimental.grouped-execution-for-eligible-table-scans`` by default.
* Enable async page transport with non-blocking IO by default. This can be disabled by setting ``exchange.async-page-transport-enabled`` configuration property to false.
* Introduce new configuration property ``grouped-execution-enabled`` and session property ``grouped_execution`` to turn grouped execution on or off. This property is true by default. If set to false, it is equivalent to setting all of ``grouped-execution-for-aggregation``, ``grouped-execution-for-join``, and ``experimental.grouped-execution-for-eligible-table-scans`` to false. In future releases we will remove these other properties and only have a single switch for enabling and disabling grouped execution.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remove experimental.grouped-execution-for-eligilbe-table-scans from this list, as we decided to get rid of this feature entirely instead.

* Add support for warning on unfiltered partition keys using `partition-keys-to-warn-on-no-filtering` system property.
* Add support to optimize min/max only metadata query. This is controlled by existing config ``optimizer.optimize-metadata-queries`` and session property ``optimize_metadata_queries``. Note that enabling this config/session property might change query result if there are metadata that refers to empty data, e.g. empty hive partition.
* Remove experimental feature to perform grouped execution for eligible table scans and its associated configuration property ``experimental.grouped-execution-for-elligible-table-scans`` and session property ``grouped_execution_for_eligible_table_scans``.
* Definer** of the view) rather than the user executing the query. In the `INVOKER` security mode, tables referenced in the view are accessed using the permissions of the query user (the **invoker** of the view).
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this should be part of the note Add support for 2 security modes for views. Looks like somehow some other bullets came in between.

* Definer** of the view) rather than the user executing the query. In the `INVOKER` security mode, tables referenced in the view are accessed using the permissions of the query user (the **invoker** of the view).
* Enable ``dynamic-schedule-for-grouped-execution`` by default. In future releases, we will remove this property, and grouped execution will always use dynamic scheduling.
* Enable ``grouped-execution-for-aggregation`` and ``experimental.grouped-execution-for-eligible-table-scans`` by default.
* Enable async page transport with non-blocking IO by default. This can be disabled by setting ``exchange.async-page-transport-enabled`` configuration property to false.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we get more information about what this means?

* Enable ``grouped-execution-for-aggregation`` and ``experimental.grouped-execution-for-eligible-table-scans`` by default.
* Enable async page transport with non-blocking IO by default. This can be disabled by setting ``exchange.async-page-transport-enabled`` configuration property to false.
* Introduce new configuration property ``grouped-execution-enabled`` and session property ``grouped_execution`` to turn grouped execution on or off. This property is true by default. If set to false, it is equivalent to setting all of ``grouped-execution-for-aggregation``, ``grouped-execution-for-join``, and ``experimental.grouped-execution-for-eligible-table-scans`` to false. In future releases we will remove these other properties and only have a single switch for enabling and disabling grouped execution.
* Reverting Reliable Resource Group Versioning.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

needs more information and to be in proper release note format.

* Enable async page transport with non-blocking IO by default. This can be disabled by setting ``exchange.async-page-transport-enabled`` configuration property to false.
* Introduce new configuration property ``grouped-execution-enabled`` and session property ``grouped_execution`` to turn grouped execution on or off. This property is true by default. If set to false, it is equivalent to setting all of ``grouped-execution-for-aggregation``, ``grouped-execution-for-join``, and ``experimental.grouped-execution-for-eligible-table-scans`` to false. In future releases we will remove these other properties and only have a single switch for enabling and disabling grouped execution.
* Reverting Reliable Resource Group Versioning.
* Specify allowed roles for HTTP endpoints.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

needs to be fleshed out

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I will introduce this point once we have the proper documentation for this feature.

* Introduce new configuration property ``grouped-execution-enabled`` and session property ``grouped_execution`` to turn grouped execution on or off. This property is true by default. If set to false, it is equivalent to setting all of ``grouped-execution-for-aggregation``, ``grouped-execution-for-join``, and ``experimental.grouped-execution-for-eligible-table-scans`` to false. In future releases we will remove these other properties and only have a single switch for enabling and disabling grouped execution.
* Reverting Reliable Resource Group Versioning.
* Specify allowed roles for HTTP endpoints.
* The worker page's thread snapshot UI does not work (no stack trace displayed on click) when there is active query load (tested under Chrome). This patch fixes an uninitialized variable in client JS that was causing this UI behavior.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Something like
Fix a bug where the UI for the worker's thread snapshot wouldn't display the stack trace.

Also, should be in a separate Web UI section.


Hive Changes
____________
* Add support for caching the Glue metastore.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should add a release note for encryption at rest @mayankgarg1990

  • Add support for reading and writing DWRF files with encryption. To create a table with encrypted columns....

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Given that encryption cannot be used readily and neither do we have a user guide for how to set it up - my opinion is that we don't publish it in the release notes.

* Enable ``dynamic-schedule-for-grouped-execution`` by default. In future releases, we will remove this property, and grouped execution will always use dynamic scheduling.
* Enable ``grouped-execution-for-aggregation`` and ``experimental.grouped-execution-for-eligible-table-scans`` by default.
* Enable async page transport with non-blocking IO by default. This can be disabled by setting ``exchange.async-page-transport-enabled`` configuration property to false.
* Introduce new configuration property ``grouped-execution-enabled`` and session property ``grouped_execution`` to turn grouped execution on or off. This property is true by default. If set to false, it is equivalent to setting all of ``grouped-execution-for-aggregation``, ``grouped-execution-for-join``, and ``experimental.grouped-execution-for-eligible-table-scans`` to false. In future releases we will remove these other properties and only have a single switch for enabling and disabling grouped execution.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

also, can this be moved above the previous note so that all the grouped execution property changes are next to each other.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have done some overall re-formatting - will let you review the final version.

@mayankgarg1990
Copy link

A quick note - I have not cleaned up the release notes yet - I am still working on cleaning up and reorganizing them.

@mayankgarg1990
Copy link

@tdcmeehan , @rschlussel - the release notes are ready for review

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

just realized the property is called grouped-execution-for-aggregation-enabled.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you also mention here that grouped-execution-for-aggregation-enabled and grouped-execution-for-join-enabled will be removed in a future release.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

mentioned grouped-execution-for-aggregation-enabled next to the point below where I say that it is enabled by default. Added the point about join one here

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what are the configurations?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

elasticsearch.max-http-connections : Maximum number of persistent HTTP connections to Elasticsearch.
elasticsearch.http-thread-count : Number of threads handling HTTP connections to Elasticsearch

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

added the detailed configuration information

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what's a composite publish_address? (maybe this is just because I don't know elasticsearch)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

publish_address field can contain addresses of the following form:

cname/ip:port
ip:port

If the CNAME is present, elasticsearch connector will use the CNAME and port.
Otherwise, it will use the IP and port.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated publish_address to be back quoted and also linked to the PR for context

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This description is a bit too technical. If this is accurate, maybe say

Optimize queries with repeated expressions in filters or projections by computing the common expressions only once.  This can be disabled by the session property
  ``optimize_common_sub_expressions``.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what's a min/max only metadata query?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Queries that are doing min(), max() only kind of operations

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have reworded this to be more clear (hopefully)

@mayankgarg1990 mayankgarg1990 force-pushed the release-notes-0.239 branch 2 times, most recently from 508c7bb to aa23eed Compare August 3, 2020 13:46
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
* Fix ``NullPointerException`` in ``/v1/thread`` end point.
* Fix error in ``/v1/thread`` end point.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
for certain queries that had filters pushed down to the table scan.
for certain queries.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
* Fix missing query completion events for queries which fail prior to dispatching.
* Fix missing query completion events for queries which fail prior to execution.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
* Fix potential infinite loop when the setting ``use_legacy_scheduler`` is set to ``false``.
* Fix potential performance regression when setting ``use_legacy_scheduler`` is set to ``false``.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
* Fix a bug where the UI for the worker's thread snapshot wouldn't display the stack trace.
* Fix worker thread snapshot UI to correctly display the stack trace.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is the default setting?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Generally when we say enable using, my assumption is that it is disabled by default.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  • Fix incorrect results from :func:classification_miss_rate, :func:classification_fall_out, and :func:classification_precision (:pr:14740).

Copy link
Contributor

@rschlussel rschlussel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please fix the merge conflict

Copy link
Contributor

@tdcmeehan tdcmeehan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Contributor Author

@caithagoras caithagoras Aug 7, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think you need

:pr:`14740`

Could you visually check the other content of the html by running

cd presto-docs
make clean
make html

@caithagoras caithagoras merged commit 412e489 into prestodb:master Aug 7, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants