Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
13 changes: 11 additions & 2 deletions docs/src/main/sphinx/connector/hive-s3.rst
Original file line number Diff line number Diff line change
Expand Up @@ -384,7 +384,7 @@ workload:
Considerations and limitations
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

* Only objects stored in CSV format are supported. Objects can be uncompressed,
* Only objects stored in CSV and JSON format are supported. Objects can be uncompressed,
or optionally compressed with gzip or bzip2.
* The "AllowQuotedRecordDelimiters" property is not supported. If this property
is specified, the query fails.
Expand All @@ -399,7 +399,16 @@ Enabling S3 Select pushdown
You can enable S3 Select Pushdown using the ``s3_select_pushdown_enabled``
Hive session property, or using the ``hive.s3select-pushdown.enabled``
configuration property. The session property overrides the config
property, allowing you enable or disable on a per-query basis.
property, allowing you enable or disable on a per-query basis. Non-filtering
queries (``SELECT * FROM table``) are not pushed down to S3 Select,
as they retrieve the entire object content.

For uncompressed files, S3 Select scans ranges of bytes in parallel. The scan range
requests run across the byte ranges of the internal Hive splits for the query fragments
pushed down to S3 Select. Changes in the Hive connector :ref:`performance tuning
configuration properties <hive-performance-tuning-configuration>` are likely to impact
S3 Select pushdown performance.


Understanding and tuning the maximum connections
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Expand Down
2 changes: 2 additions & 0 deletions docs/src/main/sphinx/connector/hive.rst
Original file line number Diff line number Diff line change
Expand Up @@ -731,6 +731,8 @@ Property Name Description
This is mutually exclusive with a global JSON key file.
============================================ =================================================================

.. _hive-performance-tuning-configuration:

Performance tuning configuration properties
-------------------------------------------

Expand Down