Conversation
Thank you for your pull request and welcome to our community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file. In order for us to review and merge your code, please submit the signed CLA to cla@prestosql.io. If you are contributing on behalf of someone else (e.g., your employer), the individual CLA may not be sufficient and your employer may need the Corporate CLA signed.
1e3e043 to 565f7f0
565f7f0 to 7e28f25
CLA check is red, but the commits are annotated with |
electrum left a comment
Please squash the fixups in the last commit into the previous commits as appropriate
Use dashes in file names (like other files)
Please wrap documentation text at 80-100 characters. The lines that wrap in GitHub are too long. Pre-formatted text, like explain plans, can be as long as necessary, of course.
What is "root" here? Maybe just remove that word.
Use a link
:doc:`/sql/explain`
``t`` does not exist in the example. How about
will show statistics for the table layout representing a subset of data after applying the given filtering condition
Change since we don't support a column list in the select clause
The filtering condition used in the ``WHERE`` clause can reference table columns.
Although I'm not sure this sentence adds any value. What else would it reference or why else would it be used?
How about
If the ``WHERE`` clause filters out all of the data in the table, then
``SHOW STATS`` will return no statistics, as shown in the example below.
Remove this section since it is no longer supported
Please update this section to use Presto's ANALYZE
I am dropping this whole section (below for reference). Instead, I added: "For the Hive connector, refer to :ref:`Hive connector <hive_analyze>` documentation if you want to update table statistics."
Updating Statistics For Hive Tables
-----------------------------------

For the Hive connector, Presto uses the statistics that are managed by Hive and
exposed via the Hive metastore API. Depending on the Hive configuration, table
statistics may not be updated automatically.

If statistics are not updated automatically, the user needs to trigger a
statistics update via the Hive CLI.

The following command can be used in the Hive CLI to update table statistics for
non-partitioned table ``t``::

    hive> ANALYZE TABLE t COMPUTE STATISTICS;
    hive> ANALYZE TABLE t COMPUTE STATISTICS FOR COLUMNS;

For partitioned tables, partitioning information must be specified in the
command. Assuming table ``t`` has two partitioning keys ``a`` and ``b``, the
following command would update the table statistics for all partitions::

    hive> ANALYZE TABLE t PARTITION (a, b) COMPUTE STATISTICS FOR COLUMNS;

It is also possible to update statistics for just a subset of partitions. This
command will update statistics for all partitions for which partitioning key
``a`` is equal to ``1``::

    hive> ANALYZE TABLE t PARTITION (a=1, b) COMPUTE STATISTICS FOR COLUMNS;

And this command will update statistics for just one partition::

    hive> ANALYZE TABLE t PARTITION (a=1, b=5) COMPUTE STATISTICS FOR COLUMNS;

For documentation on Hive's statistics mechanism see
https://cwiki.apache.org/confluence/display/Hive/StatsDev
bb09b12 to 0d33efe
ac |
electrum left a comment
A few nits, otherwise looks great!
Nit: don't wrap a single word. We wrap to improve readability, but this is less readable
Add "the" at end
refer to the ...
How about
refer to the ... documentation to learn how to update table statistics.
The sentence sounds tautological. How about
With cost based join distribution selection, Presto automatically chooses to use a partitioned or broadcast join.
Remove code quotes and lowercase on the description part, since we are describing the type of join, not referencing a config option
* ``BROADCAST`` - broadcast join distribution is used for all joins
Let's change this to
* Partitioned: each node participating in the query builds a hash table
  from only a fraction of the data
* Broadcast: each node participating in the query builds a hash table
  from all of the data (data is replicated to each node)
Since we are talking about logical operations, not config options. Also a few minor wording changes.
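For anyone following the thread, the difference between the two logical join types can be sketched roughly as below. This is only an illustration under stated assumptions, not Presto's implementation: the function names, the row format, and the use of Python's built-in `hash` to assign rows to nodes are all hypothetical.

```python
from collections import defaultdict


def build_partitioned(build_rows, key, num_nodes):
    """Partitioned: each node's hash table holds only the rows whose key hashes to it."""
    tables = [defaultdict(list) for _ in range(num_nodes)]
    for row in build_rows:
        node = hash(row[key]) % num_nodes  # hypothetical row-to-node assignment
        tables[node][row[key]].append(row)
    return tables


def build_broadcast(build_rows, key, num_nodes):
    """Broadcast: every node's hash table holds all rows (data is replicated)."""
    tables = []
    for _ in range(num_nodes):
        table = defaultdict(list)
        for row in build_rows:
            table[row[key]].append(row)
        tables.append(table)
    return tables


def table_sizes(tables):
    """Number of rows stored in each node's hash table."""
    return [sum(len(rows) for rows in table.values()) for table in tables]


# Illustrative build side: 8 rows spread over 4 nodes.
rows = [{"id": i, "name": f"n{i}"} for i in range(8)]
partitioned_sizes = table_sizes(build_partitioned(rows, "id", num_nodes=4))
broadcast_sizes = table_sizes(build_broadcast(rows, "id", num_nodes=4))
```

With the partitioned build, the per-node sizes add up to the size of the build side; with the broadcast build, every node holds the full build side, which is why it costs more memory per node.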
0d33efe to 2d98e04
ac |
2d98e04 to 6d48e0f
Extracted-From: https://github.com/starburstdata/presto Cherry-pick of trinodb/trino#127 Co-authored-by: Ariel Weisberg <aweisberg@fb.com>
Extracted-From: https://github.com/starburstdata/presto Cherry-pick of trinodb/trino#127 Co-authored-by: Łukasz Osipiuk <lukasz@osipiuk.net>
See https://docs.starburstdata.com/latest/optimizer.html for how this looks.