Order by clause with Hive causes performance degradation. #454
grafke wants to merge 4 commits into apache:master from grafke:master
Conversation
…case the query fails
…ase the query will fail, e.g. SELECT columnA AS columnA, columnB AS `timestamp`, SUM(columnC) AS sum__columnC FROM table <...> GROUP BY columnA, columnB ORDER BY SUM(columnC) DESC -- Invalid clause;
Wait. That
I see, yes, good point. I agree Hive is not the best choice. Would it be OK to add an if clause, i.e. if the engine is Hive, then don't do the ordering?
Series Limit would then act as a randomly selected set of series, and only for Hive? Makes no sense to me.
You're absolutely right, but the subselect on Hive at the moment does not
* word cloud control panel * oops, control panel should actually be used
Order by clause with Hive causes performance degradation. It doesn't really do anything else, except that in some cases it causes queries to fail. I'm not sure if this is the right way to fix it, but on the other hand disabling order by really helps with performance. Otherwise, perhaps it would be wise to add an order by button to the web interface, so that the user could control it.
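
For illustration, the "if Hive then don't do the ordering" idea from the conversation could look roughly like the sketch below. This is not the code from this PR: `build_query`, its parameters, and the plain-string engine name are all hypothetical, and the query is assembled as a string only to keep the example self-contained.

```python
def build_query(engine_name, groupby, metric_expr, table, order_desc=True):
    """Hypothetical helper: build a grouped query, skipping ORDER BY on Hive."""
    select_cols = ", ".join(groupby + [f"{metric_expr} AS metric"])
    sql = f"SELECT {select_cols} FROM {table} GROUP BY {', '.join(groupby)}"
    # Assumption: the engine is identified by a plain string such as "hive".
    # order_desc stands in for a user-facing "order by" toggle.
    if order_desc and engine_name.lower() != "hive":
        sql += f" ORDER BY {metric_expr} DESC"
    return sql


hive_sql = build_query("hive", ["columnA", "columnB"], "SUM(columnC)", "some_table")
other_sql = build_query("mysql", ["columnA", "columnB"], "SUM(columnC)", "some_table")
```

As pointed out above, a row or series limit applied to the unordered Hive result would no longer return the top series, so a sketch like this trades correctness of the limit for speed.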