This repository was archived by the owner on May 9, 2024. It is now read-only.
Replace dictionary proxies with nested dictionaries 01/N #661
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This patch series aims to remove the
StringDictionaryProxy
class by extendingStringDictionary
functionality to cover all use cases of the proxy class. The reason is that the dictionary encoding used for ResultSets is incompatible with the dictionary encoding used by ArrowStorage and expected by execution kernels. It means we simply cannot use dictionary-encoded columns from ResultSets and need to convert them (currently, ResultSetRegistry lacks such functionality, see #588). Instead of implementing such a conversion, I want to change the coding so that we wouldn't need to convert anything.There are a few major changes required to achieve this:
I try to make changes in many small steps to simplify the review process. So, many patches are very trivial refactoring or a small part of the new functionality.
This patch removes unused StringDictionaryProxy code.