[WIP] Query plan pushdown by hellium01 · Pull Request #12375 · prestodb/presto

hellium01 · 2019-02-22T18:40:49Z

Related to: #12368

Will add test later when we can convert the column expression back to SQL expression


highker

Typo "funtion" in commit title.

Finished reviewing commit "Move funtion signature to SPI". Minor comments only. I think currently it makes sense to move Signature to spi.function rather than a new package given most function dependencies won't go to the new package and Signature is a common helper that will be used by connectors.

highker · 2019-02-23T06:35:54Z

presto-main/src/main/java/com/facebook/presto/metadata/InternalFunction.java

+        return new Signature(name, FunctionKind.SCALAR, ImmutableList.of(), ImmutableList.of(), returnType, argumentTypes, false);
+    }
+
+    public static SignatureBuilder builder()


Moving Signature to presto-spi but leaving SignatureBuilder in presto-main reminds me that we have Type in presto-spi but all type coercion in presto-main. This weakens the power of the moved class.

We can:

- move SignatureBuilder + OperatorSignatureUtils to presto-spi
- rename OperatorSignatureUtils to SignatureUtils.

I think SignatureBuilder depends on guava.

We will use UnmodifiableList
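A minimal sketch of how a builder could live in presto-spi without Guava, using the JDK's unmodifiable list wrappers instead of ImmutableList. `SimpleSignature` and `SimpleSignatureBuilder` are hypothetical stand-ins that only carry a name and argument type names; they are not the real Signature or SignatureBuilder classes.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

import static java.util.Objects.requireNonNull;

// Hypothetical stand-in for Signature: holds only a name and argument type names
final class SimpleSignature
{
    private final String name;
    private final List<String> argumentTypes;

    SimpleSignature(String name, List<String> argumentTypes)
    {
        this.name = requireNonNull(name, "name is null");
        requireNonNull(argumentTypes, "argumentTypes is null");
        // Defensive copy wrapped in an unmodifiable view, in place of ImmutableList.copyOf
        this.argumentTypes = Collections.unmodifiableList(new ArrayList<>(argumentTypes));
    }

    String getName()
    {
        return name;
    }

    List<String> getArgumentTypes()
    {
        return argumentTypes;
    }
}

// Hypothetical builder that depends only on java.util
final class SimpleSignatureBuilder
{
    private String name;
    private final List<String> argumentTypes = new ArrayList<>();

    SimpleSignatureBuilder name(String name)
    {
        this.name = name;
        return this;
    }

    SimpleSignatureBuilder argumentType(String type)
    {
        argumentTypes.add(type);
        return this;
    }

    SimpleSignature build()
    {
        return new SimpleSignature(name, argumentTypes);
    }
}
```

The copy in the constructor matters: wrapping the builder's list directly would let later builder calls change an already-built signature.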


highker · 2019-02-23T06:45:33Z

presto-spi/src/main/java/com/facebook/presto/spi/function/Signature.java

@@ -229,9 +209,4 @@ public static LongVariableConstraint longVariableExpression(String variable, Str
    {
        return new LongVariableConstraint(variable, expression);
    }
-
-    public static SignatureBuilder builder()


Leave as is

highker

Finished reviewing "Move RowExpression to SPI".
Maybe add commit body "Move RowExpression to SPI and rename RowExpression to ColumnExpression".

This commit fails to compile. There are leftover references to RowExpression in the codebase. For example, SqlToRowExpressionTranslator is merely a file move without altering the class name.

presto-spi/src/main/java/com/facebook/presto/spi/type/FunctionType.java

presto-spi/src/main/java/com/facebook/presto/spi/relation/column/CallExpression.java

...to-spi/src/main/java/com/facebook/presto/spi/relation/column/LambdaDefinitionExpression.java

presto-spi/src/main/java/com/facebook/presto/spi/type/FunctionType.java

highker

Finished commit "Add ColumnReferenceExpression". Please update the commit body to say: ColumnReferenceExpression is a new expression for table source blah blah. The only concern of this commit is the translate() interface. That may need some more discussion. Otherwise looks good to me.

highker · 2019-02-23T08:10:50Z

presto-main/src/main/java/com/facebook/presto/sql/relational/Expressions.java

                }
+
+                @Override
+                public Void visitColumnReference(ColumnReferenceExpression columnReferenceExpression, Void context)


columnReferenceExpression -> reference

highker · 2019-02-23T08:12:40Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

+import static java.util.stream.Collectors.toMap;

-public final class SqlToRowExpressionTranslator
+public final class SqlToColumnExpressionTranslator


Either squash the current commit with the previous one ("Move RowExpression to SPI") or move the renaming change to the previous one so that compilation does not fail.

highker · 2019-02-23T08:23:40Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

+        return translate(expression, functionKind, types, ImmutableMap.of(), functionRegistry, typeManager, session, optimize);
+    }
+
+    public static ColumnExpression translate(


Users can be confused by these interfaces. Just keep the most expressive one and remove the other two. Update the callers' parameters with ImmutableList.of() if necessary.

highker · 2019-02-23T08:30:05Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

+            if (columnHandles.containsKey(node.getName())) {
+                return new ColumnReferenceExpression(columnHandles.get(node.getName()), getType(node));
+            }
+            else if (inputChannels.containsKey(node.getName())) {


remove else
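A small sketch of the shape being asked for, assuming both branches return: once the first `if` returns, the second check needs no `else`. `SymbolResolver` and its string results are hypothetical and only mirror the two-map lookup in the snippet.

```java
import java.util.Map;

final class SymbolResolver
{
    // Hypothetical lookup mirroring the columnHandles / inputChannels checks
    static String resolve(String name, Map<String, String> columnHandles, Map<String, Integer> inputChannels)
    {
        if (columnHandles.containsKey(name)) {
            return "column:" + columnHandles.get(name);
        }
        // No else needed: the branch above always returns
        if (inputChannels.containsKey(name)) {
            return "input:" + inputChannels.get(name);
        }
        throw new IllegalArgumentException("Unknown symbol " + name);
    }
}
```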

highker · 2019-02-23T08:32:53Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

+        public ColumnExpression visitCall(CallExpression call, Void context)
+        {
+            return call.replaceChildren(call.getArguments().stream()
+                    .map(argument -> argument.accept(this, context)).collect(toImmutableList()));


we usually do

something
        .func1()
        .func2()
        .func3();

highker · 2019-02-23T08:57:36Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

        }
    }
+
+    private static class InlineInputChannelVistor


Typo: "Vistor". Call it InlineInputChannelsVisitor.

highker · 2019-02-23T08:57:50Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

+            this.inputs = requireNonNull(inputs, "input is null");
+        }
+
+        public static ColumnExpression inlineInputs(ColumnExpression columnExpression, List<ColumnExpression> inputs)


inlineInputChannels

highker · 2019-02-23T08:59:21Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

+        public ColumnExpression visitInputReference(InputReferenceExpression reference, Void context)
+        {
+            int field = reference.getField();
+            checkArgument(field >= 0 && field < inputs.size(), "Unknown input field");


format("Unknown input field %s", field)
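A sketch of what including the offending value in the message could look like. The `checkArgument` helper here is a local stand-in written for this example (Guava's Preconditions offers a similar template-plus-arguments overload); `InputFieldCheck` and `inputAt` are hypothetical names.

```java
import java.util.List;

final class InputFieldCheck
{
    // Local stand-in for a precondition helper; the message is formatted only on failure
    static void checkArgument(boolean condition, String template, Object... args)
    {
        if (!condition) {
            throw new IllegalArgumentException(String.format(template, args));
        }
    }

    // Returns the input at the given field, reporting the bad index when out of range
    static String inputAt(List<String> inputs, int field)
    {
        checkArgument(field >= 0 && field < inputs.size(), "Unknown input field %s", field);
        return inputs.get(field);
    }
}
```

Passing the template and arguments separately avoids building the string on the happy path, which an eager `format(...)` call at the call site would do.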

highker · 2019-02-23T09:57:18Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

+            FunctionKind functionKind,
+            Map<NodeRef<Expression>, Type> types,
+            Map<String, ColumnHandle> columnHandles,
+            List<String> inputChannelNames,


Not a big fan of passing these two parameters in. They are too specific to table sources. But before we address them: I find that columnHandles is always passed as empty by the callers (even in your last commit). Is that true? If it is, let's just remove it. If not, I'm thinking of leveraging the context to tell the visitor what exactly to translate (a filter, a project, ...) in order to hide the parameters.

highker · 2019-02-23T09:58:03Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

-        protected RowExpression visitSymbolReference(SymbolReference node, Void context)
+        protected ColumnExpression visitSymbolReference(SymbolReference node, Void context)
        {
+            if (columnHandles.containsKey(node.getName())) {


This condition relies on the assumption that column names are symbol names, which feels a bit risky. Also, these if statements kind of break abstraction because they are basically serving two different callers. One possibility is to leverage Context as stated above. But let's first figure out whether columnHandles is used or not...

highker

Skipping the review of "Convert ColumnExpression back to Expression" for now. I will come back to this commit after going through all the following commits to see if there is a way to refactor PlanNode to avoid the dependency on sql.tree.

highker

Finished review "Add translation from planNode to tableExpression". Most are minor issues. TableExpression is a much cleaner IR than the current PlanNode. It's very unfortunate we cannot use PlanNode for connectors.

The biggest concern is the translation: not its correctness per se but its consistency, especially if someone alters PlanNode in the future. Currently, the tests with reflection can guard against this to some extent. Still, I wonder whether we can move the translation to some place (e.g., RelationPlanner) to keep it consistent.

highker · 2019-02-24T21:52:10Z

...com/facebook/presto/sql/planner/iterative/connector/TableExpressionToPlanNodeTranslator.java

+
+    public TableExpressionToPlanNodeTranslator(PlanNodeIdAllocator idAllocator, SymbolAllocator symbolAllocator, LiteralEncoder literalEncoder, Metadata metadata)
+    {
+        requireNonNull(metadata, "metadata is null");


Move private final Metadata metadata; as the first member variable. Make this line this.metadata = requireNonNull(metadata, "metadata is null");
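A minimal sketch of the requested constructor shape, with the null check and the field assignment on one line. `TranslatorSketch` is hypothetical, and `Object` stands in for Metadata and PlanNodeIdAllocator so the example stays self-contained.

```java
import static java.util.Objects.requireNonNull;

final class TranslatorSketch
{
    // metadata is declared first, as suggested in the comment
    private final Object metadata;
    private final Object idAllocator;

    TranslatorSketch(Object idAllocator, Object metadata)
    {
        // requireNonNull returns its argument, so check and assignment fit on one line
        this.metadata = requireNonNull(metadata, "metadata is null");
        this.idAllocator = requireNonNull(idAllocator, "idAllocator is null");
    }

    Object getMetadata()
    {
        return metadata;
    }

    Object getIdAllocator()
    {
        return idAllocator;
    }
}
```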

highker · 2019-02-24T21:54:23Z

...com/facebook/presto/sql/planner/iterative/connector/TableExpressionToPlanNodeTranslator.java

+        return tableExpression.accept(new Visitor(session, connectorId), new Context(outputSymbols));
+    }
+
+    private static class Context


PlanNodeTranslatorContext

highker · 2019-02-24T21:54:34Z

...com/facebook/presto/sql/planner/iterative/connector/TableExpressionToPlanNodeTranslator.java

+        }
+    }
+
+    private class Visitor


PlanNodeTranslatorVisitor

highker · 2019-02-24T23:21:48Z

...com/facebook/presto/sql/planner/iterative/connector/PlanNodeToTableExpressionTranslator.java

+        return plan.accept(new PlanRewriter(), null);
+    }
+
+    private class PlanRewriter


TableExpressionTranslatorVisitor

highker · 2019-02-24T23:23:01Z

...com/facebook/presto/sql/planner/iterative/connector/PlanNodeToTableExpressionTranslator.java

+        private ColumnExpression toColumnExpression(Expression expression, List<Symbol> inputs, FunctionKind type)
+        {
+            Map<NodeRef<Expression>, Type> expressionTypes =
+                    getExpressionTypes(session, metadata, parser, types, expression, ImmutableList.of(), WarningCollector.NOOP, false);


Move this to the previous line

highker · 2019-02-25T06:21:20Z

presto-spi/src/main/java/com/facebook/presto/spi/relation/FilterExpression.java

+    @Override
+    public String toString()
+    {
+        return "FilterExpression{" +


Use com.google.common.base.MoreObjects.toStringHelper

highker · 2019-02-25T06:21:30Z

presto-spi/src/main/java/com/facebook/presto/spi/relation/AggregateExpression.java

+    @Override
+    public String toString()
+    {
+        return "AggregateExpression{" +


Use com.google.common.base.MoreObjects.toStringHelper

highker · 2019-02-25T06:22:24Z

presto-spi/src/main/java/com/facebook/presto/spi/relation/ProjectExpression.java

+    @Override
+    public String toString()
+    {
+        return "ProjectExpression{" +


Use toStringHelper

highker · 2019-02-25T06:22:44Z

presto-spi/src/main/java/com/facebook/presto/spi/relation/TableExpression.java

+
+import java.util.List;
+
+public abstract class TableExpression


Add getSources if you want.

highker · 2019-02-25T06:23:24Z

presto-spi/src/main/java/com/facebook/presto/spi/relation/TableScanExpression.java

+    @Override
+    public String toString()
+    {
+        return "TableScanExpression{" +


Use toStringHelper

highker

Finished review "Add connector based optimizer". Pretty clean patch % minor comments

highker · 2019-02-25T06:32:25Z

presto-main/src/main/java/com/facebook/presto/connector/ConnectorManager.java

            PageSourceManager pageSourceManager,
            IndexManager indexManager,
-            NodePartitioningManager nodePartitioningManager,
+            ConnectorOptimizationRuleManager optimizationRuleManager, NodePartitioningManager nodePartitioningManager,


One parameter a line

highker · 2019-02-25T06:38:17Z

presto-main/src/main/java/com/facebook/presto/server/ServerMainModule.java

        // index manager
        binder.bind(IndexManager.class).in(Scopes.SINGLETON);

+        binder.bind(ConnectorOptimizationRuleManager.class).in(Scopes.SINGLETON);


Abstract ConnectorOptimizationRuleManager out with RuleManager as the interface. Create NoopRuleManager for worker. Bind different implementation for coordinator and worker.
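One way this split could look, sketched without Guice. Only the names RuleManager, NoopRuleManager and ConnectorOptimizationRuleManager come from the review; the method signatures, the use of strings for connector ids and rules, and the `RuleManagers.forNode` selector are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical interface; methods are illustrative only
interface RuleManager
{
    void addRule(String connectorId, String rule);

    List<String> getRules(String connectorId);
}

// Coordinator-side implementation that stores rules per connector
final class ConnectorOptimizationRuleManager
        implements RuleManager
{
    // Declared through the Map interface, backed by a concurrent implementation
    private final Map<String, List<String>> rules = new ConcurrentHashMap<>();

    @Override
    public void addRule(String connectorId, String rule)
    {
        rules.computeIfAbsent(connectorId, key -> Collections.synchronizedList(new ArrayList<>())).add(rule);
    }

    @Override
    public List<String> getRules(String connectorId)
    {
        List<String> connectorRules = rules.get(connectorId);
        if (connectorRules == null) {
            return Collections.emptyList();
        }
        return new ArrayList<>(connectorRules);
    }
}

// Worker-side implementation: workers do not optimize plans, so nothing is stored
final class NoopRuleManager
        implements RuleManager
{
    @Override
    public void addRule(String connectorId, String rule)
    {
    }

    @Override
    public List<String> getRules(String connectorId)
    {
        return Collections.emptyList();
    }
}

final class RuleManagers
{
    // Stand-in for binding a different implementation per node role
    static RuleManager forNode(boolean coordinator)
    {
        return coordinator ? new ConnectorOptimizationRuleManager() : new NoopRuleManager();
    }
}
```

In the real module the selection would presumably be a conditional binding of RuleManager rather than a static factory.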

highker · 2019-02-25T06:38:30Z

presto-main/src/main/java/com/facebook/presto/connector/ConnectorOptimizationRuleManager.java

+
+public class ConnectorOptimizationRuleManager
+{
+    private final ConcurrentMap<ConnectorId, ConnectorRuleProvider> providers = new ConcurrentHashMap<>();


Use Map interface

highker · 2019-02-25T06:40:46Z

presto-main/src/main/java/com/facebook/presto/sql/planner/iterative/ConnectorOptimizer.java

+    }
+
+    @Override
+    public PlanNode optimize(PlanNode plan, Session session, TypeProvider types, SymbolAllocator symbolAllocator, PlanNodeIdAllocator idAllocator, WarningCollector warningCollector)


Add requireNonNulls in the function body.

highker · 2019-02-25T06:42:11Z

presto-main/src/main/java/com/facebook/presto/sql/planner/iterative/ConnectorOptimizer.java

+        return progress;
+    }
+
+    private static class Context


ConnectorOptimizerContext

highker · 2019-02-25T06:44:27Z

presto-main/src/main/java/com/facebook/presto/sql/planner/iterative/Memo.java

+    {
+        requireNonNull(node, "node is null");
+        TraitGroup merged = TraitGroup.emptyTraitGroup();
+        node.getSources().stream()


new line stream(). Same for the one below.

highker · 2019-02-25T06:45:16Z

presto-main/src/main/java/com/facebook/presto/sql/planner/iterative/Memo.java

+        traitCollectors.stream()
+                .filter(traitCollector -> traitCollector.canApplyTo(node))
+                .map(traitCollector -> traitCollector.exploreTraits(node))
+                .forEach(traitGroup -> merged.addAll(traitGroup));


merged::addAll

highker · 2019-02-25T06:56:22Z

presto-main/src/main/java/com/facebook/presto/sql/planner/iterative/Memo.java

+
+        public void addAll(TraitGroup other)
+        {
+            other.traitGroups.entrySet().stream()


The following statement is good enough

other.traitGroups.forEach((type, traits) -> traits.getValues().forEach(value -> addTrait(type, value)));
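A self-contained sketch of the two simplifications suggested for Memo: `Map.forEach` with a nested `forEach` in `addAll`, and the `merged::addAll` method reference. `TraitGroupSketch` is hypothetical; it uses strings for trait types and values and a plain list per type instead of the PR's trait classes.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class TraitGroupSketch
{
    private final Map<String, List<String>> traitGroups = new LinkedHashMap<>();

    void addTrait(String type, String value)
    {
        traitGroups.computeIfAbsent(type, key -> new ArrayList<>()).add(value);
    }

    // Map.forEach hands over key and value directly, so no entrySet().stream() is needed
    void addAll(TraitGroupSketch other)
    {
        other.traitGroups.forEach((type, traits) -> traits.forEach(value -> addTrait(type, value)));
    }

    List<String> get(String type)
    {
        return traitGroups.getOrDefault(type, new ArrayList<>());
    }

    static TraitGroupSketch merge(List<TraitGroupSketch> groups)
    {
        TraitGroupSketch merged = new TraitGroupSketch();
        // Method reference instead of traitGroup -> merged.addAll(traitGroup)
        groups.forEach(merged::addAll);
        return merged;
    }
}
```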

highker · 2019-02-25T06:59:43Z

presto-spi/src/main/java/com/facebook/presto/spi/connector/ConnectorOptimizationRule.java

+
+    boolean match(ConnectorSession session, TableExpression tableExpression);
+
+    Optional<TableExpression> optimize(ConnectorSession session, TableExpression tableExpression);


Why Optional? If there is nothing to optimize, we can return the given tableExpression.
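A sketch of the non-Optional contract, assuming a rule signals "nothing to optimize" by returning its input. Strings stand in for TableExpression, and `FixpointOptimizer` and `dropTrivialFilter` are hypothetical names invented for this example.

```java
import java.util.function.UnaryOperator;

final class FixpointOptimizer
{
    // Applies the rule until it returns its input unchanged; "no change" takes the place of Optional.empty()
    static String optimize(UnaryOperator<String> rule, String expression)
    {
        String current = expression;
        while (true) {
            String next = rule.apply(current);
            if (next.equals(current)) {
                return current;
            }
            current = next;
        }
    }

    // Hypothetical rule: drops one redundant "filter(true, ...)" wrapper
    static String dropTrivialFilter(String expression)
    {
        String prefix = "filter(true, ";
        if (expression.startsWith(prefix) && expression.endsWith(")")) {
            return expression.substring(prefix.length(), expression.length() - 1);
        }
        // Nothing to optimize: hand back the given expression
        return expression;
    }
}
```

The trade-off is that the caller detects progress by comparing the result with the input, so the expression type needs a cheap and reliable equality check.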

highker

Finished review "Prevent overriding picked layout and move connector optimzation to later stage". LGTM. Refine the title as "Move connector optimization after picking table layout".

highker · 2019-02-25T07:28:16Z

presto-main/src/main/java/com/facebook/presto/sql/planner/optimizations/AddExchanges.java

        {
            List<PlanNode> possiblePlans = PickTableLayout.listTableLayouts(node, predicate, true, session, types, idAllocator, metadata, parser, domainTranslator);
-            List<PlanWithProperties> possiblePlansWithProperties = possiblePlans.stream()
+            List<PlanWithProperties> possiblePlansWithProperties = Stream.concat(node.getLayout().isPresent() ? Stream.of(node) : Stream.empty(), possiblePlans.stream())


Add a comment
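For readers following the diff, a sketch of what the `Stream.concat` expression appears to do: keep the node with its already-picked layout, if any, as the first candidate ahead of the newly listed layouts. Strings stand in for plan nodes, and `CandidatePlans` is a hypothetical name.

```java
import java.util.List;
import java.util.Optional;
import java.util.stream.Collectors;
import java.util.stream.Stream;

final class CandidatePlans
{
    // Puts the already-picked layout (if present) in front of the other possible plans
    static List<String> candidates(Optional<String> pickedLayout, List<String> possiblePlans)
    {
        Stream<String> picked = pickedLayout.isPresent()
                ? Stream.of(pickedLayout.get())
                : Stream.<String>empty();
        return Stream.concat(picked, possiblePlans.stream())
                .collect(Collectors.toList());
    }
}
```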

highker · 2019-02-25T07:29:37Z

presto-main/src/main/java/com/facebook/presto/sql/planner/PlanOptimizers.java

                        .addAll(new ExtractSpatialJoins(metadata, splitManager, pageSourceManager, sqlParser).rules())
                        .add(new InlineProjections())
                        .build()));
+        builder.add(new ConnectorOptimizer(metadata, sqlParser, connectorOptimizationRuleManager, new LiteralEncoder(metadata.getBlockEncodingSerde())));


Add a comment saying why we put the connector optimizer here.

highker

Finished review "Add checks to prevent uncleaned relation be returned from connector". Alter the title to "Add checks to prevent uncleaned relation returned from connectors"

BTW, the commit timestamps are messed up. Use git rebase master --ignore-date -x 'git commit --amend -C HEAD --date="$(date -R)" && sleep 1.05' to clean up timestamps before sending out a PR.

highker · 2019-02-25T07:32:23Z

...com/facebook/presto/sql/planner/iterative/connector/TableExpressionToPlanNodeTranslator.java

+
+        private void checkNoColumnInputs(UnaryTableExpression node)
+        {
+            int numColumnReferences = node.getOutput().stream()


Put stream() to its own line.

highker · 2019-02-25T07:32:59Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

 import static java.util.stream.Collectors.toMap;

-public final class SqlToColumnExpressionTranslator
+public final class


This seems to be a typo

highker · 2019-02-25T07:33:28Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

+    public static class InputCollector
+            implements ColumnExpressionVisitor<Set<ColumnExpression>, Void>
+    {
+        private InputCollector()


private InputCollector() {}

highker · 2019-02-25T07:33:49Z

...o-main/src/main/java/com/facebook/presto/sql/relational/SqlToColumnExpressionTranslator.java

+        @Override
+        public Set<ColumnExpression> visitCall(CallExpression call, Void context)
+        {
+            return call.getArguments().stream()


Put stream() to its own line.

highker

Reviewed "Allow use intermediate type as input of aggregation functin" but have not finished yet. I will pause here for a moment given that the logic is quite hard to follow. This commit may need some refactoring, but let's not worry about that for now and focus on the translation part first.

highker · 2019-02-25T07:55:02Z

presto-main/src/main/java/com/facebook/presto/metadata/FunctionRegistry.java

+                .anyMatch(this::isSpecializedType);
+    }
+
+    private boolean isSpecializedType(TypeSignature typeSignature)


highker · 2019-02-25T07:55:13Z

presto-main/src/main/java/com/facebook/presto/metadata/FunctionRegistry.java

+
+    private boolean isSpecializedType(TypeSignature typeSignature)
+    {
+        return !typeSignature.getParameters()


inline and use

return typeSignature.getParameters().stream().noneMatch(TypeSignatureParameter::isVariable);
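A small sketch of the `noneMatch` form, which reads as "no parameter is a variable" and is equivalent to negating `anyMatch`. Strings stand in for TypeSignatureParameter, and the `isVariable` rule used here (a leading uppercase letter such as T or K) is purely an assumption for the example.

```java
import java.util.List;

final class SpecializedTypeCheck
{
    // Hypothetical stand-in for TypeSignatureParameter::isVariable
    static boolean isVariable(String parameter)
    {
        return !parameter.isEmpty() && Character.isUpperCase(parameter.charAt(0));
    }

    // True when no parameter is a type variable; equivalent to !stream().anyMatch(...)
    static boolean isSpecializedType(List<String> parameters)
    {
        return parameters.stream().noneMatch(SpecializedTypeCheck::isVariable);
    }
}
```

Note that `noneMatch` returns true for an empty parameter list, the same result the negated `anyMatch` would give.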

highker · 2019-02-25T07:55:56Z

presto-main/src/main/java/com/facebook/presto/metadata/FunctionRegistry.java

        try {
            return specializedAggregationCache.getUnchecked(getSpecializedFunctionKey(signature));
        }
-        catch (UncheckedExecutionException e) {


Why this change?

highker · 2019-02-25T07:56:54Z

presto-main/src/main/java/com/facebook/presto/metadata/FunctionRegistry.java

        throw new PrestoException(FUNCTION_NOT_FOUND, message);
    }

+    private boolean isSpecializedFunction(Signature signature)


highker · 2019-02-25T07:57:05Z

presto-main/src/main/java/com/facebook/presto/metadata/FunctionRegistry.java


+    private boolean isSpecializedFunction(Signature signature)
+    {
+        return isSpecializedType(signature.getReturnType()) && signature.getArgumentTypes().stream()


return isSpecializedType(signature.getReturnType()) && signature.getArgumentTypes().stream().anyMatch(FunctionRegistry::isSpecializedType);

highker · 2019-02-25T08:13:46Z

presto-main/src/main/java/com/facebook/presto/operator/ScanFilterAndProjectOperator.java

            ConnectorPageSource source = pageSourceProvider.createPageSource(operatorContext.getSession(), split, columns);
            if (source instanceof RecordPageSource) {
                cursor = ((RecordPageSource) source).getCursor();
+                pageSource = source;


This is not right.... a typo?

highker · 2019-02-25T08:32:45Z

Did a first round review. Two high-level comments:

1. Given that the amount of code moved to SPI is small, I'm now NOT that concerned about whether Signature or TableExpression should go to SPI. It is OK to put them there for now (maybe even with the visitor as a first attempt; ultimately we will get rid of the visitor pattern).
2. Translation is an issue. In the long term, TableExpression will be the IR and PlanNode the execution plan. To achieve that, we may consider the following while giving connectors the capability to participate in plan optimization:
   - Can we remove the dependency on Expression from PlanNode, to avoid translating ColumnExpression back to Expression?
   - Can we unify the translation of RelationPlan and PlanNodeToTableExpression/TableExpressionToPlanNode? The reasons are (1) making sure the translation is consistent and (2) moving towards the long-term goal of deprecating the PlanNodeToTableExpression translation and breaking the RelationPlan translation into AST->IR and IR->PlanNode (TableExpressionToPlanNode).

highker · 2019-02-25T08:36:24Z

@hellium01, as a first step, I think it is safe to separate the first commit ("move signature to SPI") as a separate PR to prevent future rebase. That one looks clean.

rongrong · 2019-02-25T18:42:49Z

@highker @hellium01 Except that it should be FunctionHandle that's moved to SPI rather than Signature after #12360. I'd suggest you wait for that one. Otherwise this (move signature to SPI) will need to be reverted later.

highker

High-level comments regarding commit "Convert ColumnExpression back to Expression":

ColumnExpression is not expressive enough as the IR representation for Expression. Indeed, I feel there is a need to introduce the concept of IR fully. An IR is the augmented representation for AST with metadata info in Reverse Polish notation.
With IR introduced, the refactoring can be much easier. Most Expression operations are just doing disjunction/conjunction decomposition, which can be easily achieved with an IR. I was also thinking of directly using Expression, given that it is deeply rooted in our codebase and does the job that originally belongs to the IR.

cc: @wenleix @rongrong

highker · 2019-02-25T22:54:39Z

@hellium01 Adding to the previous comment:
ColumnExpression is very close to what we want now; we can just extend it a bit. Shouldn't be very hard. BTW, instead of using ColumnExpression or RowExpression, how about calling it Value or ValueExpression? PyTorch (https://github.com/pytorch/pytorch/wiki/PyTorch-IR) uses this term...

highker · 2019-02-26T21:08:20Z

I think I might have a way to avoid explicit translation. The ultimate goals are to:

- Make sure the bi-directional translation is lossless (i.e., no information is lost after a round trip).
- Make sure the bi-directional translation is complete (i.e., every member of PlanNode is mapped).
- Keep future translations (join, union, etc.) scalable (i.e., users do not need to write visitors or rules).

Proposing annotation-based translation

Have a "mapping manager" (e.g., TableExpressionManager or ConnectorPlanNodeManager) to keep the mapping between the two nodes.
Like JsonProperty/JsonCreator, we introduce @PlanNodeCreator, @PlanNodeProperty, etc. Each annotation should have a corresponding function to translate each type. Take the table filter as an example:

@Immutable
@PlanNodeMapping(FilterExpression.class)
public class FilterNode
        extends PlanNode
{
    private final PlanNode source;
    private final Expression predicate;

    @JsonCreator
    @PlanNodeCreator
    public FilterNode(
            @JsonProperty("id") PlanNodeId id,
            @JsonProperty("source") @PlanNodeProperty("source") PlanNode source,
            @JsonProperty("predicate") @PlanNodeProperty("predicate") Expression predicate)
    {
        super(id);

        this.source = source;
        this.predicate = predicate;
    }

    @PlanNodeProperty  // with name or not is optional; it can be inferred from getter.
    @JsonProperty("predicate")
    public Expression getPredicate()
    {
        return predicate;
    }

    @PlanNodeProperty
    @JsonProperty("source")
    public PlanNode getSource()
    {
        return source;
    }
}

@PlanNodeProperty tells connectorPlanNodeManager which field to translate. The translation should be based on types. Each annotated type should have all its inner types translatable. For example, Expression <-> RowExpression is a leaf translation, while PlanNode <-> TableExpression is recursive. Which PlanNodes are translatable depends on what has been annotated with @PlanNodeMapping.

Both FilterNode and FilterExpression should have annotations to denote their mapping (together with the mappings for their member variables).

The benefit of this approach is to shift node-oriented translation to type-oriented translation, which can easily scale up in the future.

highker · 2019-02-27T00:34:49Z

Some more example code as a starting point

@Target({ElementType.TYPE})
@Retention(RetentionPolicy.RUNTIME)
public @interface PlanNodeMapping
{
    Class<? extends TableExpression> value() default UnmappedTableExpression.class;

    final class UnmappedTableExpression
            extends TableExpression
    {
        @Override
        public List<ColumnExpression> getOutput()
        {
            throw new UnsupportedOperationException();
        }
    }
}



public class TestTranslationManager
{
    @Test
    public void testTranslation()
    {
        ConnectorPlanNodeManager manager = new ConnectorPlanNodeManager();
        manager.register(FilterNode.class);
        manager.toTableExpression(new FilterNode(new PlanNodeId("1"), null, new LogicalBinaryExpression(AND, new BooleanLiteral("False"), new BooleanLiteral("true"))));
    }

    public class ConnectorPlanNodeManager
    {
        private final BiMap<Class<? extends PlanNode>, Class<? extends TableExpression>> mapping = HashBiMap.create();

        public ConnectorPlanNodeManager()
        {
        }

        public void register(Class<? extends PlanNode> clazz)
        {
            PlanNodeMapping planNodeMapping = clazz.getAnnotation(PlanNodeMapping.class);
            mapping.put(clazz, planNodeMapping.value());

            // TODO: sanity check to make sure the translation is (1) complete and (2) loss-less
        }

        public TableExpression toTableExpression(PlanNode planNode)
        {
            System.out.println(mapping.get(planNode.getClass()));
            Method[] methods = planNode.getClass().getMethods();

            for (Method method : methods) {
                PlanNodeProperty planNodeProperty = method.getAnnotation(PlanNodeProperty.class);
                if (planNodeProperty != null) {
                    String property = planNodeProperty.value();
                    if (property.equals(USE_DEFAULT_NAME)) {
                        String methodName = method.getName();
                        if (methodName.startsWith("is")) {
                            property = methodName.substring(2);
                            property = Character.toLowerCase(property.charAt(0)) + property.substring(1);
                        }
                        else if (methodName.startsWith("get")) {
                            property = methodName.substring(3);
                            property = Character.toLowerCase(property.charAt(0)) + property.substring(1);
                        }
                        else {
                            throw new IllegalArgumentException();
                        }
                    }
                    Class<?> fromType = method.getReturnType();
                    // TODO: translate
                }
            }

            return null;
        }

        // TODO: put type-to-type translation here
        private ColumnExpression toColumnExpression(Session session, Expression projectionExpression)
        {
            // TODO: put translation here
            return null;
        }
    }
}
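To make the reflection step concrete, here is a reduced, runnable sketch that collects annotated getter values into a name-to-value map, inferring the property name from the getter when the annotation gives none. `SketchProperty`, `SketchFilterNode` and `PropertyReader` are hypothetical stand-ins for @PlanNodeProperty, FilterNode and the manager; the actual type-to-type translation is still left out.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Method;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical stand-in for @PlanNodeProperty; an empty value means "infer from the getter"
@Retention(RetentionPolicy.RUNTIME)
@Target(ElementType.METHOD)
@interface SketchProperty
{
    String value() default "";
}

// Hypothetical node with two annotated getters and one plain getter
final class SketchFilterNode
{
    @SketchProperty
    public String getPredicate()
    {
        return "a > 1";
    }

    @SketchProperty("source")
    public String getSource()
    {
        return "scan";
    }

    public String getId()
    {
        return "1"; // not annotated, so it is skipped
    }
}

final class PropertyReader
{
    // Collects the values of all annotated getters, keyed by property name
    static Map<String, Object> read(Object node)
    {
        Map<String, Object> properties = new TreeMap<>();
        for (Method method : node.getClass().getMethods()) {
            SketchProperty property = method.getAnnotation(SketchProperty.class);
            if (property == null) {
                continue;
            }
            String name = property.value().isEmpty() ? inferName(method.getName()) : property.value();
            try {
                properties.put(name, method.invoke(node));
            }
            catch (ReflectiveOperationException e) {
                throw new IllegalStateException("Cannot read property " + name, e);
            }
        }
        return properties;
    }

    // Strips an "is" or "get" prefix and lower-cases the first remaining character
    static String inferName(String methodName)
    {
        String stripped;
        if (methodName.startsWith("is")) {
            stripped = methodName.substring(2);
        }
        else if (methodName.startsWith("get")) {
            stripped = methodName.substring(3);
        }
        else {
            throw new IllegalArgumentException("Not a getter: " + methodName);
        }
        if (stripped.isEmpty()) {
            throw new IllegalArgumentException("Not a getter: " + methodName);
        }
        return Character.toLowerCase(stripped.charAt(0)) + stripped.substring(1);
    }
}
```

A completeness check could presumably be built on the same loop by comparing the collected names against the constructor's annotated parameters.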

hellium01 · 2019-02-27T03:40:53Z

Thanks for the review. I will start addressing it in the following days once I am back. About the type-to-type translation: the difficulty is that we sometimes have more than one node representing a relation.
For example, in the case of aggregation with grouping sets, we have planNode aggregation -> groupId. GroupId is an implementation detail of the engine that we don't want, or have no need, to expose to the connector.

hellium01 · 2019-02-27T04:16:18Z

Expression carries a lot of semantic information that we never actually use later in the planning phase. The only reason we convert columnExpression (or whatever name we decide on) back to Expression is that PlanNode currently uses Expression. That said, ColumnExpression should be fully convertible back to Expression given enough care, but this means we must always keep this mapping working (today we have to maintain the conversion from Expression to columnExpression anyway).

I think we should provide only the simple relation information to the connector and leave all implementation-related information in the engine. A simpler representation should always be better since it is easier to operate on. Rewriting columnExpression into connector logic will be much easier than rewriting Expression. Suppose that in the future we want to match whether a subtree has a corresponding materialized table: matching a column/table expression (a simple relation) will be much easier than matching PlanNode-mapped expressions, which carry more information that the connector doesn't care about but has to handle. So even if we don't do it at the engine level, the connector will most likely end up doing a translation itself to reduce the complexity it has to operate on and rewrite.

A simpler representation (tableExpression vs planNode) also shields the connector from changes to node types or node structure, which makes connector optimization rules more stable across versions. When planNode changes, the only care needed is to make sure the translation back and forth does not break, which is much easier than making sure no connector has a broken rule.

highker · 2019-02-27T04:28:41Z

Nah, I don't object to a simple representation, especially given that I proposed the annotation-based translator. I'm totally cool with a "view" concept on top of PlanNode. As for how we implement the translation in detail, whether it is a 1:1 or m:n mapping, we can work that out later. Annotations are powerful enough to support m:n mappings as well.

highker · 2019-06-19T23:22:22Z

superseded by #12920

hellium01 added 9 commits February 21, 2019 23:04

Move funtion signature to SPI

174731c

Move RowExpression to SPI

6d13700

Add ColumnReferenceExpression

887edc3

Will add test later when we can convert the column expression back to SQL expression

Convert ColumnExpression back to Expression

fd993fe

Add translation from planNode to tableExpression

3ac8b37

Add connector based optimizer

83698b5

Prevent overriding picked layout and move connector optimzation to later stage

87c3a90

Add checks to prevent uncleaned relation be returned from connector

8751416

Allow use intermediate type as input of aggregation functin

420dd0d

facebook-github-bot added the CLA Signed label Feb 22, 2019

hellium01 requested a review from highker February 22, 2019 18:42

Replace TableExpressionVisitor with rewriter does not require iterating of all subclasses

7d0df09

hellium01 force-pushed the QueryPlanPushdown branch from 22e0466 to 7d0df09 Compare February 22, 2019 19:11

highker self-assigned this Feb 22, 2019

highker reviewed Feb 23, 2019


highker requested changes Feb 23, 2019


highker reviewed Feb 23, 2019


highker reviewed Feb 24, 2019


highker reviewed Feb 25, 2019


highker assigned hellium01 Feb 25, 2019

highker reviewed Feb 25, 2019


highker mentioned this pull request Feb 27, 2019

Pushdown fields to update column handle and implement Parquet nested fields pruning #11454

Closed

hellium01 mentioned this pull request Mar 5, 2019

Move FunctionHandle to SPI #12432

Merged

hellium01 mentioned this pull request Mar 19, 2019

Move row expression to spi #12490

Merged

highker removed their assignment Mar 30, 2019

highker closed this Jun 19, 2019

