Conversation

@StephanEwen
Contributor

No description provided.

StephanEwen added a commit to StephanEwen/flink that referenced this pull request Nov 19, 2014
@asfgit asfgit closed this in ce822bf Nov 19, 2014
@StephanEwen StephanEwen deleted the compiler_fix branch November 24, 2014 23:40
zhijiangW pushed a commit to zhijiangW/flink that referenced this pull request Jul 23, 2019
Submitting the markup file for a blog post by Andrey and Fabian about state TTL.

This closes apache#216
HuangZhenQiu pushed a commit to HuangZhenQiu/flink that referenced this pull request Sep 20, 2022
tharvey5 added a commit to tharvey5/flink that referenced this pull request Jun 25, 2025
Enables storage-partitioned join (SPJ) to work end to end with a
`DynamicTableSource`, such as an Iceberg table, that supports partitioning.

- Calls `applyPartitionedRead` from the Flink planner to notify any table
source that implements `SupportsPartitioning` that SPJ can be used
- Improves the `isPartitionedBy` check, which now accounts for filter
pushdown optimizations when determining whether SPJ can be applied
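
To illustrate the kind of reasoning the `isPartitionedBy` check has to do once filters are pushed down, here is a minimal, self-contained sketch. It is not the code from this change: the class `SpjEligibilitySketch`, the method signature, and the rule itself (every partition key must be either a join key or pinned to a constant by a pushed-down filter) are assumptions made for illustration only.

```java
import java.util.List;
import java.util.Set;

// Hypothetical sketch, NOT Flink's actual isPartitionedBy implementation.
public class SpjEligibilitySketch {

    /**
     * Assumed rule: a table counts as "partitioned by" the join keys if every
     * partition key is either a join key or fixed to a single value by a
     * pushed-down filter, and at least one partition key is a join key.
     */
    public static boolean isPartitionedBy(
            List<String> partitionKeys,
            Set<String> joinKeys,
            Set<String> keysPinnedByFilter) {
        if (partitionKeys.isEmpty()) {
            return false; // unpartitioned table: nothing to align on
        }
        boolean usesJoinKey = false;
        for (String key : partitionKeys) {
            if (joinKeys.contains(key)) {
                usesJoinKey = true;
            } else if (!keysPinnedByFilter.contains(key)) {
                // Partition key is neither joined on nor constant.
                return false;
            }
        }
        return usesJoinKey;
    }

    public static void main(String[] args) {
        List<String> partitionKeys = List.of("dt", "user_id");

        // Join on both partition columns, no filter.
        System.out.println(
                isPartitionedBy(partitionKeys, Set.of("dt", "user_id"), Set.of()));

        // Join on user_id only, dt pinned by a filter such as dt = '2025-05-01'.
        System.out.println(
                isPartitionedBy(partitionKeys, Set.of("user_id"), Set.of("dt")));

        // Join on user_id only, dt unconstrained.
        System.out.println(
                isPartitionedBy(partitionKeys, Set.of("user_id"), Set.of()));
    }
}
```

Under this assumed rule, the first two calls report that the table is compatibly partitioned and the third does not, because `dt` is neither a join key nor fixed by a filter.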

**The following queries DO utilize SPJ**
```
select count(*) from iceberg.db.simple_bucketed as table1 join iceberg.db.simple_bucketed_2 as table2 on table1.user_id = table2.user_id2
```

```
select count(*) from iceberg.db.user_id_two_partition_cols t1 join iceberg.db.user_id_two_partition_cols_2 t2 on t1.dt = t2.dt and t1.user_id = t2.user_id
```

```
select count(*) from iceberg.db.user_id_two_partition_cols t1 join iceberg.db.user_id_two_partition_cols_2 t2 on t1.dt = t2.dt and t1.user_id = t2.user_id where t1.dt = '2025-05-01' and t2.dt = '2025-05-01'
```

**The following query does NOT utilize SPJ**

```
select count(*) from iceberg.db.simple_bucketed as table1 join iceberg.db.simple_bucketed_32 as table2 on table1.user_id = table2.user_id2
```

All query results came back as expected when compared with Spark/Presto.

- Dependencies (does it add or upgrade a dependency): (yes / **no**)
- The public API, i.e., is any changed class annotated with `@Public(Evolving)`: (yes / **no**)
- The serializers: (yes / **no** / don't know)
- The runtime per-record code paths (performance sensitive): (yes / **no** / don't know)
- Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Kubernetes/Yarn, ZooKeeper: (yes / **no** / don't know)
- The S3 file system connector: (yes / **no** / don't know)

- Does this pull request introduce a new feature? (yes / **no**)
- If yes, how is the feature documented? (not applicable / docs / JavaDocs / not documented)