Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scaffolding to support generating arbitrary filter pipelines #23

Merged
merged 15 commits into from
Mar 25, 2024

Conversation

rroelke
Copy link
Member

@rroelke rroelke commented Mar 22, 2024

The missing piece from Attribute::eq was the filter pipeline. To get this in there with the testing infrastructure we have and/or will need, we need to do Debug, Eq, Serialize and Deserialize for FilterList and Filter.

That turns out to require a bunch of things:

  • to keep the sys crate with minimal dependencies, I moved Datatype to the api crate where it can #[derive(Deserialize, Serialize)]. This also is a big step towards keeping ffi:: out of the public APIs.
  • not strictly a requirement, but the enum FilterData is really useful because it can #[derive(PartialEq)] which helps prevent us developers from making mistakes. Now those mistakes can live in Filter::create and/or Filter::filter_data instead.
  • FilterData::transform_datatype is an internal C API, but we need to use it for property-based testing to make sure that we construct valid filter pipelines. The output of each filter in the pipeline needs to be a valid input to the next one.

@rroelke rroelke requested a review from davisp March 22, 2024 19:50
@rroelke
Copy link
Member Author

rroelke commented Mar 22, 2024

This doesn't quite work yet, the Attribute proptest is failing to construct because the pipeline viability code I wrote is not quite right.

Copy link
Collaborator

@davisp davisp left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Whoa. Yeah, that' turned out way better than I was expecting when we discussed possible approaches on the Thursday call. It really shines when looking at the proptest definitions as well.

Most things I had were either minor style nits or general questions that don't need answering for this PR. The only real thing I saw was that c_uchar vs c_uint question.

Otherwise, +1 once CI is green.

}
if let Some(reinterpret_datatype) = reinterpret_datatype {
let c_datatype =
reinterpret_datatype.capi_enum() as std::ffi::c_uchar;
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is std::ffi::c_uchar correct here? I would have expected u32 or std::ffi::c_uint.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Surprisingly yes - it expects just one byte inside of libtiledb

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Well that's awkward. In a note to future selves, I tracked this down to the fact that most of our uses of Datatype refelect the public C API typedef enum { ... } tiledb_datatype_t definition which is apparently backed by a u32 according to bindgen. However, the Filter::set_option pipes the void* all the way to CompressionFilter::set_option_impl which is casting it to a tiledb::sm::Datatype which is enum class Datatype : uint8_t { ... } definition which means it really does want a single byte ini this case.

@@ -67,6 +68,12 @@ impl<'ctx> FilterList<'ctx> {
}
}

pub fn to_vec(&self) -> TileDBResult<Vec<Filter<'ctx>>> {
(0..self.get_num_filters()?)
.map(|f| self.get_filter(f))
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Huh. TIL that collect() returns the first error or the whole list. That's awesome!

@rroelke rroelke merged commit db2c1e7 into main Mar 25, 2024
1 check passed
@rroelke rroelke deleted the rr/attribute-PartialEq branch March 25, 2024 15:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants