-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement conversion between TIleDB Schema and Arrow Schema #31
Conversation
This pull request has been linked to Shortcut Story #43826: Invertible function to convert between TileDB Schema and Arrow Schema. |
use tiledb::{error::Error as TileDBError, fn_typed, Result as TileDBResult}; | ||
|
||
use crate::datatype::{arrow_type_physical, tiledb_type_physical}; | ||
|
||
/// Encapsulates TileDB filter data for storage in Arrow Field metadata | ||
#[derive(Deserialize, Serialize)] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I moved this to its own module arrow/src/filter.rs
.
use proptest::prelude::*; | ||
|
||
#[test] | ||
fn test_tiledb_arrow_tiledb() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In the attribute
module we also have test_arrow_tiledb_arrow
. For attribute the tiledb metadata is optional, but for dimension it is required, so it's much harder to write that test.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1
DimensionKey::Index(idx) => Ok(idx < self.ndim()), | ||
DimensionKey::Name(name) => { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oooh! Nice, this is a pattern to remember to avoid having to have multiple functions in a lot of the places where we allow lookups by index or name.
This pull request implements functions to convert between TileDB schema and Arrow schema.
As in #28, we use a serializable objects to store details about the TileDB schema and/or dimension which do not have a natural field in the Arrow Schema.
There aren't any notable new designs here, just lots of details which were missing including several
PartialEq
implementations, some fields fromDebug
, someffi
methods which were not implemented yet, and some updates to the proptest strategies. Among the missing methods were the Schema's filter list setters and getters - these depend on the Domain to generate sound values, so I was unable to add them to proptest as we have discussed as length in Slack.Resolves sc#43826.