You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It assumes that field.children is an iterable and, to my understanding, this assumption is usually warranted as for the last elements, when the leaves of the tree are reached, field.children is Arrow.Flatbuf.Field[]. However, we have observed that when Arrow files are written in C# field.children === nothing and trying to iterate throws an error. I can share an example file.
I could fix this in my deved, local Arrow.jl by simply conditioning the for loop:
if field.children !==nothingfor child in field.children
getdictionaries!(dictencoded, child)
endend
I am not sure if this is the right fix but I am opening this issue here because I am more familiar with the Arrow implementation in Julia and I think it is reasonable to be more defensive. But, as reading the same data works once the file is imported-exported with Python/pyarrow, my guess is that it is the C# implementation of Arrow that is doing something fishy here. Maybe this can also be fixed at the site where Julia produces the nothing when translating the Arrow types?
The text was updated successfully, but these errors were encountered:
that looks like a good fix to me; if you make a PR, that'd be great. If you have an example arrow file that exhibits the problem that we can use to test it with, even better.
Closes#375
This PR fixes a problem in reading an Arrow IPC file generated by C#
Apache.Arrow 10.0.1 where the last recursive iteration to get the
dictionaries returns `nothing` instead of an empty iterable.
The problem I have encountered is due to this for loop:
arrow-julia/src/table.jl
Line 387 in 528c3ba
It assumes that
field.children
is an iterable and, to my understanding, this assumption is usually warranted as for the last elements, when the leaves of the tree are reached,field.children
isArrow.Flatbuf.Field[]
. However, we have observed that when Arrow files are written in C#field.children === nothing
and trying to iterate throws an error. I can share an example file.I could fix this in my
dev
ed, local Arrow.jl by simply conditioning the for loop:I am not sure if this is the right fix but I am opening this issue here because I am more familiar with the Arrow implementation in Julia and I think it is reasonable to be more defensive. But, as reading the same data works once the file is imported-exported with Python/pyarrow, my guess is that it is the C# implementation of Arrow that is doing something fishy here. Maybe this can also be fixed at the site where Julia produces the
nothing
when translating the Arrow types?The text was updated successfully, but these errors were encountered: