Skip to content

Conversation

@robert3005
Copy link

The use case is that I want to reuse existing listing of files and avoid doing it again when opening streams. This is in case where filesystem.open is expensive but you have other means of obtaining input stream for a file.

@robert3005 robert3005 force-pushed the robertk/allow-reading-footers-from-streams branch from c93a22d to b64c890 Compare July 31, 2016 01:45
@robert3005 robert3005 changed the title allow reading footers from provided file listing and streams PARQUET-669 allow reading footers from provided file listing and streams Jul 31, 2016
@robert3005 robert3005 force-pushed the robertk/allow-reading-footers-from-streams branch from b64c890 to 22bc0fa Compare July 31, 2016 22:12
@robert3005 robert3005 changed the title PARQUET-669 allow reading footers from provided file listing and streams PARQUET-669: allow reading footers from provided file listing and streams Jul 31, 2016
* @return the metadata blocks in the footer
* @throws IOException if an error occurs while reading the file
*/
public static final ParquetMetadata readFooter(FileStatus file, FSDataInputStream f, MetadataFilter filter) throws IOException {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We only need need the file size here not the full FileStatus object.
(since we're making this public)

@robert3005 robert3005 force-pushed the robertk/allow-reading-footers-from-streams branch from 22bc0fa to 4d8a54c Compare August 2, 2016 21:48
@julienledem
Copy link
Member

LGTM

@asfgit asfgit closed this in 76a2ac8 Aug 3, 2016
rdblue pushed a commit to rdblue/parquet-mr that referenced this pull request Jan 6, 2017
…eams

The use case is that I want to reuse existing listing of files and avoid doing it again when opening streams. This is in case where filesystem.open is expensive but you have other means of obtaining input stream for a file.

Author: Robert Kruszewski <[email protected]>

Closes apache#357 from robert3005/robertk/allow-reading-footers-from-streams and squashes the following commits:

4d8a54c [Robert Kruszewski] allow reading footers from provided file listing and streams
rdblue pushed a commit to rdblue/parquet-mr that referenced this pull request Jan 10, 2017
…eams

The use case is that I want to reuse existing listing of files and avoid doing it again when opening streams. This is in case where filesystem.open is expensive but you have other means of obtaining input stream for a file.

Author: Robert Kruszewski <[email protected]>

Closes apache#357 from robert3005/robertk/allow-reading-footers-from-streams and squashes the following commits:

4d8a54c [Robert Kruszewski] allow reading footers from provided file listing and streams
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants