Skip to main content

ParquetReader<T>

A typed Parquet reader, wrapping the base Parquet reader.

Extends

  • ParquetReader

Type Parameters

Type Parameter
T extends ParquetRecordLike

Implements

  • AsyncDisposable

Constructors

new ParquetReader()

new ParquetReader<T>(
metadata,
envelopeReader,
opts?): ParquetReader<T>

Create a new parquet reader from the file metadata and an envelope reader. It is not recommended to call this constructor directly except for advanced and internal use cases. Consider using one of the open methods instead

Parameters

ParameterType
metadataFileMetaDataExt
envelopeReaderParquetEnvelopeReader
opts?BufferReaderOptions

Returns

ParquetReader<T>

Inherited from

BaseParquetReader.constructor

Properties

envelopeReader

envelopeReader: null | ParquetEnvelopeReader;

Inherited from

BaseParquetReader.envelopeReader


metadata

metadata: null | FileMetaDataExt;

Inherited from

BaseParquetReader.metadata


schema

schema: ParquetSchema<T>;

Overrides

BaseParquetReader.schema

Methods

[asyncDispose]()

asyncDispose: Promise<void>

Returns

Promise<void>

Implementation of

AsyncDisposable.[asyncDispose]


[asyncIterator]()

asyncIterator: AsyncGenerator<T, void, unknown>

Returns

AsyncGenerator<T, void, unknown>

Overrides

BaseParquetReader.[asyncIterator]


close()

close(): Promise<void>

Close this parquet reader. You MUST call this method once you're finished reading rows

Returns

Promise<void>

Inherited from

BaseParquetReader.close


decodePages()

decodePages(buffer, opts): Promise<PageData>

Parameters

ParameterType
bufferBuffer
optsOptions

Returns

Promise<PageData>

Inherited from

BaseParquetReader.decodePages


dispose()

dispose(): Promise<void>

Returns

Promise<void>


exportMetadata()

exportMetadata(indent): Promise<string>

Parameters

ParameterType
indentundefined | string | number

Returns

Promise<string>

Inherited from

BaseParquetReader.exportMetadata


getBloomFiltersFor()

getBloomFiltersFor(columnNames): Promise<Record<string, object[]>>

Parameters

ParameterType
columnNamesstring[]

Returns

Promise<Record<string, object[]>>

Inherited from

BaseParquetReader.getBloomFiltersFor


getCursor()

getCursor(columnList?): ParquetCursor

Return a cursor to the file. You may open more than one cursor and use them concurrently. All cursors become invalid once close() is called on the reader object.

The required_columns parameter controls which columns are actually read from disk. An empty array or no value implies all columns. A list of column names means that only those columns should be loaded from disk.

Parameters

ParameterType
columnList?unknown[][]

Returns

ParquetCursor

Inherited from

BaseParquetReader.getCursor


getMetadata()

getMetadata(): Record<string, unknown>

Returns the user (key/value) metadata for this file

Returns

Record<string, unknown>

Inherited from

BaseParquetReader.getMetadata


getRowCount()

getRowCount(): Int64

Return the number of rows in this file. Note that the number of rows is not necessarily equal to the number of rows in each column.

Returns

Int64

Inherited from

BaseParquetReader.getRowCount


getSchema()

getSchema(): ParquetSchema

Returns the ParquetSchema for this file

Returns

ParquetSchema

Inherited from

BaseParquetReader.getSchema


openBuffer()

static openBuffer<T>(buffer, options?): Promise<ParquetReader<T>>

Type Parameters

Type Parameter
T extends ParquetRecordLike

Parameters

ParameterType
bufferBuffer
options?BufferReaderOptions

Returns

Promise<ParquetReader<T>>

Overrides

BaseParquetReader.openBuffer


openEnvelopeReader()

static openEnvelopeReader<T>(envelopeReader, opts?): Promise<ParquetReader<T>>

Type Parameters

Type Parameter
T extends ParquetRecordLike

Parameters

ParameterType
envelopeReaderParquetEnvelopeReader
opts?BufferReaderOptions

Returns

Promise<ParquetReader<T>>

Overrides

BaseParquetReader.openEnvelopeReader


openFile()

static openFile<T>(filePath, options?): Promise<ParquetReader<T>>

Open the parquet file pointed to by the specified path and return a new parquet reader

Type Parameters

Type Parameter
T extends ParquetRecordLike

Parameters

ParameterType
filePathPathBuilderLike
options?BufferReaderOptions

Returns

Promise<ParquetReader<T>>

Overrides

BaseParquetReader.openFile


openS3()

static openS3(
client,
params,
options?): Promise<ParquetReader>

Open the parquet file from S3 using the supplied aws client [, commands] and params The params have to include Bucket and Key to the file requested, If using v3 of the AWS SDK, combine the client and commands into an object wiht keys matching the original module names, and do not instantiate the commands; pass them as classes/modules.

This function returns a new parquet reader [ or throws an Error.]

Parameters

ParameterType
clientany
paramsClientParameters
options?BufferReaderOptions

Returns

Promise<ParquetReader>

Inherited from

BaseParquetReader.openS3


openUrl()

static openUrl(params, options?): Promise<ParquetReader>

Open the parquet file from a url using the supplied request module params should either be a string (url) or an object that includes a url property. This function returns a new parquet reader

Parameters

ParameterType
paramsstring | URL | Parameter
options?BufferReaderOptions

Returns

Promise<ParquetReader>

Inherited from

BaseParquetReader.openUrl