API to perform page-level filtering (also known as indexes)
Structs§
- ColumnPageStatistics: contains the minimum, maximum, and null_count of each page of a parquet column, as an Array. This struct upholds a set of invariants.
- An enum describing a page that was either selected in a filter pushdown or skipped
- An interval
Enums§
- Page statistics of an Arrow field.
Functions§
- Returns the set of (row) intervals of the pages.
- Checks whether the row group has page index information (page statistics)
- Reads the column indexes from the reader, assuming a valid set of derived Arrow fields for all the parquet columns in the file.
- Reads all page locations and index locations (IO-bound) and uses predicate to compute the set of FilteredPage that fulfill the predicate.
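The idea behind page-level filtering can be sketched without this module's actual types. The example below is a minimal, self-contained illustration, not this crate's API: `RowInterval`, `PageStats`, `page_intervals`, and `select_pages` are hypothetical names, and the statistics are assumed to describe a single `i64` column. It derives the row interval of each page from the page row counts, then keeps only the pages whose minimum and maximum allow the predicate to match.

```rust
/// Hypothetical stand-in for a row interval: `length` rows starting at `start`.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct RowInterval {
    pub start: usize,
    pub length: usize,
}

/// Hypothetical per-page statistics for an `i64` column.
#[derive(Debug, Clone, Copy)]
pub struct PageStats {
    pub min: i64,
    pub max: i64,
    pub num_rows: usize,
}

/// Computes the row interval covered by each page, in page order.
pub fn page_intervals(pages: &[PageStats]) -> Vec<RowInterval> {
    let mut start = 0;
    pages
        .iter()
        .map(|page| {
            let interval = RowInterval { start, length: page.num_rows };
            // The next page begins where this one ends.
            start += page.num_rows;
            interval
        })
        .collect()
}

/// Keeps the intervals of pages whose `[min, max]` range may contain a match.
/// `may_match` receives the page minimum and maximum (an assumed predicate shape).
pub fn select_pages<F>(pages: &[PageStats], may_match: F) -> Vec<RowInterval>
where
    F: Fn(i64, i64) -> bool,
{
    page_intervals(pages)
        .into_iter()
        .zip(pages)
        .filter(|(_, page)| may_match(page.min, page.max))
        .map(|(interval, _)| interval)
        .collect()
}
```

For a predicate such as `value >= 15`, a caller would pass `|_min, max| max >= 15`: a page whose maximum is below 15 cannot contain a matching row, so it is skipped without being decoded. Pages that pass this check may still contain non-matching rows, so the predicate must be applied again to the decoded values.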