Enum MirRelationExpr

Help

pub enum MirRelationExpr {
Show 15 variants    Constant {
        rows: Result<Vec<(Row, Diff)>, EvalError>,
        typ: RelationType,
    },
    Get {
        id: Id,
        typ: RelationType,
        access_strategy: AccessStrategy,
    },
    Let {
        id: LocalId,
        value: Box<MirRelationExpr>,
        body: Box<MirRelationExpr>,
    },
    LetRec {
        ids: Vec<LocalId>,
        values: Vec<MirRelationExpr>,
        limits: Vec<Option<LetRecLimit>>,
        body: Box<MirRelationExpr>,
    },
    Project {
        input: Box<MirRelationExpr>,
        outputs: Vec<usize>,
    },
    Map {
        input: Box<MirRelationExpr>,
        scalars: Vec<MirScalarExpr>,
    },
    FlatMap {
        input: Box<MirRelationExpr>,
        func: TableFunc,
        exprs: Vec<MirScalarExpr>,
    },
    Filter {
        input: Box<MirRelationExpr>,
        predicates: Vec<MirScalarExpr>,
    },
    Join {
        inputs: Vec<MirRelationExpr>,
        equivalences: Vec<Vec<MirScalarExpr>>,
        implementation: JoinImplementation,
    },
    Reduce {
        input: Box<MirRelationExpr>,
        group_key: Vec<MirScalarExpr>,
        aggregates: Vec<AggregateExpr>,
        monotonic: bool,
        expected_group_size: Option<u64>,
    },
    TopK {
        input: Box<MirRelationExpr>,
        group_key: Vec<usize>,
        order_key: Vec<ColumnOrder>,
        limit: Option<MirScalarExpr>,
        offset: usize,
        monotonic: bool,
        expected_group_size: Option<u64>,
    },
    Negate {
        input: Box<MirRelationExpr>,
    },
    Threshold {
        input: Box<MirRelationExpr>,
    },
    Union {
        base: Box<MirRelationExpr>,
        inputs: Vec<MirRelationExpr>,
    },
    ArrangeBy {
        input: Box<MirRelationExpr>,
        keys: Vec<Vec<MirScalarExpr>>,
    },
}

Expand description

An abstract syntax tree which defines a collection.

The AST is meant to reflect the capabilities of the differential_dataflow::Collection type, written generically enough to avoid run-time compilation work.

derived_hash_with_manual_eq was complaining for the wrong reason: This lint exists because it’s bad when Eq doesn’t agree with Hash, which is often quite likely if one of them is implemented manually. However, our manual implementation of Eq will agree with the derived one. This is because the reason for the manual implementation is not to change the semantics from the derived one, but to avoid stack overflows.

Variants§

§

Constant

A constant relation containing specified rows.

The runtime memory footprint of this operator is zero.

When you would like to pattern match on this, consider using MirRelationExpr::as_const instead, which looks behind ArrangeBys. You might want this matching behavior because constant folding doesn’t remove ArrangeBys.

Fields

§rows: Result<Vec<(Row, Diff)>, EvalError>

Rows of the constant collection and their multiplicities.

§typ: RelationType

Schema of the collection.

§

Get

Get an existing dataflow.

The runtime memory footprint of this operator is zero.

Fields

§id: Id

The identifier for the collection to load.

§typ: RelationType

Schema of the collection.

§access_strategy: AccessStrategy

If this is a global Get, this will indicate whether we are going to read from Persist or from an index, or from a different object in objects_to_build. If it’s an index, then how downstream dataflow operations will use this index is also recorded. This is filled by prune_and_annotate_dataflow_index_imports. Note that this is not used by the lowering to LIR, but is used only by EXPLAIN.

§

Let

Introduce a temporary dataflow.

The runtime memory footprint of this operator is zero.

Fields

§id: LocalId

The identifier to be used in Get variants to retrieve value.

§value: Box<MirRelationExpr>

The collection to be bound to id.

§body: Box<MirRelationExpr>

The result of the Let, evaluated with id bound to value.

§

LetRec

Introduce mutually recursive bindings.

Each LocalId is immediately bound to an initially empty collection with the type of its corresponding MirRelationExpr. Repeatedly, each binding is evaluated using the current contents of each other binding, and is refreshed to contain the new evaluation. This process continues through all bindings, and repeats as long as changes continue to occur.

The resulting value of the expression is body evaluated once in the context of the final iterates.

A zero-binding instance can be replaced by body. A single-binding instance is equivalent to MirRelationExpr::Let.

The runtime memory footprint of this operator is zero.

Fields

§ids: Vec<LocalId>

The identifiers to be used in Get variants to retrieve each value.

§values: Vec<MirRelationExpr>

The collections to be bound to each id.

§limits: Vec<Option<LetRecLimit>>

Maximum number of iterations, after which we should artificially force a fixpoint. (Whether we error or just stop is configured by LetRecLimit::return_at_limit.) The per-LetRec limit that the user specified is initially copied to each binding to accommodate slicing and merging of LetRecs in MIR transforms (e.g., NormalizeLets).

§body: Box<MirRelationExpr>

The result of the Let, evaluated with id bound to value.

§

Project

Project out some columns from a dataflow

The runtime memory footprint of this operator is zero.

Fields

§input: Box<MirRelationExpr>

The source collection.

§outputs: Vec<usize>

Indices of columns to retain.

§

Map

Append new columns to a dataflow

The runtime memory footprint of this operator is zero.

Fields

§input: Box<MirRelationExpr>

The source collection.

§scalars: Vec<MirScalarExpr>

Expressions which determine values to append to each row. An expression may refer to columns in input or expressions defined earlier in the vector

§

FlatMap

Like Map, but yields zero-or-more output rows per input row

The runtime memory footprint of this operator is zero.

Fields

§input: Box<MirRelationExpr>

The source collection

§func: TableFunc

The table func to apply

§exprs: Vec<MirScalarExpr>

The argument to the table func

§

Filter

Keep rows from a dataflow where all the predicates are true

The runtime memory footprint of this operator is zero.

Fields

§input: Box<MirRelationExpr>

The source collection.

§predicates: Vec<MirScalarExpr>

Predicates, each of which must be true.

§

Join

Join several collections, where some columns must be equal.

For further details consult the documentation for MirRelationExpr::join.

The runtime memory footprint of this operator can be proportional to the sizes of all inputs and the size of all joins of prefixes. This may be reduced due to arrangements available at rendering time.

Fields

§inputs: Vec<MirRelationExpr>

A sequence of input relations.

§equivalences: Vec<Vec<MirScalarExpr>>

A sequence of equivalence classes of expressions on the cross product of inputs.

Each equivalence class is a list of scalar expressions, where for each class the intended interpretation is that all evaluated expressions should be equal.

Each scalar expression is to be evaluated over the cross-product of all records from all inputs. In many cases this may just be column selection from specific inputs, but more general cases exist (e.g. complex functions of multiple columns from multiple inputs, or just constant literals).

§implementation: JoinImplementation

Join implementation information.

§

Reduce

Group a dataflow by some columns and aggregate over each group

The runtime memory footprint of this operator is at most proportional to the number of distinct records in the input and output. The actual requirements can be less: the number of distinct inputs to each aggregate, summed across each aggregate, plus the output size. For more details consult the code that builds the associated dataflow.

Fields

§input: Box<MirRelationExpr>

The source collection.

§group_key: Vec<MirScalarExpr>

Column indices used to form groups.

§aggregates: Vec<AggregateExpr>

Expressions which determine values to append to each row, after the group keys.

§monotonic: bool

True iff the input is known to monotonically increase (only addition of records).

§expected_group_size: Option<u64>

User hint: expected number of values per group key. Used to optimize physical rendering.

§

TopK

Groups and orders within each group, limiting output.

The runtime memory footprint of this operator is proportional to its input and output.

Fields

§input: Box<MirRelationExpr>

The source collection.

§group_key: Vec<usize>

Column indices used to form groups.

§order_key: Vec<ColumnOrder>

Column indices used to order rows within groups.

§limit: Option<MirScalarExpr>

Number of records to retain

§offset: usize

Number of records to skip

§monotonic: bool

True iff the input is known to monotonically increase (only addition of records).

§expected_group_size: Option<u64>

User-supplied hint: how many rows will have the same group key.

§

Negate

Return a dataflow where the row counts are negated

The runtime memory footprint of this operator is zero.

Fields

§input: Box<MirRelationExpr>

The source collection.

§

Threshold

Keep rows from a dataflow where the row counts are positive

The runtime memory footprint of this operator is proportional to its input and output.

Fields

§input: Box<MirRelationExpr>

The source collection.

§

Union

Adds the frequencies of elements in contained sets.

The runtime memory footprint of this operator is zero.

Fields

§base: Box<MirRelationExpr>

A source collection.

§inputs: Vec<MirRelationExpr>

Source collections to union.

§

ArrangeBy

Technically a no-op. Used to render an index. Will be used to optimize queries on finer grain. Each keys item represents a different index that should be produced from the keys.

The runtime memory footprint of this operator is proportional to its input.

Fields

§input: Box<MirRelationExpr>

The source collection

§keys: Vec<Vec<MirScalarExpr>>

Columns to arrange input by, in order of decreasing primacy

Enum MirRelationExprCopy item path

Variants§

Constant

Fields

Get

Fields

Let

Fields

LetRec

Fields

Project

Fields

Map

Fields

FlatMap

Fields

Filter

Fields

Join

Fields

Reduce

Fields

TopK

Fields

Negate

Fields

Threshold

Fields

Union

Fields

ArrangeBy

Fields

Implementations§

impl MirRelationExpr

pub fn typ(&self) -> RelationType

pub fn typ_with_input_types(&self, input_types: &[RelationType]) -> RelationType

pub fn col_with_input_cols<'a, I>(&self, input_types: I) -> Vec<ColumnType>where I: Iterator<Item = &'a Vec<ColumnType>>,

pub fn try_col_with_input_cols<'a, I>( &self, input_types: I, ) -> Result<Vec<ColumnType>, String>where I: Iterator<Item = &'a Vec<ColumnType>>,

pub fn keys_with_input_keys<'a, I, J>( &self, input_arities: I, input_keys: J, ) -> Vec<Vec<usize>>where I: Iterator<Item = usize>, J: Iterator<Item = &'a Vec<Vec<usize>>>,

pub fn arity(&self) -> usize

pub fn arity_with_input_arities<I>(&self, input_arities: I) -> usizewhere I: Iterator<Item = usize>,

pub fn num_inputs(&self) -> usize

pub fn constant(rows: Vec<Vec<Datum<'_>>>, typ: RelationType) -> Self

pub fn constant_diff( rows: Vec<(Vec<Datum<'_>>, Diff)>, typ: RelationType, ) -> Self

pub fn as_const( &self, ) -> Option<(&Result<Vec<(Row, Diff)>, EvalError>, &RelationType)>

pub fn as_const_mut( &mut self, ) -> Option<(&mut Result<Vec<(Row, Diff)>, EvalError>, &mut RelationType)>

pub fn as_const_err(&self) -> Option<&EvalError>

pub fn is_constant_singleton(&self) -> bool

pub fn local_get(id: LocalId, typ: RelationType) -> Self

pub fn global_get(id: GlobalId, typ: RelationType) -> Self

pub fn project(self, outputs: Vec<usize>) -> Self

pub fn map(self, scalars: Vec<MirScalarExpr>) -> Self

pub fn map_one(self, scalar: MirScalarExpr) -> Self

pub fn flat_map(self, func: TableFunc, exprs: Vec<MirScalarExpr>) -> Self

pub fn filter<I>(self, predicates: I) -> Selfwhere I: IntoIterator<Item = MirScalarExpr>,

pub fn product(self, right: Self) -> Self

pub fn join( inputs: Vec<MirRelationExpr>, variables: Vec<Vec<(usize, usize)>>, ) -> Self

§Example

pub fn join_scalars( inputs: Vec<MirRelationExpr>, equivalences: Vec<Vec<MirScalarExpr>>, ) -> Self

pub fn reduce( self, group_key: Vec<usize>, aggregates: Vec<AggregateExpr>, expected_group_size: Option<u64>, ) -> Self

pub fn top_k( self, group_key: Vec<usize>, order_key: Vec<ColumnOrder>, limit: Option<MirScalarExpr>, offset: usize, expected_group_size: Option<u64>, ) -> Self

pub fn negate(self) -> Self

pub fn distinct(self) -> Self

pub fn distinct_by(self, group_key: Vec<usize>) -> Self

pub fn threshold(self) -> Self

pub fn union_many(inputs: Vec<Self>, typ: RelationType) -> Self

pub fn union(self, other: Self) -> Self

pub fn arrange_by(self, keys: &[Vec<MirScalarExpr>]) -> Self

pub fn is_empty(&self) -> bool

pub fn is_negated_project(&self) -> Option<(&MirRelationExpr, &[usize])>

pub fn pretty(&self) -> String

pub fn explain( &self, config: &ExplainConfig, humanizer: Option<&dyn ExprHumanizer>, ) -> String

pub fn take_safely(&mut self, typ: Option<RelationType>) -> MirRelationExpr

pub fn take_safely_with_col_types( &mut self, typ: Vec<ColumnType>, ) -> MirRelationExpr

pub fn take_dangerous(&mut self) -> MirRelationExpr

pub fn replace_using<F>(&mut self, logic: F)where F: FnOnce(MirRelationExpr) -> MirRelationExpr,

pub fn let_in<Body, E>( self, id_gen: &mut IdGen, body: Body, ) -> Result<MirRelationExpr, E>where Body: FnOnce(&mut IdGen, MirRelationExpr) -> Result<MirRelationExpr, E>,

pub fn anti_lookup<E>( self, id_gen: &mut IdGen, keys_and_values: MirRelationExpr, default: Vec<(Datum<'_>, ScalarType)>, ) -> Result<MirRelationExpr, E>

pub fn lookup<E>( self, id_gen: &mut IdGen, keys_and_values: MirRelationExpr, default: Vec<(Datum<'static>, ScalarType)>, ) -> Result<MirRelationExpr, E>

pub fn contains_temporal(&self) -> bool

Enum MirRelationExpr

pub fn col_with_input_cols<'a, I>(&self, input_types: I) -> Vec<ColumnType>
where I: Iterator<Item = &'a Vec<ColumnType>>,

pub fn try_col_with_input_cols<'a, I>( &self, input_types: I, ) -> Result<Vec<ColumnType>, String>
where I: Iterator<Item = &'a Vec<ColumnType>>,

pub fn keys_with_input_keys<'a, I, J>( &self, input_arities: I, input_keys: J, ) -> Vec<Vec<usize>>
where I: Iterator<Item = usize>, J: Iterator<Item = &'a Vec<Vec<usize>>>,

pub fn arity_with_input_arities<I>(&self, input_arities: I) -> usize
where I: Iterator<Item = usize>,

pub fn filter<I>(self, predicates: I) -> Self
where I: IntoIterator<Item = MirScalarExpr>,

pub fn replace_using<F>(&mut self, logic: F)
where F: FnOnce(MirRelationExpr) -> MirRelationExpr,

pub fn let_in<Body, E>( self, id_gen: &mut IdGen, body: Body, ) -> Result<MirRelationExpr, E>
where Body: FnOnce(&mut IdGen, MirRelationExpr) -> Result<MirRelationExpr, E>,

pub fn try_visit_scalars_mut1<F, E>(&mut self, f: &mut F) -> Result<(), E>
where F: FnMut(&mut MirScalarExpr) -> Result<(), E>,

pub fn try_visit_scalars_mut<F, E>(&mut self, f: &mut F) -> Result<(), E>
where F: FnMut(&mut MirScalarExpr) -> Result<(), E>, E: From<RecursionLimitError>,

pub fn visit_scalars_mut<F>(&mut self, f: &mut F)
where F: FnMut(&mut MirScalarExpr),

pub fn try_visit_scalars_1<F, E>(&self, f: &mut F) -> Result<(), E>
where F: FnMut(&MirScalarExpr) -> Result<(), E>,

pub fn try_visit_scalars<F, E>(&self, f: &mut F) -> Result<(), E>
where F: FnMut(&MirScalarExpr) -> Result<(), E>, E: From<RecursionLimitError>,

pub fn visit_scalars<F>(&self, f: &mut F)
where F: FnMut(&MirScalarExpr),

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,