Skip to main content

Module source_reader_pipeline

Module source_reader_pipeline 

Source
Expand description

Types related to the creation of dataflow raw sources.

Raw sources are differential dataflow collections of data directly produced by the upstream service. The main export of this module is create_raw_source, which turns RawSourceCreationConfigs into the aforementioned streams.

The full source, which is the differential stream that represents the actual object created by a CREATE SOURCE statement, is created by composing create_raw_source with decoding, SourceEnvelope rendering, and more.

Structs§

RawSourceCreationConfig
Shared configuration information for all source types. This is used in the create_raw_source functions, which produce raw sources.
SourceExportCreationConfig
Reduced version of RawSourceCreationConfig that is used when rendering each export.

Functions§

create_raw_source
Creates a source dataflow operator graph from a source connection. The type of SourceConnection determines the type of connection that should be created.
reclock_committed_upper 🔒
Reclocks an IntoTime frontier stream into a FromTime frontier stream. This is used for the virtual (through persist) feedback edge so that we convert the IntoTime resumption frontier into the FromTime frontier that is used with the source’s OffsetCommiter.
remap_operator 🔒
Mints new contents for the remap shard based on summaries about the source upper it receives from the raw reader operators.
source_render_operator 🔒
Renders the source dataflow fragment from the given SourceConnection. This returns a collection timestamped with the source specific timestamp type.