Types related to the creation of dataflow raw sources.
Raw sources are differential dataflow collections of data directly produced by the
upstream service. The main export of this module is create_raw_source,
which turns RawSourceCreationConfigs into the aforementioned streams.
The full source, which is the differential stream that represents the actual object
created by a CREATE SOURCE statement, is created by composing
create_raw_source with
decoding, SourceEnvelope rendering, and more.
Structs
- FrontierCapture
- RawSourceCreationConfig - Shared configuration information for all source types. This is used in the create_raw_source functions, which produce raw sources.
- SourceExportCreationConfig - Reduced version of RawSourceCreationConfig that is used when rendering each export.
Functions
- create_raw_source - Creates a source dataflow operator graph from a source connection. The type of SourceConnection determines the type of connection that should be created.
- reclock_committed_upper 🔒 - Reclocks an IntoTime frontier stream into a FromTime frontier stream. This is used for the virtual (through persist) feedback edge so that we convert the IntoTime resumption frontier into the FromTime frontier that is used with the source’s OffsetCommiter.
- remap_operator 🔒 - Mints new contents for the remap shard based on summaries about the source upper it receives from the raw reader operators.
- source_render_operator 🔒 - Renders the source dataflow fragment from the given SourceConnection. This returns a collection timestamped with the source specific timestamp type. Also returns a second stream that can be used to learn about the source_upper that all the source reader instances know about. This second stream will be used by the remap_operator to mint new timestamp bindings into the remap shard.
- synthesize_probes 🔒 - Synthesizes a probe stream that produces the frontier of the given progress stream at the given interval.
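
The reclocking described for reclock_committed_upper can be pictured with a toy model. The sketch below is not this module's implementation: `Binding` and `reclock_frontier` are hypothetical names, and both time domains are simplified to totally ordered `u64` values, whereas real frontiers are antichains over possibly partially ordered timestamps. It assumes that remap bindings record, for each IntoTime timestamp, the FromTime upper (for example, a source offset) known at that time.

```rust
/// Hypothetical remap binding (illustration only): every source offset
/// below `from_upper` was assigned a timestamp at or before `into_ts`.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Binding {
    into_ts: u64,
    from_upper: u64,
}

/// Translates an IntoTime frontier into a FromTime frontier.
///
/// `bindings` must be sorted by `into_ts`, with non-decreasing `from_upper`.
/// Times strictly below `into_frontier` are complete, so the newest binding
/// with `into_ts < into_frontier` bounds the offsets that are safe to commit.
/// With no such binding, nothing is known to be complete and the result is 0.
fn reclock_frontier(bindings: &[Binding], into_frontier: u64) -> u64 {
    bindings
        .iter()
        .take_while(|b| b.into_ts < into_frontier)
        .last()
        .map(|b| b.from_upper)
        .unwrap_or(0)
}

fn main() {
    let bindings = [
        Binding { into_ts: 100, from_upper: 5 },
        Binding { into_ts: 200, from_upper: 12 },
        Binding { into_ts: 300, from_upper: 20 },
    ];
    for frontier in [50, 101, 300, 301] {
        println!(
            "into frontier {} -> from frontier {}",
            frontier,
            reclock_frontier(&bindings, frontier)
        );
    }
}
```

In this model a resumption frontier of 300 maps to offset 12 rather than 20, because the binding at timestamp 300 is not yet complete while the frontier still sits at 300.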