Struct Spine

struct Spine<T> {
    effort: usize,
    next_id: usize,
    since: Antichain<T>,
    upper: Antichain<T>,
    merging: Vec<MergeState<T>>,
}

An append-only collection of update batches.

The Spine is a general-purpose trace implementation based on collecting and merging immutable batches of updates. It is generic with respect to the batch type, and can be instantiated for any implementor of trace::Batch.

§Design

This spine is represented as a list of layers, where each element in the list is one of:

MergeState::Vacant: empty
MergeState::Single: a single batch
MergeState::Double: a pair of batches

Each “batch” has the option to be None, indicating a non-batch that nonetheless acts as a number of updates proportionate to the level at which it exists (for bookkeeping).
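The layer states described above can be sketched as a small enum. This is a minimal illustration, not the crate's definition: the `Batch` struct, the `physical_len` and `slots` helpers, and the use of `Option` to model a placeholder batch are all assumptions made for the example.

```rust
/// Hypothetical stand-in for a real batch: only its update count matters here.
#[derive(Debug, Clone, PartialEq)]
struct Batch {
    len: usize,
}

/// One layer of the spine. A `None` batch is a placeholder that occupies a
/// slot for bookkeeping purposes but stores no updates.
#[derive(Debug, Clone, PartialEq)]
enum MergeState {
    /// No batch at this layer.
    Vacant,
    /// A single batch (or placeholder) at this layer.
    Single(Option<Batch>),
    /// A pair of batches (or placeholders) at this layer.
    Double(Option<Batch>, Option<Batch>),
}

impl MergeState {
    /// Number of updates physically stored in this layer.
    fn physical_len(&self) -> usize {
        let len = |b: &Option<Batch>| b.as_ref().map_or(0, |b| b.len);
        match self {
            MergeState::Vacant => 0,
            MergeState::Single(b) => len(b),
            MergeState::Double(a, b) => len(a) + len(b),
        }
    }

    /// Number of batch slots in use, counting `None` placeholders.
    fn slots(&self) -> usize {
        match self {
            MergeState::Vacant => 0,
            MergeState::Single(_) => 1,
            MergeState::Double(_, _) => 2,
        }
    }
}
```

In this sketch a `MergeState::Single(None)` layer reports one occupied slot but zero physical updates, which is the sense in which a non-batch still counts towards the accounting.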

Each of the batches at layer i contains at most 2^i elements. In the sequence of batches, the upper bound of each batch should match the lower bound of the next. Batches may be logically empty, with matching upper and lower bounds, as a bookkeeping mechanism.
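The requirement on bounds can be illustrated with a simplified check. The sketch below assumes totally ordered `u64` times in place of antichains, and the `Description` type and both function names are hypothetical.

```rust
/// Hypothetical batch description covering the interval from `lower` to `upper`.
/// Real batches use antichains as bounds; plain integers keep the sketch small.
#[derive(Debug, Clone, Copy)]
struct Description {
    lower: u64,
    upper: u64,
}

impl Description {
    /// A batch whose bounds coincide carries no updates and exists only
    /// for bookkeeping.
    fn is_logically_empty(&self) -> bool {
        self.lower == self.upper
    }
}

/// True if the upper bound of every batch equals the lower bound of the next.
fn is_contiguous(batches: &[Description]) -> bool {
    batches
        .windows(2)
        .all(|pair| pair[0].upper == pair[1].lower)
}
```

A logically empty batch such as `Description { lower: 3, upper: 3 }` can sit between two non-empty batches without breaking contiguity.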

Each batch at layer i is treated as if it contains exactly 2^i elements, even though it may actually contain fewer elements. This allows us to decouple the physical representation from logical amounts of effort invested in each batch. It allows us to begin compaction and to reduce the number of updates, without compromising our ability to continue to move updates along the spine. We are explicitly making the trade-off that while some batches might compact at lower levels, we want to treat them as if they contained their full set of updates for accounting reasons (to apply work to higher levels).
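One way to picture the split between physical and logical size is to pick a layer from a batch's length once, and from then on charge the batch the full capacity of that layer. The function names below are assumptions for illustration, and rounding up to a power of two is one plausible placement rule rather than a statement about this implementation.

```rust
/// Smallest layer whose capacity `2^layer` can hold `len` updates.
fn layer_for_len(len: usize) -> usize {
    len.next_power_of_two().trailing_zeros() as usize
}

/// Logical size charged to any batch at `layer`, regardless of how many
/// updates it physically contains after compaction.
fn logical_len(layer: usize) -> usize {
    1 << layer
}
```

Under this rule a batch of 5 updates lands at layer 3 and is accounted as 8 updates, and it keeps that accounting even if compaction later reduces it to fewer physical updates.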

We maintain the invariant that for any in-progress merge at level k there should be fewer than 2^k records at levels lower than k. That is, even if we were to apply an unbounded amount of effort to those records, we would not have enough records to prompt a merge into the in-progress merge. Ideally, we maintain the extended invariant that for any in-progress merge at level k, the remaining effort required (number of records minus applied effort) is less than the number of records that would need to be added to reach 2^k records in layers below.
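Both invariants can be written down as simple predicates over per-level record counts. This is a sketch that takes the statements above literally; the slice-of-counts representation and the function names are assumptions, not part of the Spine API.

```rust
/// Basic invariant for an in-progress merge at level `k`: the records at
/// levels below `k` total fewer than `2^k`.
/// `records_per_level[i]` is the logical record count at level `i`.
fn invariant_holds(records_per_level: &[usize], k: usize) -> bool {
    let below: usize = records_per_level.iter().take(k).sum();
    below < (1usize << k)
}

/// Extended invariant: the effort still owed to the merge at level `k` is
/// less than the number of records that must still arrive before the
/// levels below reach `2^k` records.
fn extended_invariant_holds(
    records_per_level: &[usize],
    k: usize,
    merge_records: usize,
    applied_effort: usize,
) -> bool {
    let below: usize = records_per_level.iter().take(k).sum();
    let remaining = merge_records.saturating_sub(applied_effort);
    let headroom = (1usize << k).saturating_sub(below);
    remaining < headroom
}
```

For example, with one record at level 0 and two at level 1, a merge at level 3 has headroom of 5 records, so it satisfies the extended invariant only while fewer than 5 units of effort remain.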

§Mathematics

When a merge is initiated, there should be a non-negative deficit of updates before the layers below could plausibly produce a new batch for the currently merging layer. We must determine a factor of proportionality, so that newly arrived updates provide at least that amount of “fuel” towards the merging layer, so that the merge completes before lower levels invade.

§Deficit:

A new merge is initiated only in response to the completion of a prior merge, or the introduction of new records from outside. The latter case is special, and will maintain our invariant trivially, so we will focus on the former case.

When a merge at level k completes, assuming we have maintained our invariant, there should be fewer than 2^k records at lower levels. The newly created merge at level k+1 will require up to 2^(k+2) units of work, and should not expect a new batch until strictly more than 2^k records are added. This means that a factor of proportionality of four should be sufficient to ensure that the merge completes before a new merge is initiated: 2^k new records, each contributing four units of fuel, supply the full 2^(k+2) units of work.
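The arithmetic behind the factor of four can be checked directly. The sketch below reads the work bound for the new merge at level k+1 as 2^(k+2), which is the reading under which a factor of four is exactly sufficient; the function names are hypothetical.

```rust
/// Upper bound on the work for a merge newly created at level `k + 1`,
/// read here as `2^(k+2)`.
fn merge_work(k: usize) -> usize {
    1 << (k + 2)
}

/// Fuel accumulated once `2^k` new records have arrived, if every record
/// contributes `fuel_per_record` units.
fn fuel_after_two_pow_k_records(k: usize, fuel_per_record: usize) -> usize {
    fuel_per_record * (1 << k)
}

/// True if the merge is fully paid for by the time `2^k` records have
/// arrived, which is before the lower levels can produce another batch
/// (that takes strictly more than `2^k` records).
fn finishes_in_time(k: usize, fuel_per_record: usize) -> bool {
    fuel_after_two_pow_k_records(k, fuel_per_record) >= merge_work(k)
}
```

With four units of fuel per record the check passes for every level, while three units per record falls short at every level.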

When new records get introduced, we will need to roll up any batches at lower levels, which we treat as the introduction of records. Each of these virtual records should also be accounted for in terms of the fuel it contributes, as it results in the promotion of batches closer to in-progress merges.

§Fuel sharing

We like the idea of applying fuel preferentially to merges at lower levels, on the theory that they are easier to complete, and we benefit from fewer total merges in progress. This does delay the completion of merges at higher levels, and may not obviously be a total win. If we choose to do this, we should make sure that we correctly account for completed merges at low layers: they should still extract fuel from new updates even though they have completed, at least until they have paid back any “debt” to higher layers by continuing to provide fuel as updates arrive.
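A lowest-level-first policy could look like the following sketch. It models only the ordering of fuel application, not the "debt" bookkeeping described above, and the function name and the slice of outstanding work are assumptions made for illustration rather than the behavior of `apply_fuel`.

```rust
/// Spends `fuel` on in-progress merges, lowest level first.
/// `remaining[i]` is the outstanding work of the i-th in-progress merge,
/// ordered from the lowest level to the highest.
/// Returns the fuel left over once every merge has been visited.
fn apply_fuel_lowest_first(remaining: &mut [usize], mut fuel: usize) -> usize {
    for work in remaining.iter_mut() {
        if fuel == 0 {
            break;
        }
        // Spend as much as this merge can absorb before moving up a level.
        let spent = fuel.min(*work);
        *work -= spent;
        fuel -= spent;
    }
    fuel
}
```

Given outstanding work of 3, 10, and 40 units and 8 units of fuel, the lowest merge completes and the next one receives the remaining 5 units, while the highest merge sees no progress. That is the delay at higher levels that the paragraph above warns about.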

Fields§

§effort: usize
§next_id: usize
§since: Antichain<T>
§upper: Antichain<T>
§merging: Vec<MergeState<T>>

Implementations§

impl<T> Spine<T>

pub fn spine_batches(&self) -> impl Iterator<Item = &SpineBatch<T>>

pub fn spine_batches_mut(&mut self) -> impl DoubleEndedIterator<Item = &mut SpineBatch<T>>

impl<T: Timestamp + Lattice> Spine<T>

pub fn new() -> Self

fn exert(&mut self, effort: usize, log: &mut SpineLog<'_, T>) -> bool

pub fn next_id(&mut self) -> SpineId

pub fn insert(&mut self, batch: HollowBatch<T>, log: &mut SpineLog<'_, T>)

fn reduced(&self) -> bool

fn describe(&self) -> Vec<(usize, usize)>

fn introduce_batch(&mut self, batch: SpineBatch<T>, batch_index: usize, log: &mut SpineLog<'_, T>)

fn roll_up(&mut self, index: usize, log: &mut SpineLog<'_, T>)

pub fn apply_fuel(&mut self, fuel: &isize, log: &mut SpineLog<'_, T>)

fn insert_at(&mut self, batch: SpineBatch<T>, index: usize)

fn complete_at(&mut self, index: usize, log: &mut SpineLog<'_, T>) -> Option<SpineBatch<T>>

fn tidy_layers(&mut self)

fn validate(&self) -> Result<(), String>

Trait Implementations§

impl<T: Clone> Clone for Spine<T>

fn clone(&self) -> Spine<T>

fn clone_from(&mut self, source: &Self)

impl<T: Debug> Debug for Spine<T>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Auto Trait Implementations§

impl<T> Freeze for Spine<T>
where T: Freeze,

impl<T> RefUnwindSafe for Spine<T>
where T: RefUnwindSafe,

impl<T> Send for Spine<T>
where T: Send + Sync,

impl<T> Sync for Spine<T>
where T: Sync + Send,

impl<T> Unpin for Spine<T>
where T: Unpin,

impl<T> UnwindSafe for Spine<T>
where T: RefUnwindSafe + UnwindSafe,

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T, U> CastInto<U> for T
where U: CastFrom<T>,

fn cast_into(self) -> U

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dst: *mut u8)

impl<T> CopyAs<T> for T

fn copy_as(self) -> T

impl<T> DynClone for T
where T: Clone,

fn __clone_box(&self, _: Private) -> *mut ()

impl<T> From<T> for T

fn from(t: T) -> T

impl<T> FromRef<T> for T
where T: Clone,

fn from_ref(input: &T) -> T

impl<T> FutureExt for T

fn with_context(self, otel_cx: Context) -> WithContext<Self>

fn with_current_context(self) -> WithContext<Self>

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

fn in_current_span(self) -> Instrumented<Self>

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

impl<T> IntoRequest<T> for T

fn into_request(self) -> Request<T>

impl<Unshared, Shared> IntoShared<Shared> for Unshared
where Shared: FromUnshared<Unshared>,

fn into_shared(self) -> Shared

impl<T> Pointable for T

const ALIGN: usize = _

type Init = T

unsafe fn init(init: <T as Pointable>::Init) -> usize

unsafe fn deref<'a>(ptr: usize) -> &'a T

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

unsafe fn drop(ptr: usize)

impl<P, R> ProtoType<R> for P
where R: RustType<P>,

fn into_rust(self) -> Result<R, TryFromProtoError>

impl<'a, S, T> Semigroup<&'a S> for T
where T: Semigroup<S>,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

impl<T> Allocation for T
where T: RefUnwindSafe + Send + Sync,

impl<T> Data for T
where T: Clone + 'static,