乐趣区

关于golang:聊聊cortex的Backoff

本文次要钻研一下 cortex 的 Backoff

Backoff

github.com/cortexproject/cortex/pkg/util/backoff.go

// Backoff implements exponential backoff with randomized wait times
type Backoff struct {
    cfg          BackoffConfig
    ctx          context.Context
    numRetries   int
    nextDelayMin time.Duration
    nextDelayMax time.Duration
}

// NewBackoff creates a Backoff object. Pass a Context that can also terminate the operation.
func NewBackoff(ctx context.Context, cfg BackoffConfig) *Backoff {
    return &Backoff{
        cfg:          cfg,
        ctx:          ctx,
        nextDelayMin: cfg.MinBackoff,
        nextDelayMax: doubleDuration(cfg.MinBackoff, cfg.MaxBackoff),
    }
}

Backoff 定义了 cfg、ctx、numRetries、nextDelayMin、nextDelayMax 属性;NewBackoff 提供了基于 BackoffConfig 的工厂办法,默认的 nextDelayMin 为 cfg.MinBackoff

BackoffConfig

github.com/cortexproject/cortex/pkg/util/backoff.go

// BackoffConfig configures a Backoff
type BackoffConfig struct {
    MinBackoff time.Duration `yaml:"min_period"`  // start backoff at this level
    MaxBackoff time.Duration `yaml:"max_period"`  // increase exponentially to this level
    MaxRetries int           `yaml:"max_retries"` // give up after this many; zero means infinite retries
}

BackoffConfig 定义了 MinBackoff、MaxBackoff、MaxRetries 属性

Ongoing

github.com/cortexproject/cortex/pkg/util/backoff.go

// Reset the Backoff back to its initial condition
func (b *Backoff) Reset() {
    b.numRetries = 0
    b.nextDelayMin = b.cfg.MinBackoff
    b.nextDelayMax = doubleDuration(b.cfg.MinBackoff, b.cfg.MaxBackoff)
}

// Ongoing returns true if caller should keep going
func (b *Backoff) Ongoing() bool {
    // Stop if Context has errored or max retry count is exceeded
    return b.ctx.Err() == nil && (b.cfg.MaxRetries == 0 || b.numRetries < b.cfg.MaxRetries)
}

// Err returns the reason for terminating the backoff, or nil if it didn't terminate
func (b *Backoff) Err() error {if b.ctx.Err() != nil {return b.ctx.Err()
    }
    if b.cfg.MaxRetries != 0 && b.numRetries >= b.cfg.MaxRetries {return fmt.Errorf("terminated after %d retries", b.numRetries)
    }
    return nil
}

// NumRetries returns the number of retries so far
func (b *Backoff) NumRetries() int {return b.numRetries}

// Wait sleeps for the backoff time then increases the retry count and backoff time
// Returns immediately if Context is terminated
func (b *Backoff) Wait() {
    // Increase the number of retries and get the next delay
    sleepTime := b.NextDelay()

    if b.Ongoing() {
        select {case <-b.ctx.Done():
        case <-time.After(sleepTime):
        }
    }
}

func (b *Backoff) NextDelay() time.Duration {
    b.numRetries++

    // Handle the edge case the min and max have the same value
    // (or due to some misconfig max is < min)
    if b.nextDelayMin >= b.nextDelayMax {return b.nextDelayMin}

    // Add a jitter within the next exponential backoff range
    sleepTime := b.nextDelayMin + time.Duration(rand.Int63n(int64(b.nextDelayMax-b.nextDelayMin)))

    // Apply the exponential backoff to calculate the next jitter
    // range, unless we've already reached the max
    if b.nextDelayMax < b.cfg.MaxBackoff {b.nextDelayMin = doubleDuration(b.nextDelayMin, b.cfg.MaxBackoff)
        b.nextDelayMax = doubleDuration(b.nextDelayMax, b.cfg.MaxBackoff)
    }

    return sleepTime
}

func doubleDuration(value time.Duration, max time.Duration) time.Duration {
    value = value * 2

    if value <= max {return value}

    return max
}

Backoff 次要提供了 Ongoing 及 Wait 办法;Ongoing 返回 bool 用于示意是否能够持续,如果 err 为 nil 且 b.cfg.MaxRetries 或者 b.numRetries < b.cfg.MaxRetries 返回 true;Wait 办法会期待执行实现或者是 b.NextDelay() 工夫到;NextDelay 办法会递增 numRetries 而后计算 sleepTime;Err 办法返回 ctx 的 Err 或者是重试次数超限的谬误

实例

// NewBackoffRetry gRPC middleware.
func NewBackoffRetry(cfg util.BackoffConfig) grpc.UnaryClientInterceptor {return func(ctx context.Context, method string, req, reply interface{}, cc *grpc.ClientConn, invoker grpc.UnaryInvoker, opts ...grpc.CallOption) error {backoff := util.NewBackoff(ctx, cfg)
        for backoff.Ongoing() {err := invoker(ctx, method, req, reply, cc, opts...)
            if err == nil {return nil}

            if status.Code(err) != codes.ResourceExhausted {return err}

            backoff.Wait()}
        return backoff.Err()}
}

NewBackoffRetry 展现了如何应用 backoff,通过 for 循环,条件为 backoff.Ongoing(),两头执行要重试的操作,最初执行 backoff.Wait(),如果没有提前返回最初返回 backoff.Err()

小结

cortex 提供了 Backoff,能够基于 MinBackoff、MaxBackoff、MaxRetries 来进行重试。

doc

  • cortex
退出移动版