Ristretto

GitHub License chat GitHub Repo stars GitHub commit activity Go Report Card

Ristretto is a fast, concurrent cache library built with a focus on performance and correctness.

The motivation to build Ristretto comes from the need for a contention-free cache in Dgraph.

Features

Status

Ristretto is production-ready. See Projects using Ristretto.

Getting Started

Installing

To start using Ristretto, install Go 1.21 or above. Ristretto needs go modules. From your project, run the following command

go get github.com/dgraph-io/ristretto/v2

This will retrieve the library.

Choosing a version

Following these rules:

Usage

package main

import (
  "fmt"

  "github.com/dgraph-io/ristretto/v2"
)

func main() {
  cache, err := ristretto.NewCache(&ristretto.Config[string, string]{
    NumCounters: 1e7,     // number of keys to track frequency of (10M).
    MaxCost:     1 << 30, // maximum cost of cache (1GB).
    BufferItems: 64,      // number of keys per Get buffer.
  })
  if err != nil {
    panic(err)
  }
  defer cache.Close()

  // set a value with a cost of 1
  cache.Set("key", "value", 1)

  // wait for value to pass through buffers
  cache.Wait()

  // get value from cache
  value, found := cache.Get("key")
  if !found {
    panic("missing value")
  }
  fmt.Println(value)

  // del value from cache
  cache.Del("key")
}

Benchmarks

The benchmarks can be found in https://github.com/hypermodeinc/dgraph-benchmarks/tree/main/cachebench/ristretto.

This trace is described as "disk read accesses initiated by a large commercial search engine in response to various web search requests."

Graph showing hit ratios comparison for search workload

Hit Ratio for Database

This trace is described as "a database server running at a commercial site running an ERP application on top of a commercial database."

Graph showing hit ratios comparison for database workload

Hit Ratio for Looping

This trace demonstrates a looping access pattern.

Graph showing hit ratios comparison for looping access pattern

Hit Ratio for CODASYL

This trace is described as "references to a CODASYL database for a one hour period."

Graph showing hit ratios comparison for CODASYL workload

Throughput for Mixed Workload

Graph showing throughput comparison for mixed workload

Throughput ffor Read Workload

Graph showing throughput comparison for read workload

Through for Write Workload

Graph showing throughput comparison for write workload

Projects Using Ristretto

Below is a list of known projects that use Ristretto:

FAQ

How are you achieving this performance? What shortcuts are you taking?

We go into detail in the Ristretto blog post, but in short: our throughput performance can be attributed to a mix of batching and eventual consistency. Our hit ratio performance is mostly due to an excellent admission policy and SampledLFU eviction policy.

As for "shortcuts," the only thing Ristretto does that could be construed as one is dropping some Set calls. That means a Set call for a new item (updates are guaranteed) isn't guaranteed to make it into the cache. The new item could be dropped at two points: when passing through the Set buffer or when passing through the admission policy. However, this doesn't affect hit ratios much at all as we expect the most popular items to be Set multiple times and eventually make it in the cache.

Is Ristretto distributed?

No, it's just like any other Go library that you can import into your project and use in a single process.

Join libs.tech

...and unlock some superpowers

GitHub

We won't share your data with anyone else.