Build and memoize i18n keys lazily to reduce Value memory by connorshea · Pull Request #478 · brainspec/enumerize

connorshea · 2026-06-17T21:25:37Z

This changeset was generated using Claude Code w/ Opus 4.8. This change is informed by usage in a production Rails app, and tested and reviewed by me manually.

Each Enumerize::Value eagerly built and retained an array of i18n lookup keys and a humanized fallback string in its constructor, regardless of whether #text was ever rendered. That memory is wasted whenever a value is used only as an intermediate step, for example in a calculation where only the final result is shown to the user.

This PR builds the keys lazily instead and memoizes the result on the (non-frozen) Attribute, keyed by value name. A value's keys are therefore composed at most once, and values whose #text is never displayed retain no key allocations.
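The lazy-plus-memoized shape described above can be sketched roughly as follows. This is a minimal illustration, not the gem's actual code: `SketchAttribute`, `SketchValue`, and `i18n_keys_for` are hypothetical names, and the humanized fallback is approximated with plain String methods instead of ActiveSupport.

```ruby
# Hypothetical stand-ins for illustration; not Enumerize's real classes.
class SketchAttribute
  attr_reader :name, :i18n_scopes

  def initialize(name, i18n_scopes = [])
    @name = name
    @i18n_scopes = i18n_scopes
    @i18n_keys_cache = {} # value name => frozen keys array
  end

  # Returns the cached keys for value_name, building them on first request.
  def i18n_keys_for(value_name)
    cached = @i18n_keys_cache[value_name]
    return cached if cached

    @i18n_keys_cache[value_name] = yield.freeze
  end
end

class SketchValue < String
  def initialize(attr, name)
    super(name.to_s)
    @attr = attr # nothing i18n-related is built or retained here
  end

  # Keys are composed on first use and shared through the attribute.
  def i18n_keys
    @attr.i18n_keys_for(to_s) { build_i18n_keys }
  end

  private

  def build_i18n_keys
    keys = @attr.i18n_scopes.map { |scope| :"#{scope}.#{self}" }
    keys << :"enumerize.defaults.#{@attr.name}.#{self}"
    keys << :"enumerize.#{@attr.name}.#{self}"
    keys << to_s.tr('_', ' ').capitalize # crude stand-in for the humanized fallback
  end
end

attr = SketchAttribute.new(:status)
a = SketchValue.new(attr, :active).freeze
b = SketchValue.new(attr, :active).freeze
puts a.i18n_keys.equal?(b.i18n_keys) # both values share one cached array
```

Because the cache lives on the attribute rather than on the value, the values themselves can stay frozen, and two values with the same name share a single keys array.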

When #text is never rendered, as in the benchmarks below, retained memory drops ~33% (~80 B/value) and allocations during value construction drop ~85%. Rendered values run within about 4% of the previous throughput, so the performance penalty should be minimal.

Benchmarking

benchmark script

# frozen_string_literal: true

# Benchmark isolating the "lazy i18n keys" change in Enumerize::Value.
#
# It compares two implementations that differ ONLY in when the i18n lookup
# keys (and the humanized fallback string) are built:
#
#   * EAGER  - the previous behavior: build the keys array in #initialize and
#              retain it on every Value instance forever.
#   * LAZY   - the current behavior: build the keys on first #text and memoize
#              them copy-on-write on the attribute. Values whose #text is never
#              rendered build and retain nothing; rendered values pay once.
#
# So the memory columns are measured with #text never called (lazy retains
# nothing) and the throughput column is measured warm (lazy keys memoized),
# which is the realistic render path.
#
# Run: ruby -Ilib benchmark/lazy_keys_benchmark.rb

$LOAD_PATH.unshift File.expand_path('../lib', __dir__)
require 'enumerize'
require 'benchmark/ips'
require 'objspace'

# A real attribute so the keys are built against real i18n_scopes/name.
KLASS = Class.new do
  extend Enumerize
  enumerize :status, in: %i[active inactive pending archived deleted suspended]
end
ATTR = KLASS.enumerized_attributes[:status]
NAMES = %i[active inactive pending archived deleted suspended].freeze

# EAGER variant: replicate the pre-change behavior (keys built and retained in
# the constructor), so the only axis that varies is eager-vs-lazy.
class EagerValue < Enumerize::Value
  def initialize(attr, name, value = nil)
    super
    @i18n_keys = build_i18n_keys
  end

  def text
    I18n.t(@i18n_keys[0], default: @i18n_keys[1..-1])
  end

  private

  def build_i18n_keys
    keys = @attr.i18n_scopes.map do |s|
      scope = Enumerize::Utils.call_if_callable(s, @value)
      :"#{scope}.#{self}"
    end
    keys << :"enumerize.defaults.#{@attr.name}.#{self}"
    keys << :"enumerize.#{@attr.name}.#{self}"
    keys << ActiveSupport::Inflector.humanize(ActiveSupport::Inflector.underscore(self))
    keys
  end
end

def build_values(value_class)
  NAMES.map { |n| value_class.new(ATTR, n).freeze }
end

# ---------------------------------------------------------------------------
# 1. Retained memory per value (the win) — text never rendered.
# ---------------------------------------------------------------------------
def retained_bytes(value_class, count)
  GC.start
  before = ObjectSpace.memsize_of_all
  store = Array.new(count) { build_values(value_class) }
  GC.start
  after = ObjectSpace.memsize_of_all
  store.clear
  after - before
end

COUNT = 5_000 # attributes worth of values (× 6 values each = 30k Value objects)
eager_mem = retained_bytes(EagerValue, COUNT)
lazy_mem  = retained_bytes(Enumerize::Value, COUNT)
values_total = COUNT * NAMES.size

# ---------------------------------------------------------------------------
# 2. Allocations to construct one attribute's worth of values (boot cost).
# ---------------------------------------------------------------------------
def construct_allocs(value_class, reps)
  GC.start
  GC.disable
  start = GC.stat(:total_allocated_objects)
  reps.times { build_values(value_class) }
  allocs = GC.stat(:total_allocated_objects) - start
  GC.enable
  allocs.to_f / reps
end

REPS = 2_000
eager_build_allocs = construct_allocs(EagerValue, REPS)
lazy_build_allocs  = construct_allocs(Enumerize::Value, REPS)

# ---------------------------------------------------------------------------
# 3. #text throughput — warm (lazy keys memoized on the attribute), the
#    realistic render path.
# ---------------------------------------------------------------------------
eager_values = build_values(EagerValue)
lazy_values  = build_values(Enumerize::Value)
lazy_values.each(&:text) # warm the attribute key cache

puts "\n#text throughput (higher is better):"
text_report = Benchmark.ips do |x|
  x.report('eager #text') { eager_values.each(&:text) }
  x.report('lazy  #text') { lazy_values.each(&:text) }
  x.compare!
end

eager_ips = text_report.entries.find { |e| e.label == 'eager #text' }.ips
lazy_ips  = text_report.entries.find { |e| e.label == 'lazy  #text' }.ips

# ---------------------------------------------------------------------------
# Markdown summary table.
# ---------------------------------------------------------------------------
fmt_kb  = ->(b) { format('%.1f KB', b / 1024.0) }
fmt_b   = ->(b) { format('%.1f B', b.to_f) }
pct     = ->(from, to) { format('%+.1f%%', (to - from) * 100.0 / from) }

puts "\n\n## Lazy i18n keys — benchmark results"
puts "\nRuby #{RUBY_VERSION}, #{values_total} Value objects measured for memory.\n\n"
puts '| Metric | Eager (before) | Lazy (after) | Change |'
puts '| --- | --- | --- | --- |'
puts "| Retained memory, #{values_total} values (text never called) | #{fmt_kb[eager_mem]} | #{fmt_kb[lazy_mem]} | #{pct[eager_mem, lazy_mem]} |"
puts "| Retained memory per value | #{fmt_b[eager_mem.to_f / values_total]} | #{fmt_b[lazy_mem.to_f / values_total]} | #{fmt_b[(lazy_mem - eager_mem).to_f / values_total]}/value |"
puts "| Objects allocated building one attribute (6 values) | #{format('%.1f', eager_build_allocs)} | #{format('%.1f', lazy_build_allocs)} | #{pct[eager_build_allocs, lazy_build_allocs]} |"
puts "| #text throughput (i/s, 6 values/iter, warm) | #{format('%.0f', eager_ips)} | #{format('%.0f', lazy_ips)} | #{pct[eager_ips, lazy_ips]} |"
puts "\n_Lazy wins on memory and build cost; memoization keeps #text on par with eager._"

Lazy + memoized i18n keys vs. the old eager-at-construction behavior. Memory is measured over 30,000 Value objects with #text never called (the idle case the change targets); throughput is measured warm (keys memoized), the realistic render path.

| Metric | Eager (before) | Lazy + memoized (after) | Change |
| --- | --- | --- | --- |
| Retained memory, 30k values (text never called) | 7237.5 KB | 4881.9 KB | −32.5% |
| Retained memory per value | 247.0 B | 166.6 B | −80.4 B/value |
| Objects allocated building one attribute (6 values) | 127.0 | 19.0 | −85.0% |
| `#text` throughput (i/s, 6 values/iter, warm) | 41,425 | 39,609 | −4.4% (this is a downside, but it's hopefully acceptable) |
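As a quick sanity check, the Change column can be recomputed from the reported figures with the same percent-change formula the benchmark script uses. The inputs below are copied from the results above; nothing is re-measured.

```ruby
# Same formula as the script's `pct` lambda: signed percent change from -> to.
pct = ->(from, to) { format('%+.1f%%', (to - from) * 100.0 / from) }

puts pct[7237.5, 4881.9]                      # retained memory      => -32.5%
puts pct[127.0, 19.0]                         # objects allocated    => -85.0%
puts pct[41_425, 39_609]                      # #text throughput     => -4.4%
puts format('%.1f B/value', 166.6 - 247.0)    # per-value difference => -80.4 B/value
```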

Each Enumerize::Value eagerly built and retained an array of i18n lookup keys plus a humanized fallback string in its constructor, regardless of whether #text was ever rendered.

Build them lazily instead, and memoize the result on the (non-frozen) Attribute keyed by value name, so a value's keys are composed at most once and values whose #text is never displayed retain nothing.

The cache is updated copy-on-write, so concurrent #text calls stay safe without locking, matching the thread-safety the frozen-at-boot version had. A race between two builds is last-writer-wins: the result is always correct, and the worst case is a redundant rebuild.

When #text is never rendered, retained memory drops ~33% (~80 B/value) and class-definition-time allocations drop ~85%; rendered values stay within about 4% of the old eager throughput. See benchmark/lazy_keys_benchmark.rb.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
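The copy-on-write update mentioned in the commit message might look roughly like the sketch below. `CowCache` and its methods are hypothetical names for illustration, and the sketch assumes that assigning an instance variable is an atomic reference swap, as it is in CRuby.

```ruby
# Hypothetical copy-on-write memo; not the PR's actual implementation.
class CowCache
  def initialize
    @table = {}.freeze # readers only ever see a frozen, complete hash
  end

  def fetch(key)
    table = @table # take one snapshot of the current hash
    return table[key] if table.key?(key)

    value = yield.freeze
    # Build a new hash and publish it with a single reference assignment.
    # If two threads race, the last writer wins; an entry lost this way is
    # simply rebuilt on a later call.
    @table = table.merge(key => value).freeze
    value
  end

  def size
    @table.size
  end
end

cache = CowCache.new
threads = 8.times.map do
  Thread.new do
    100.times { |n| cache.fetch("k#{n % 10}") { ["key-#{n % 10}"] } }
  end
end
threads.each(&:join)
puts cache.size
```

No reader ever observes a partially updated hash, which is why no lock is needed; the trade-off is that a racing write can discard another thread's entry and force one extra build.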

connorshea and others added 2 commits June 17, 2026 15:07

rm bench

8135951

connorshea wants to merge 2 commits into brainspec:master from connorshea:lazy-i18n-key-caching
