Add JMH benchmarks comparing read I/O strategies under memory pressure by neoremind · Pull Request #16279 · apache/lucene

neoremind · 2026-06-20T15:54:45Z

Adds JMH benchmarks to compare read I/O strategies in memory constrained scenario, related to #16044.

I/O strategies tested:

mmap no madvise
mmap + MADV_NORMAL + MADV_WILLNEED
mmap + MADV_RANDOM
mmap + MADV_RANDOM + MADV_WILLNEED
FFI pread(2) via Panama
FileChannel + DirectByteBuffer (simulates NIOFSDirectory)
FileChannel + HeapByteBuffer
O_DIRECT

Thread counts: 1, 4, 8, 16.

How to run

dd if=/dev/urandom of=/path/to/pread-bench-16G.dat bs=1M count=16384

java -jar lucene/benchmark-jmh/build/benchmarks/lucene-benchmark-jmh-11.0.0-SNAPSHOT.jar RandomReadIOBenchmark \
  -jvmArgs "--enable-native-access=ALL-UNNAMED -Xms2g -Xmx2g -Dbench.file=/path/to/pread-bench-16G.dat -Dbench.fileSizeMB=16384" \
  -p readSize=16384 -p readsPerOp=16

…x' is executed.

Refactor benchmarks

jimczi · 2026-06-27T19:05:28Z

+        MemorySegment slice = mmapSegmentNormal.asSlice(offsets[i], readSize);
+        int rc = (int) POSIX_MADVISE.invokeExact(slice, (long) readSize, MADV_WILLNEED);


madvise needs a page-aligned start address, so passing the raw random offset here makes it return EINVAL and do nothing — and since rc is discarded, it fails silently. On a c6id.4xlarge strace shows ~5117/5181 of these calls returning -1 EINVAL, so the prefetch never actually runs and the prefetch rows end up identical to plain mmap. The real Directory avoids this because MemorySegmentIndexInput#advise rounds the start down to the page first.

Suggested fix (mirrors what the Directory does):

Suggested change

MemorySegment slice = mmapSegmentNormal.asSlice(offsets[i], readSize);

int rc = (int) POSIX_MADVISE.invokeExact(slice, (long) readSize, MADV_WILLNEED);

// madvise needs a page-aligned start address, otherwise it returns EINVAL and is a no-op.

long offsetInPage = (mmapSegmentNormal.address() + offsets[i]) % ALIGNMENT;

long aoff = offsets[i] - offsetInPage;

long alen = readSize + offsetInPage;

MemorySegment slice = mmapSegmentNormal.asSlice(aoff, alen);

int rc = (int) POSIX_MADVISE.invokeExact(slice, alen, MADV_WILLNEED);

assert rc == 0 : "posix_madvise failed: " + rc;

Same change is needed in doMmapMadvRandomBatchedPrefetch (against mmapSegmentMadvRandom). With this, single-threaded mmap+prefetch goes from ~0.15 → ~4.2 ops/ms cold (≈7× pread at T01, ~device saturation). Might be worth asserting on rc at the other madvise call sites too so this can't silently regress again.

neoremind added 3 commits June 20, 2026 23:49

Add JMH benchmarks comparing read I/O strategies under memory pressure

e928cc5

Change read size from 16k to 4k

d8bc829

Update CHANGES.txt

5676506

github-actions Bot added this to the 10.6.0 milestone Jun 20, 2026

github-advanced-security AI found potential problems Jun 20, 2026

View reviewed changes

Comment thread lucene/benchmark-jmh/src/java/org/apache/lucene/benchmark/jmh/AbstractReadIOBenchmark.java Fixed

Comment thread lucene/benchmark-jmh/src/java/org/apache/lucene/benchmark/jmh/AbstractReadIOBenchmark.java Fixed

neoremind mentioned this pull request Jun 20, 2026

Introduce a pread Directory based on Panama-FFI ? #16044

Open

neoremind added 6 commits June 21, 2026 00:30

Fix tidy

153a733

Address github-advanced-security CR: Command with a relative path 'xx…

7e773f3

…x' is executed.

Address github-advanced-security CR: Command with a relative path 'xx…

ec30500

…x' is executed.

Refactor benchmarks. Use batched prefetch WILLNEED hint.

862598e

Refactor benchmarks

@SuppressWarnings("unused") for retcode of native calls

7a3d080

Fix Forbidden field access

e58c37d

jimczi reviewed Jun 27, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add JMH benchmarks comparing read I/O strategies under memory pressure#16279

Add JMH benchmarks comparing read I/O strategies under memory pressure#16279
neoremind wants to merge 9 commits into
apache:mainfrom
neoremind:16044_readio_pr

neoremind commented Jun 20, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

jimczi Jun 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		MemorySegment slice = mmapSegmentNormal.asSlice(offsets[i], readSize);
		int rc = (int) POSIX_MADVISE.invokeExact(slice, (long) readSize, MADV_WILLNEED);

-        MemorySegment slice = mmapSegmentNormal.asSlice(offsets[i], readSize);
-        int rc = (int) POSIX_MADVISE.invokeExact(slice, (long) readSize, MADV_WILLNEED);
+        // madvise needs a page-aligned start address, otherwise it returns EINVAL and is a no-op.
+        long offsetInPage = (mmapSegmentNormal.address() + offsets[i]) % ALIGNMENT;
+        long aoff = offsets[i] - offsetInPage;
+        long alen = readSize + offsetInPage;
+        MemorySegment slice = mmapSegmentNormal.asSlice(aoff, alen);
+        int rc = (int) POSIX_MADVISE.invokeExact(slice, alen, MADV_WILLNEED);
+        assert rc == 0 : "posix_madvise failed: " + rc;

Uh oh!

Conversation

neoremind commented Jun 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jimczi Jun 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

neoremind commented Jun 20, 2026 •

edited

Loading