perf(pm): parse version manifests from vec buffers by elrrrrrrr · Pull Request #3031 · utooland/utoo

elrrrrrrr · 2026-05-21T17:52:26Z

Summary

Review-sized split from #3028 / #2948, stacked after #3030.

Optimizes exact-version manifest parsing by keeping the HTTP body as the mutable Vec<u8> that simd_json needs:

replaces the generic bytes parser with parse_json_vec_off_runtime;
reads version-manifest responses directly into Vec<u8> on native targets;
keeps fetch_version_manifest_bytes as a compatibility wrapper;
preserves full-manifest raw bytes behavior by only applying the Vec path to version manifests.

This is the zero-copy hot path piece for exact-version manifest fetches; full manifests still keep immutable Bytes because their raw payload is reused for later version extraction.

Size

Diff against perf(pm): add resolver manifest provider boundary #3030: 1 file changed, 75 insertions(+), 11 deletions(-)

Validation

cargo fmt
cargo check -p utoo-ruborist
cargo test -p utoo-ruborist service::manifest::tests::parse_json_vec_off_runtime_consumes_mutable_buffer
cargo clippy --all-targets -- -D warnings --no-deps

pack-napi still warns locally because next.js is a symlink in this worktree; clippy exits successfully.

gemini-code-assist

Code Review

This pull request optimizes manifest parsing by transitioning from Bytes to Vec<u8>, allowing simd_json to perform in-place mutation and avoiding unnecessary memory copies. Review feedback identified that simd_json requires specific padding (SIMDJSON_PADDING) to prevent out-of-bounds reads, which was missing in the implementation. Recommendations include resizing the buffer with the necessary padding before parsing and pre-allocating this space during the initial response body read to further optimize performance.

gemini-code-assist · 2026-05-21T17:56:00Z

        rayon::spawn(move || {
-            let mut parse_buf = bytes.to_vec();
            let result = simd_json::serde::from_slice::<T>(&mut parse_buf)
                .map_err(|e| anyhow!("JSON parse error: {e}"));


The simd_json parser requires the input buffer to have a specific amount of padding (defined by simd_json::SIMDJSON_PADDING) beyond the end of the JSON string to perform safe SIMD reads. Without this padding, the parser may perform out-of-bounds reads, which is undefined behavior. Since simd_json::serde::from_slice mutates the buffer in-place, you should resize the Vec<u8> to include this padding (using spaces to avoid trailing character errors) before parsing.

Suggested change

rayon::spawn(move || {

let mut parse_buf = bytes.to_vec();

let result = simd_json::serde::from_slice::<T>(&mut parse_buf)

.map_err(|e| anyhow!("JSON parse error: {e}"));

rayon::spawn(move || {

parse_buf.resize(parse_buf.len() + simd_json::SIMDJSON_PADDING, b' ');

let result = simd_json::serde::from_slice::<T>(&mut parse_buf)

.map_err(|e| anyhow!("JSON parse error: {e}"));

gemini-code-assist · 2026-05-21T17:56:00Z

        simd_json::serde::from_slice::<T>(&mut parse_buf)
            .map_err(|e| anyhow!("JSON parse error: {e}"))


Similar to the native path, the wasm32 implementation should also ensure the input buffer has the required simd_json::SIMDJSON_PADDING to avoid potential out-of-bounds reads during parsing.

parse_buf.resize(parse_buf.len() + simd_json::SIMDJSON_PADDING, b' '); simd_json::serde::from_slice::<T>(&mut parse_buf) .map_err(|e| anyhow!("JSON parse error: {e}"))

gemini-code-assist · 2026-05-21T17:56:00Z

+        .content_length()
+        .and_then(|len| usize::try_from(len).ok())
+        .unwrap_or(0);
+    let mut body = Vec::with_capacity(capacity);


To optimize the performance of the subsequent JSON parsing, you can pre-allocate the required simd_json::SIMDJSON_PADDING in the Vec. This avoids a potential reallocation when resize is called later in the parsing functions.

Suggested change

let mut body = Vec::with_capacity(capacity);

let mut body = Vec::with_capacity(capacity + simd_json::SIMDJSON_PADDING);

github-actions · 2026-05-22T01:05:36Z

📊 pm-bench-phases · `1810f70` · linux (`ubuntu-latest`)

Workflow run — ant-design

PMs: utoo (this branch) · utoo-npm (latest published) · bun (latest)

npmjs.org

p0_full_cold

PM	wall	±σ	user	sys	RSS	pgMinor
bun	8.99s	0.12s	10.38s	9.99s	673M	313.7K
utoo-next	8.87s	0.99s	10.65s	12.23s	980M	122.4K
utoo-npm	8.08s	0.03s	10.43s	12.05s	989M	125.9K
utoo	7.82s	0.06s	10.42s	11.87s	970M	128.1K

PM	vCtx	iCtx	netRX	netTX	cache	node_mod	lock
bun	16.9K	19.4K	1.19G	7M	1.86G	1.75G	1M
utoo-next	128.3K	91.3K	1.16G	6M	1.71G	1.70G	2M
utoo-npm	126.3K	89.9K	1.16G	5M	1.71G	1.70G	2M
utoo	122.1K	76.8K	1.16G	5M	1.71G	1.70G	2M

p1_resolve

PM	wall	±σ	user	sys	RSS	pgMinor
bun	2.16s	0.06s	3.93s	1.16s	506M	186.9K
utoo-next	2.96s	0.04s	5.10s	1.85s	609M	80.9K
utoo-npm	3.10s	0.03s	5.17s	2.21s	601M	74.7K
utoo	2.93s	0.07s	5.03s	1.87s	612M	86.9K

PM	vCtx	iCtx	netRX	netTX	cache	node_mod	lock
bun	10.8K	4.1K	203M	3M	108M	-	1M
utoo-next	53.3K	71.5K	200M	3M	7M	3M	2M
utoo-npm	76.8K	97.3K	200M	2M	7M	3M	2M
utoo	52.2K	74.2K	200M	3M	7M	3M	2M

p3_cold_install

PM	wall	±σ	user	sys	RSS	pgMinor
bun	6.72s	0.23s	6.34s	9.67s	566M	200.1K
utoo-next	6.05s	0.17s	5.02s	10.46s	486M	64.4K
utoo-npm	5.83s	0.01s	5.13s	10.47s	465M	58.8K
utoo	5.83s	0.15s	5.10s	10.32s	519M	60.9K

PM	vCtx	iCtx	netRX	netTX	cache	node_mod	lock
bun	7.1K	7.4K	1019M	4M	1.76G	1.76G	1M
utoo-next	101.8K	48.7K	989M	3M	1.70G	1.70G	2M
utoo-npm	102.7K	49.3K	989M	3M	1.70G	1.70G	2M
utoo	97.1K	54.0K	989M	3M	1.70G	1.70G	2M

p4_warm_link

PM	wall	±σ	user	sys	RSS	pgMinor
bun	3.33s	0.06s	0.19s	2.44s	136M	32.4K
utoo-next	2.39s	0.22s	0.53s	3.79s	79M	18.1K
utoo-npm	2.41s	0.11s	0.48s	3.82s	79M	18.8K
utoo	2.29s	0.03s	0.50s	3.78s	79M	18.8K

PM	vCtx	iCtx	netRX	netTX	cache	node_mod	lock
bun	249	84	5M	46K	1.91G	1.75G	1M
utoo-next	42.4K	19.4K	18K	25K	1.70G	1.70G	2M
utoo-npm	42.0K	19.6K	15K	10K	1.70G	1.70G	2M
utoo	41.8K	19.4K	14K	10K	1.71G	1.70G	2M

npmmirror.com: no output captured.

elrrrrrrr added A-Pkg Manager Area: Package Manager benchmark Run pm-bench on PR labels May 21, 2026

gemini-code-assist Bot reviewed May 21, 2026

View reviewed changes

This was referenced May 21, 2026

perf(pm): extract requested core from full manifest parse #3034

Draft

perf(pm): source resolver mainloop architecture #3028

Draft

elrrrrrrr force-pushed the perf/pm-split-resolver-version-vec branch 2 times, most recently from bd39706 to b96d81c Compare May 21, 2026 23:09

perf(pm): parse version manifests from vec buffers

604aa30

elrrrrrrr force-pushed the perf/pm-split-resolver-version-vec branch from b96d81c to 604aa30 Compare May 21, 2026 23:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(pm): parse version manifests from vec buffers#3031

perf(pm): parse version manifests from vec buffers#3031
elrrrrrrr wants to merge 1 commit into
perf/pm-split-resolver-provider-boundaryfrom
perf/pm-split-resolver-version-vec

elrrrrrrr commented May 21, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

gemini-code-assist Bot May 21, 2026

Uh oh!

github-actions Bot commented May 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		simd_json::serde::from_slice::<T>(&mut parse_buf)
		.map_err(\|e\| anyhow!("JSON parse error: {e}"))

	let mut body = Vec::with_capacity(capacity);
	let mut body = Vec::with_capacity(capacity + simd_json::SIMDJSON_PADDING);

Conversation

elrrrrrrr commented May 21, 2026

Summary

Size

Validation

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented May 22, 2026

📊 pm-bench-phases · 1810f70 · linux (ubuntu-latest)

npmjs.org

p0_full_cold

p1_resolve

p3_cold_install

p4_warm_link

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

📊 pm-bench-phases · `1810f70` · linux (`ubuntu-latest`)