automation script to pull models.yml by Light2Dark · Pull Request #9635 · marimo-team/marimo

Light2Dark · 2026-05-21T02:09:16Z

📝 Summary

Models.yml is used to populate our models list. We manually updated this. This PR introduces a script to automate, although not perfect.
Add automation script that needs to be run manually to pull latest models data. By default pulls latest 10 models from each provider. We can pass params to control this.
Enriched the models data with more info (cost, capabilities)
Adds a skill to write descriptions.

Pulling this data from https://models.dev. It has an open-source API. Considered openrouter as well, but it has a slightly different structure. Anyway, easy to changeover if needed.

Some models don't exist on the API, maybe we can cross-check. I've removed them manually for now (eg. gpt-5.5-codex-spark)

We could put this into a github actions workflow in the future.

📋 Pre-Review Checklist

For large changes, or changes that affect the public API: this change was discussed or approved through an issue, on Discord, or the community discussions (Please provide a link if applicable).
Any AI generated code has been reviewed line-by-line by the human PR author, who stands by it.
Video or media evidence is provided for any visual changes (optional).

✅ Merge Checklist

I have read the contributor guidelines.
Documentation has been updated where applicable, including docstrings for API changes.
Tests have been added for the changes made.

vercel · 2026-05-21T02:09:22Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
marimo-docs	Ready	Preview, Comment	May 21, 2026 2:48am

cubic-dev-ai · 2026-05-21T02:09:26Z

This PR is large and would use a significant portion of your monthly review quota. Comment @cubic-dev-ai review this to confirm that you want cubic to review it.

codecov · 2026-05-21T02:11:50Z

Bundle Report

Changes will increase total bundle size by 14.39kB (0.06%) ⬆️. This is within the configured threshold ✅

Detailed changes

Bundle name	Size	Change
marimo-esm	25.28MB	14.39kB (0.06%) ⬆️

Affected Assets, Files, and Routes:

view changes for bundle: marimo-esm

Assets Changed:

Asset Name	Size Change	Total Size	Change (%)
`assets/dist-*.js`	-60 bytes	104 bytes	-36.59%
`assets/dist-*.js`	-14 bytes	169 bytes	-7.65%
`assets/dist-*.js`	-32 bytes	137 bytes	-18.93%
`assets/dist-*.js`	-233 bytes	102 bytes	-69.55%
`assets/dist-*.js`	60 bytes	164 bytes	57.69% ⚠️
`assets/dist-*.js`	7 bytes	183 bytes	3.98%
`assets/dist-*.js`	-99 bytes	177 bytes	-35.87%
`assets/dist-*.js`	-111 bytes	276 bytes	-28.68%
`assets/dist-*.js`	119 bytes	256 bytes	86.86% ⚠️
`assets/dist-*.js`	-155 bytes	104 bytes	-59.85%
`assets/dist-*.js`	90 bytes	259 bytes	53.25% ⚠️
`assets/dist-*.js`	-23 bytes	160 bytes	-12.57%
`assets/dist-*.js`	301 bytes	403 bytes	295.1% ⚠️
`assets/dist-*.js`	79 bytes	335 bytes	30.86% ⚠️
`assets/dist-*.js`	72 bytes	176 bytes	69.23% ⚠️
`assets/dist-*.js`	-40 bytes	137 bytes	-22.6%
`assets/dist-*.js`	46 bytes	183 bytes	33.58% ⚠️
`assets/dist-*.js`	65 bytes	169 bytes	62.5% ⚠️
`assets/dist-*.js`	-56 bytes	104 bytes	-35.0%
`assets/dist-*.js`	-16 bytes	387 bytes	-3.97%
`assets/ai-*.js`	14.39kB	271.67kB	5.6% ⚠️

Files in assets/ai-*.js:

./src/components/app-config/ai-config.tsx → Total Size: 76.68kB
./src/core/ai/model-registry.ts → Total Size: 4.9kB
./src/components/ai/ai-model-dropdown.tsx → Total Size: 17.95kB

cubic-dev-ai · 2026-05-21T02:16:10Z

This PR is large and would use a significant portion of your monthly review quota. Comment @cubic-dev-ai review this to confirm that you want cubic to review it.

cubic-dev-ai · 2026-05-21T02:17:53Z

This PR is large and would use a significant portion of your monthly review quota. Comment @cubic-dev-ai review this to confirm that you want cubic to review it.

Copilot

Pull request overview

This PR introduces a manual sync workflow for packages/llm-info/data/models.yml from the public models.dev catalog, restructures the model catalog to be provider-keyed, and updates codegen + frontend consumption to match the new schema (capabilities, modalities, pricing, release dates).

Changes:

Add a pnpm sync-models script + implementation to fetch models.dev/api.json and append/replace entries in models.yml while preserving curated entries/comments.
Change the llm-info model schema and data layout to a top-level provider map, enriching entries with capabilities, modalities, release dates, and cost.
Update codegen/tests and frontend model registry + UI “thinking” indicator to use the new structure.

Reviewed changes

Copilot reviewed 19 out of 19 changed files in this pull request and generated 6 comments.

Show a summary per file

File	Description
packages/llm-info/src/sync-models.ts	CLI entrypoint + YAML-preserving append/replace writer for `models.yml`.
packages/llm-info/src/cli.ts	Parses `sync-models` CLI flags (mode, providers filter, per-provider cap).
packages/llm-info/src/sources/models-dev.ts	Fetches/parses `models.dev` API response with Zod validation and warnings.
packages/llm-info/src/sources/merge.ts	Merges models.dev data into local provider buckets with trimming/sorting.
packages/llm-info/src/index.ts	Updates exported types (capabilities, modalities, provider-keyed structure).
packages/llm-info/src/generate.ts	Updates codegen validation + JSON structure to provider-keyed `models`.
packages/llm-info/src/tests/sync-models.test.ts	Adds comprehensive tests for merge + sync behavior and YAML formatting preservation.
packages/llm-info/src/tests/schema.test.ts	Updates schema tests for new model entry shape + provider-keyed YAML.
packages/llm-info/src/tests/json-structure.test.ts	Updates JSON shape expectations (`models` is a provider-keyed map).
packages/llm-info/data/models.yml	Converts catalog to provider-keyed sections and adds enriched fields.
packages/llm-info/package.json	Adds `sync-models` script.
packages/llm-info/README.md	Documents `pnpm sync-models` usage examples.
packages/llm-info/skills/SKILL.md	Adds a documented workflow for backfilling empty descriptions (Cursor skill).
.cursor/skills/fill-model-descriptions/SKILL.md	Same skill doc mirrored under `.cursor/skills`.
frontend/src/core/ai/model-registry.ts	Updates frontend registry to consume provider-keyed models + rehydrate dates + provider field.
frontend/src/core/ai/tests/model-registry.test.ts	Updates mocks/expectations for provider-keyed models and provider-owned entries.
frontend/src/components/app-config/ai-config.tsx	Switches “thinking” badge to `capabilities.includes("thinking")`.
frontend/src/components/ai/ai-model-dropdown.tsx	Switches “thinking” indicators to `capabilities.includes("thinking")`.
frontend/src/components/ai/tests/ai-utils.test.ts	Updates models.json mock to provider-keyed format + new fields.

+  capabilities: Capability[];
+  input_types: DataType[];
+  output_types: DataType[];
+  release_date: Date;


cubic-dev-ai

9 issues found across 19 files

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="packages/llm-info/src/cli.ts">

<violation number="1" location="packages/llm-info/src/cli.ts:35">
P2: `--mode replace` is ignored. Use `getFlag()` here so the spaced form does not fall back to append mode.</violation>

<violation number="2" location="packages/llm-info/src/cli.ts:73">
P1: Empty provider list should fail. In `--replace` mode, a typo can rewrite `models.yml` from an empty result set.</violation>
</file>

<file name="packages/llm-info/src/sync-models.ts">

<violation number="1" location="packages/llm-info/src/sync-models.ts:237">
P2: Cap is not per final provider. `google` can get trimmed entries from both `google` and `google-vertex`, so `-n 5` can still append 10 models. Trim after merging into one bucket.</violation>
</file>

Architecture diagram

sequenceDiagram
    participant CLI as Sync Script (cli.ts)
    participant Sync as sync-models.ts
    participant Merge as sources/merge.ts
    participant ModelsDev as sources/models-dev.ts
    participant API as models.dev API
    participant YAML as data/models.yml
    participant Codegen as generate.ts
    participant JSON as data/generated/models.json
    participant Frontend as Frontend Components
    participant Registry as model-registry.ts

    Note over CLI,Registry: Model Sync and Consumption Flow

    CLI->>Sync: pnpm sync-models [--replace] [-n 10] [-p openai,google]
    Sync->>ModelsDev: fetchModelsDev()
    ModelsDev->>API: GET https://models.dev/api.json
    API-->>ModelsDev: JSON response
    ModelsDev->>ModelsDev: parseModelsDev() – validate with Zod schema
    ModelsDev-->>Sync: ModelsDevApi object

    Sync->>YAML: readFileSync(models.yml)
    YAML-->>Sync: YAML text
    Sync->>Sync: parseExistingModels() – extract provider-model pairs

    Sync->>Merge: mergeModels(existing, modelsDev, options)
    Merge->>Merge: For each provider in PROVIDER_MAP:
    Merge->>Merge:   - Build AiModel entries from API data
    Merge->>Merge:   - Derive roles, capabilities, cost, modalities
    Merge->>Merge:   - Sort newest-first, cap at maxPerProvider
    Merge->>Merge:   - Skip models that already exist locally
    Merge-->>Sync: MergeSummary (newEntries, preservedCount)

    alt mode === "append"
        Sync->>Sync: appendIntoDocument() – append new entries to existing YAML
    else mode === "replace"
        Sync->>Sync: renderFresh() – generate entirely new YAML
    end

    Sync->>YAML: writeFileSync(models.yml)

    Note over Codegen: Codegen runs separately
    Codegen->>YAML: readFileSync + parse
    Codegen->>Codegen: Validate with ModelsByProviderSchema
    Codegen->>JSON: writeFileSync(models.json)

    Note over Frontend: Runtime consumption
    Frontend->>JSON: import models.json
    JSON-->>Frontend: { models: { providerId: AiModel[] } }
    Frontend->>Registry: getKnownModelMaps()
    Registry->>Registry: Flatten per-provider arrays into single Map
    Registry->>Registry: Each model gets its own provider field
    Registry-->>Frontend: QualifiedModelId → AiModel

    alt User opens model dropdown
        Frontend->>Registry: getModelsByProvider(provider)
        Registry-->>Frontend: AiModel[]
        Frontend->>Frontend: Check model.capabilities.includes("thinking")
    end

    Note over Registry: Key schema changes
    Note over Registry: providers: string[] → provider: string (single)
    Note over Registry: thinking: boolean → capabilities: string[]
    Note over Registry: New fields: input_types, output_types, release_date, cost

_{Reply with feedback, questions, or to request a fix.

Re-trigger cubic}

cubic-dev-ai · 2026-05-21T02:32:15Z

+    modelsYamlPath,
+    write = true,
+    mode = "append",
+    maxPerProvider,


P2: Cap is not per final provider. google can get trimmed entries from both google and google-vertex, so -n 5 can still append 10 models. Trim after merging into one bucket.

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At packages/llm-info/src/sync-models.ts, line 237: <comment>Cap is not per final provider. `google` can get trimmed entries from both `google` and `google-vertex`, so `-n 5` can still append 10 models. Trim after merging into one bucket.</comment> <file context> @@ -0,0 +1,304 @@ + modelsYamlPath, + write = true, + mode = "append", + maxPerProvider, + providers, + } = options; </file context>

This was addressed in 2f7425a — the dedup landed in merge.ts rather than sync-models.ts, so the diff for this file looks unchanged but the underlying behaviour is fixed.

mergeModels now accumulates candidates per marimo provider into a single Map<modelId, AiModel>, then calls sortAndTrim once per marimo provider. See merge.ts ~L174-L212 (candidatesByProvider).

Regression test in packages/llm-info/src/__tests__/sync-models.test.ts:

it("enforces `maxPerProvider` after deduping across mapped providers", ...)

passes google + google-vertex with overlapping ids and maxPerProvider: 2, and asserts the merged bucket has exactly 2 entries. So -n 5 cannot produce 10 anymore.

Thanks for the feedback.

cubic-dev-ai

1 issue found across 9 files (changes from recent commits).

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="packages/llm-info/src/sync-models.ts">

<violation number="1" location="packages/llm-info/src/sync-models.ts:237">
P2: Cap is not per final provider. `google` can get trimmed entries from both `google` and `google-vertex`, so `-n 5` can still append 10 models. Trim after merging into one bucket.</violation>
</file>

_{Reply with feedback, questions, or to request a fix.

Re-trigger cubic}

cubic-dev-ai

1 issue found across 4 files (changes from recent commits).

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="packages/llm-info/src/sync-models.ts">

<violation number="1" location="packages/llm-info/src/sync-models.ts:237">
P2: Cap is not per final provider. `google` can get trimmed entries from both `google` and `google-vertex`, so `-n 5` can still append 10 models. Trim after merging into one bucket.</violation>
</file>

_{Tip: Review your code locally with the cubic CLI to iterate faster.

Re-trigger cubic}

automation script

93e7f2b

vercel Bot deployed to Preview May 21, 2026 02:09 View deployment

update readme and skill.md

0a8aa34

Light2Dark added the enhancement New feature or request label May 21, 2026

vercel Bot deployed to Preview May 21, 2026 02:16 View deployment

update readme

32ac2b8

vercel Bot deployed to Preview May 21, 2026 02:18 View deployment

Light2Dark requested a review from mscolnick May 21, 2026 02:19

remove snapshot storing

0b4ac35

vercel Bot deployed to Preview May 21, 2026 02:22 View deployment

Light2Dark marked this pull request as ready for review May 21, 2026 02:25

Copilot AI review requested due to automatic review settings May 21, 2026 02:25

Copilot started reviewing on behalf of Light2Dark May 21, 2026 02:25 View session

Light2Dark requested a review from manzt May 21, 2026 02:27

Copilot AI reviewed May 21, 2026

View reviewed changes

cubic-dev-ai Bot reviewed May 21, 2026

View reviewed changes

address comments

2f7425a

vercel Bot deployed to Preview May 21, 2026 02:40 View deployment

cubic-dev-ai Bot reviewed May 21, 2026

View reviewed changes

Comment thread packages/llm-info/src/sources/merge.ts Outdated

fixes for dates

b199bb0

vercel Bot deployed to Preview May 21, 2026 02:48 View deployment

cubic-dev-ai Bot reviewed May 21, 2026

View reviewed changes

Comment thread packages/llm-info/src/sources/merge.ts

Conversation

Light2Dark commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📝 Summary

📋 Pre-Review Checklist

✅ Merge Checklist

Uh oh!

vercel Bot commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cubic-dev-ai Bot commented May 21, 2026

Uh oh!

codecov Bot commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bundle Report

Affected Assets, Files, and Routes:

Assets Changed:

Uh oh!

cubic-dev-ai Bot commented May 21, 2026

Uh oh!

cubic-dev-ai Bot commented May 21, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Light2Dark May 21, 2026

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cubic-dev-ai Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Light2Dark commented May 21, 2026 •

edited

Loading

vercel Bot commented May 21, 2026 •

edited

Loading

codecov Bot commented May 21, 2026 •

edited

Loading

cubic-dev-ai Bot left a comment •

edited

Loading

cubic-dev-ai Bot left a comment •

edited

Loading

cubic-dev-ai Bot left a comment •

edited

Loading