✨enhancement: Model sources should not be hard coded from frontend #5270

louis-menlo · 2025-06-14T11:54:30Z

Describe Your Changes

Currently, Jan-nano is updated manually in the frontend, which requires the app to be updated every time we change the model repo structure.

Move it to an extension and make it dynamically synced. The current repo info ships with the extension, but it updates itself with the latest stats, such as download numbers.
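
A minimal sketch of the "ship a snapshot, refresh the stats" idea described above. The `ModelSource` shape, the `applyLatestStats` helper, and the sample ids and numbers are hypothetical and only illustrate the approach; the extension's real types and sync logic may differ.

```typescript
// Hypothetical shape of a bundled model source entry.
interface ModelSource {
  id: string
  downloads: number
}

// Snapshot shipped with the extension (illustrative values only).
const bundledSources: ModelSource[] = [
  { id: 'example/model-a', downloads: 100 },
  { id: 'example/model-b', downloads: 50 },
]

// Overlay fresher stats onto the bundled snapshot. Entries without a fresh
// value keep the bundled number, so the list stays usable when the sync
// fails or only partially succeeds.
function applyLatestStats(
  bundled: ModelSource[],
  latestDownloads: Map<string, number>
): ModelSource[] {
  return bundled.map((source) => ({
    ...source,
    downloads: latestDownloads.get(source.id) ?? source.downloads,
  }))
}

// Only model-a has fresh stats here; model-b falls back to its bundled value.
const refreshed = applyLatestStats(
  bundledSources,
  new Map([['example/model-a', 250]])
)
console.log(refreshed)
```

Returning a new array instead of mutating the snapshot keeps the bundled data intact as a fallback for the next sync attempt.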

Self Checklist

Added relevant comments, esp in complex areas
Updated docs (for bug fixes / features)
Created issues for follow-up changes or refactoring needed

Important

Model sources are now dynamically synced, removing hardcoded references in the frontend.

Behavior:
- Model sources are now dynamically synced in index.ts using defaultModelSources.
- Removed hardcoded model references in models.ts.
Extensions:
- Added defaultModelSources array in index.ts for default model sources.
- Updated fetchModelsHub() in index.ts to include defaultModelSources.
Services:
- Removed hardcodedModel from fetchModelSources() and fetchModelHub() in models.ts.
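
The summary above says the hub list now includes `defaultModelSources`. A rough sketch of one way such a merge could work is shown here; the `HubModel` shape, the `withDefaultSources` helper, and the sample entries are assumptions for illustration, not the code in `index.ts`.

```typescript
// Hypothetical minimal shape of a hub entry.
interface HubModel {
  id: string
  name: string
}

// Name taken from the PR summary; the contents here are illustrative only.
const defaultModelSources: HubModel[] = [
  { id: 'example/default-model', name: 'Default model' },
]

// Put the defaults first, then append fetched entries, skipping any id that
// was already seen so a default is never listed twice.
function withDefaultSources(
  defaults: HubModel[],
  fetched: HubModel[]
): HubModel[] {
  const seen = new Set<string>()
  const merged: HubModel[] = []
  for (const model of [...defaults, ...fetched]) {
    if (seen.has(model.id)) continue
    seen.add(model.id)
    merged.push(model)
  }
  return merged
}

// Stand-in for data returned by the hub (assumed, not real data).
const fetchedFromHub: HubModel[] = [
  { id: 'example/default-model', name: 'Default model (remote copy)' },
  { id: 'example/other-model', name: 'Other model' },
]

const hubModels = withDefaultSources(defaultModelSources, fetchedFromHub)
console.log(hubModels.map((m) => m.id))
```

In this sketch the bundled entry wins when ids collide; letting the fetched entry win instead would be an equally reasonable choice if the remote copy carries fresher metadata.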

This description was created by Ellipsis for 381e859. You can customize this summary. It will automatically update as commits are pushed.

ellipsis-dev

Caution

Changes requested ❌

Reviewed everything up to 2d70c18 in 1 minute and 39 seconds.

Reviewed 272 lines of code in 3 files
Skipped 0 files when reviewing.
Skipped posting 5 draft comments. View those below.

1. extensions/model-extension/src/index.ts:28

Draft comment:
Typo in comment: 'Defaul mode sources' should be 'Default model sources'.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

2. extensions/model-extension/src/index.ts:295

Draft comment:
DEFAULT_MODEL_SOURCES is used but not defined; use the declared 'defaultModelSources' instead.
Reason this comment was not posted:
Comment was not on a location in the diff, so it can't be submitted as a review comment.

3. web-app/src/services/models.ts:35

Draft comment:
The comment refers to 'hardcoded model' being prepended, which is outdated. Update or remove the comment to reflect current behavior.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

4. web-app/src/services/models.ts:52

Draft comment:
The comment about prepending the hardcoded model for hub data is misleading now. Update the comment to match the current logic.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

5. extensions/model-extension/src/index.ts:29

Draft comment:
Typo: 'Defaul mode sources' seems to be a mistake. Consider correcting it to 'Default model sources'.
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable (usefulness confidence = 10% vs. threshold = 50%). While this is technically correct and points out a real typo in new code, comments about minor typos in documentation are generally not important enough to warrant a PR comment. The typo doesn't affect functionality and is a very minor issue. Documentation typos are better handled through direct fixes or separate documentation reviews. The typo could cause confusion for future developers reading the code. Documentation quality is important for maintainability. While documentation quality matters, this particular typo is extremely minor and obvious. The meaning is still clear despite the typo. This comment should be deleted as it addresses a trivial documentation issue that doesn't meaningfully impact code quality or functionality.

Workflow ID: wflow_WCOAXpjKzeAoSTtB


extensions/model-extension/src/index.ts

ellipsis-dev

Important

Looks good to me! 👍

Reviewed 381e859 in 40 seconds.

Reviewed 13 lines of code in 1 file
Skipped 0 files when reviewing.
Skipped posting 1 draft comment. View it below.

1. extensions/model-extension/resources/default.json:127

Draft comment:
Ensure the model id update ('jan-nano-4b') aligns with the new naming convention and that the file size remains accurate.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

Workflow ID: wflow_e2a0vqKBnZxrxlgG


* chore: enable shortcut zoom (#5261)
* chore: enable shortcut zoom
* chore: update shortcut setting
* fix: thinking block (#5263)
* Merge pull request #5262 from menloresearch/chore/sync-new-hub-data chore: sync new hub data
* ✨enhancement: model run improvement (#5268)
* fix: mcp tool error handling
* fix: error message
* fix: trigger download from recommend model
* fix: can't scroll hub
* fix: show progress
* ✨enhancement: prompt users to increase context size
* ✨enhancement: rearrange action buttons for a better UX
* 🔧chore: clean up logics
---------
Co-authored-by: Faisal Amir
* fix: glitch download from onboarding (#5269)
* ✨enhancement: Model sources should not be hard coded from frontend (#5270)
* 🐛fix: default onboarding model should use recommended quantizations (#5273)
* 🐛fix: default onboarding model should use recommended quantizations
* ✨enhancement: show context shift option in provider settings
* 🔧chore: wording
* 🔧 config: add to gitignore
* 🐛fix: Jan-nano repo name changed (#5274)
* 🚧 wip: disable showSpeedToken in ChatInput
* 🐛 fix: commented out the wrong import
* fix: masking value MCP env field (#5276)
* ✨ feat: add token speed to each message that persist
* ♻️ refactor: to follow prettier convention
* 🐛 fix: exclude deleted field
* 🧹 clean: all the missed console.log
* ✨enhancement: out of context troubleshooting (#5275)
* ✨enhancement: out of context troubleshooting
* 🔧refactor: clean up
* ✨enhancement: add setting chat width container (#5289)
* ✨enhancement: add setting conversation width
* ✨enahncement: cleanup log and change improve accesibility
* ✨enahcement: move const beta version
* 🐛fix: optional additional_information gpu (#5291)
* 🐛fix: showing release notes for beta and prod (#5292)
* 🐛fix: showing release notes for beta and prod
* ♻️refactor: make an utils env
* ♻️refactor: hide MCP for production
* ♻️refactor: simplify the boolean expression fetch release note
* 🐛fix: typo in build type check (#5297)
* 🐛fix: remove onboarding local model and hide the edit capabilities model (#5301)
* 🐛fix: remove onboarding local model and hide the edit capabilities model
* ♻️refactor: conditional search params setup screen
* 🐛fix: hide token speed when assistant params stream false (#5302)
* 🐛fix: glitch padding speed token (#5307)
* 🐛fix: immediately show download progress (#5308)
* 🐛fix:safely convert values to numbers and handle NaN cases (#5309)
* chore: correct binary name for stable version (#5303) (#5311)
Co-authored-by: hiento09
* 🐛fix: llama.cpp default NGL setting does not offload all layers to GPU (#5310)
* 🐛fix: llama.cpp default NGL setting does not offload all layers to GPU
* chore: cover more cases
* chore: clean up
* fix: should not show GPU section on Mac
* 🐛fix: update default extension settings (#5315)
* fix: update default extension settings
* chore: hide language setting on Prod
* 🐛fix: allow script posthog (#5316)
* Sync 0.5.18 to 0.6.0 (#5320)
* chore: correct binary name for stable version (#5303)
* ci: enable devtool on prod build (#5317)
* ci: enable devtool on prod build
---------
Co-authored-by: hiento09
Co-authored-by: Nguyen Ngoc Minh
* fix: glitch model download issue (#5322)
* 🐛 fix(updater): terminate sidecar processes before update to avoid file access errors (#5325)
* 🐛 fix: disable sorting for threads in SortableItem and clean up thread order handling (#5326)
* improved wording in UI elements (#5323)
* fix: sorted-thread-not-stable (#5336)
* 🐛fix: update wording desc vulkan (#5338)
* 🐛fix: update wording desc vulkan
* ✨enhancement: update copy
* 🐛fix: handle NaN value tokenspeed (#5339)
* 🐛 fix: window path problem
* feat(server): filter /models endpoint to show only downloaded models (#5343)
  - Add filtering logic to proxy server for GET /models requests
  - Keep only models with status "downloaded" in response
  - Remove Content-Length header to prevent mismatch after filtering
  - Support both ListModelsResponseDto and direct array formats
  - Add comprehensive tests for filtering functionality
  - Fix Content-Length header conflict causing empty responses
  Fixes issue where all models were returned regardless of download status.
* 🐛fix: render streaming token speed based on thread ID & assistant metadata (#5346)
* fix(server): add gzip decompression support for /models endpoint filtering (#5349)
  - Add gzip detection using magic number check (0x1f 0x8b)
  - Implement gzip decompression before JSON parsing
  - Add gzip re-compression for filtered responses
  - Fix "invalid utf-8 sequence" error when upstream returns gzipped content
  - Maintain Content-Encoding consistency for compressed responses
  - Add comprehensive gzip handling with flate2 library
  Resolves issue where filtering failed on gzip-compressed model responses.
* fix(proxy): implement true HTTP streaming for chat completions API (#5350)
* fix: glitch toggle gpus (#5353)
* fix: glitch toogle gpu
* fix: Using the GPU's array index as a key for gpuLoading
* enhancement: added try-finally
* fix: built in models capabilities (#5354)
* 🐛fix: setting provider hide model capabilities (#5355)
* 🐛fix: setting provider hide model capabilities
* 🐛fix: hide tools icon on dropdown model providers
* fix: stop server on app close or reload
* ✨enhancement: reset heading class
---------
Co-authored-by: Louis
* fix: stop api server on page unload (#5356)
* fix: stop api server on page unload
* fix: check api server status on reload
* refactor: api server state
* fix: should not pop the guard
* 🐛fix: avoid render html title thread (#5375)
* 🐛fix: avoid render html title thread
* chore: minor bump - tokenjs for manual adding models
---------
Co-authored-by: Louis
---------
Co-authored-by: Faisal Amir
Co-authored-by: LazyYuuki
Co-authored-by: Bui Quang Huy
Co-authored-by: hiento09
Co-authored-by: Nguyen Ngoc Minh
Co-authored-by: Sam Hoang Van
Co-authored-by: Ramon Perez

✨enhancement: Model sources should not be hard coded from frontend

2d70c18

github-project-automation bot added this to Jan Jun 14, 2025

louis-menlo temporarily deployed to production June 14, 2025 11:54 — with GitHub Actions Inactive

github-actions bot assigned louis-menlo Jun 14, 2025

urmauur approved these changes Jun 14, 2025

ellipsis-dev bot reviewed Jun 14, 2025

extensions/model-extension/src/index.ts (review thread resolved)

🐛fix: correct iQ4_XS model id

381e859

louis-menlo temporarily deployed to production June 14, 2025 12:01 — with GitHub Actions Inactive

ellipsis-dev bot reviewed Jun 14, 2025

louis-menlo merged commit a10dbef into release/v0.5.18 Jun 14, 2025
18 checks passed

louis-menlo deleted the fix/remove-hard-coded-model-sources branch June 14, 2025 15:00

github-project-automation bot moved this to QA in Jan Jun 14, 2025

github-actions bot added this to the v0.5.19 milestone Jun 14, 2025

david-menloai moved this from QA to Done in Jan Jun 20, 2025

LazyYuuki removed this from the v0.7.2 milestone Jul 3, 2025
