Show downloaded models, improve error handling #456

cadenmackenzie · 2024-11-13T23:51:20Z

I wanted a way to show which models have already been downloaded. Ultimately, I would like to add the ability to manage those models such as deleting them. Might make more sense in a sidebar where you chose the model and can manage it.

I also added more robust handling of errors in index.html to safely access them as I saw some warnings in the main branch when no errors were set.

In index.js I combined error handling into a single flow for the populateSelector method using response.json() instead of manually parsing. I also added a helper method to set the error to reduce duplicate code. Something I should have done with my last PR.

Open to suggestions on these changes.

…ex.js

cadenmackenzie · 2024-11-13T23:53:22Z

I am seeing now some of the robust error handling was intentional and merged in here. I can change back if that is preferred.

dtnewman · 2024-11-14T02:03:01Z

+1 on the concept, but this is an issue:

dtnewman · 2024-11-14T02:05:11Z

+1 on the concept, but this is an issue:

Oh wait, that's in Safari. It actually works on Chrome and Firefox. Still, should probably fix before merge.

dtnewman · 2024-11-14T02:21:48Z

+1 on the concept, but this is an issue:

Oh wait, that's in Safari. It actually works on Chrome and Firefox. Still, should probably fix before merge.

I fixed these issue in https://github.com/cadenmackenzie/exo/pull/1/files which you can merge into this PR

fix safari issue

cadenmackenzie · 2024-11-14T02:59:36Z

Thanks for finding that! I went ahead and merged it in.

AlexCheema

What a great idea!
Definitely worthy of a $200 retrospective bounty.

I think the implementation right now is not great -- I'd prefer if we lean on the existing HFShardDownloader since that knows how to do downloads. You might need to add a new abstract method to ShardDownloader something like get_shard_download_status and it should use the same functionality already in hf_helpers.py to check what the percentage completion is.

Also, a few suggest changes to the UI: if the model is partially downloaded (percentage > 0) then display the % next to the model name in the UI.

cadenmackenzie · 2024-11-14T06:04:22Z

Awesome!

Got it working with the suggested changes. Want to do some testing in the morning then will update this PR.

…ble is set

working versions

cadenmackenzie · 2024-11-14T20:03:19Z

Added new abstract method in ShardDownloader, implemented get_shard_download_status in HFShardDownloader leaning on get_local_snapshot_dir, get_weight_map, get_allow_patterns helper functions and then checks the percentage of model downloaded for models where local files are found. Removed old function for checking percentage.

Currently shows "downloaded" for fully downloaded models and "X% downloaded" for models not fully downloaded in dropdown.

I don't love how this is being refreshed using the modelPoolInterval because it lags a little for models that are actively being downloaded so might work on how to improve that.

I think it could be worth moving this logic as well as active downloads to a sidebar. I like how active downloads are being shown in the chat when initiated but if we moved that to a sidebar, it could be a centralized place to view all models, choose a model, see activity of downloaded models, and being able to remove local downloads to free up space. Similar to how you can remove local models in LM Studio.

Open to suggestions.

AlexCheema

Fix the download status code. Otherwise good.

AlexCheema · 2024-11-16T08:06:28Z

exo/download/hf/hf_shard_download.py

+        local_sizes = {}
+
+        for pattern in patterns:
+            if pattern.endswith('safetensors') or pattern.endswith('mlx'):


This doesn't quite work. A pattern can match multiple files.

AlexCheema · 2024-11-16T08:07:02Z

exo/download/hf/hf_shard_download.py

@@ -77,3 +87,55 @@ async def wrapped_progress_callback(event: RepoProgressEvent):
  @property
  def on_progress(self) -> AsyncCallbackSystem[str, Tuple[Shard, RepoProgressEvent]]:
    return self._on_progress
+
+  async def get_shard_download_status(self) -> Optional[Dict[str, float]]:


Generally there seems to be a lot of duplication between this and other functions. I think a refactor should be done here.

…ownloading models

cadenmackenzie · 2024-11-17T00:37:42Z

Found an issue in handle_model_support that was creating HFShardDownloader without quick_check=true so it was starting download of models when being checked for download percentage.

Will work on get_shard_download_status refactor

cadenmackenzie · 2024-11-17T20:49:00Z

Hi @AlexCheema can you give me some more insight into what you want to be refactored? I initially thought we could reuse some of the download percentage logic that is happening during download but as far as I can tell, that is only checking against the remote during download in download_file(). Would you want to pull some of that logic out into another helper function to use in get_shard_download_status() or am I missing something?

AlexCheema · 2024-11-18T17:16:36Z

Hi @AlexCheema can you give me some more insight into what you want to be refactored? I initially thought we could reuse some of the download percentage logic that is happening during download but as far as I can tell, that is only checking against the remote during download in download_file(). Would you want to pull some of that logic out into another helper function to use in get_shard_download_status() or am I missing something?

Yeah I think pulling some of that logic out would make sense. I think it's in general in need of a good refactor if you want to take a go at that and will bump up the bounty to $300.

…centage in hf_shard_download

…d_file to use that helper

…ensor files to properly check the size with GET request

Hf helper refactor

cadenmackenzie · 2024-11-18T23:26:44Z

Hi Alex, please review. I didn't modify download_file much but added a check using the new helper method.

Modified the get_shard_download_status to lean on the helper methods to calculate the overall percentage of files and return that. Also updated chatgpt_api to use that overall instead doing percent calculation there. Should fix the pattern matching issue that you identified as well.

cadenmackenzie · 2024-11-20T18:07:22Z

@AlexCheema pinging for your review

AlexCheema · 2024-11-21T13:16:38Z

Please resolve conflicts and ping me again. @cadenmackenzie

cadenmackenzie · 2024-11-21T15:38:40Z

@AlexCheema resolved

updates from main

AlexCheema · 2024-11-22T15:29:22Z

Did you test this after the merge? Models aren't loading and getting syntax errors.

AlexCheema · 2024-11-22T15:29:51Z

Please assign me when fixed and ready for me to review @cadenmackenzie

cadenmackenzie · 2024-11-22T16:26:52Z

Hi @AlexCheema , my apologies. I did test it and it was working for me even without the inference_engine_classes defined. Not sure why.

I fixed it and installed Pylance.

AlexCheema · 2024-11-22T17:17:26Z

/modelpool is hanging for me.
nothing shows up in the tinychat ui

cadenmackenzie added 8 commits November 13, 2024 15:06

adding logic to check which models are downloaded

c7dd312

reusing helper function to get cached directory

de09e2a

removing uneccesary console logs and fixing order of variables in ind…

7d7bdd8

…ex.js

removing error separtation so I can put in different PR

fb32a85

adding back in set error message

59f5b6d

cleaning up logging in index.js

25d67f5

removing unneccesary css

95ce665

removing sorting of models by name

3eb726c

fix safari issue

cbeb1b3

Merge pull request #1 from dtnewman/dn/downloadModelsV2

372d873

fix safari issue

AlexCheema requested changes Nov 14, 2024

View reviewed changes

cadenmackenzie added 5 commits November 14, 2024 09:33

working versions

d9aabd7

removing is_model_downloaded method and changing how downloaded varia…

dfcf513

…ble is set

reducing redundent checks

972074e

removing checking of percentage for models that are not found locally

dd38924

Merge pull request #2 from cadenmackenzie/downloadedModelsV2Revisions

bd2985a

working versions

AlexCheema requested changes Nov 16, 2024

View reviewed changes

creating HFShardDownloader with quick_check true so it doesnt start d…

649157d

…ownloading models

cadenmackenzie added 2 commits November 18, 2024 13:53

modifying how its being displayed becuase now calculating overall per…

c923ef6

…centage in hf_shard_download

adding helper funciton to check file download. also modifying downloa…

c61f40c

…d_file to use that helper

cadenmackenzie added 9 commits November 18, 2024 13:54

modify get_shard_download_status to use helper function

dec79ac

modifying helper fucntion checking size to follow redirect for .safet…

4c6fda7

…ensor files to properly check the size with GET request

adding redirect for all requests

3ac8687

comment

3256051

removing traceback

db610f5

removing path update

6a7de04

Merge pull request #4 from cadenmackenzie/hf_helperRefactor

fad0591

Hf helper refactor

moving os import

b77362b

removing import get_hf_home

695ab34

cadenmackenzie added 4 commits November 18, 2024 17:42

fixing formatting

8135437

fixing formatting

91276cc

yapf formatting

8ee6cc3

yapf in download_file

0d50167

cadenmackenzie added 2 commits November 21, 2024 07:33

Merge branch 'main' into downloadedModelsV2

2cdd55d

defining optional

1ca11ea

cadenmackenzie added 2 commits November 21, 2024 14:16

Merge pull request #5 from cadenmackenzie/main

7a8c722

updates from main

remvoing console log

7e6c69f

AlexCheema assigned cadenmackenzie Nov 22, 2024

fixiing required engines definition

39139c1

cadenmackenzie removed their assignment Nov 22, 2024

cadenmackenzie requested a review from AlexCheema November 22, 2024 16:29

AlexCheema assigned cadenmackenzie Nov 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Show downloaded models, improve error handling #456

Show downloaded models, improve error handling #456

cadenmackenzie commented Nov 13, 2024

cadenmackenzie commented Nov 13, 2024

dtnewman commented Nov 14, 2024

dtnewman commented Nov 14, 2024

dtnewman commented Nov 14, 2024 •

edited

Loading

cadenmackenzie commented Nov 14, 2024

AlexCheema left a comment

cadenmackenzie commented Nov 14, 2024

cadenmackenzie commented Nov 14, 2024

AlexCheema left a comment

AlexCheema Nov 16, 2024

AlexCheema Nov 16, 2024

cadenmackenzie commented Nov 17, 2024

cadenmackenzie commented Nov 17, 2024

AlexCheema commented Nov 18, 2024

cadenmackenzie commented Nov 18, 2024

cadenmackenzie commented Nov 20, 2024

AlexCheema commented Nov 21, 2024

cadenmackenzie commented Nov 21, 2024

AlexCheema commented Nov 22, 2024

AlexCheema commented Nov 22, 2024

cadenmackenzie commented Nov 22, 2024 •

edited

Loading

AlexCheema commented Nov 22, 2024

Show downloaded models, improve error handling #456

Are you sure you want to change the base?

Show downloaded models, improve error handling #456

Conversation

cadenmackenzie commented Nov 13, 2024

cadenmackenzie commented Nov 13, 2024

dtnewman commented Nov 14, 2024

dtnewman commented Nov 14, 2024

dtnewman commented Nov 14, 2024 • edited Loading

cadenmackenzie commented Nov 14, 2024

AlexCheema left a comment

Choose a reason for hiding this comment

cadenmackenzie commented Nov 14, 2024

cadenmackenzie commented Nov 14, 2024

AlexCheema left a comment

Choose a reason for hiding this comment

AlexCheema Nov 16, 2024

Choose a reason for hiding this comment

AlexCheema Nov 16, 2024

Choose a reason for hiding this comment

cadenmackenzie commented Nov 17, 2024

cadenmackenzie commented Nov 17, 2024

AlexCheema commented Nov 18, 2024

cadenmackenzie commented Nov 18, 2024

cadenmackenzie commented Nov 20, 2024

AlexCheema commented Nov 21, 2024

cadenmackenzie commented Nov 21, 2024

AlexCheema commented Nov 22, 2024

AlexCheema commented Nov 22, 2024

cadenmackenzie commented Nov 22, 2024 • edited Loading

AlexCheema commented Nov 22, 2024

dtnewman commented Nov 14, 2024 •

edited

Loading

cadenmackenzie commented Nov 22, 2024 •

edited

Loading