Open
Labels
enhancement (New feature or request)
Description
Enable more powerful search using visual and audio context.
For example:
- Use Video LLMs like:
- Convert video to frames + audio:
- ffmpeg to extract frames/audio
- Send multimodal input to the LLM
- Output: searchable embeddings or semantic summaries
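
The frame and audio extraction step listed above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's implementation: the function names (`frame_cmd`, `audio_cmd`, `extract`), the output file naming, the 1 frame-per-second sampling rate, and the 16 kHz mono WAV format are all hypothetical choices. It assumes `ffmpeg` is installed and on `PATH`, and that the input has an audio stream.

```python
import subprocess
from pathlib import Path


def frame_cmd(video: str, out_dir: str, fps: float = 1.0) -> list[str]:
    """Build an ffmpeg command that samples `fps` frames per second as JPEGs."""
    # Hypothetical naming scheme: frame_00001.jpg, frame_00002.jpg, ...
    pattern = str(Path(out_dir) / "frame_%05d.jpg")
    return ["ffmpeg", "-y", "-i", video, "-vf", f"fps={fps}", pattern]


def audio_cmd(video: str, out_dir: str, sample_rate: int = 16000) -> list[str]:
    """Build an ffmpeg command that drops the video stream and writes mono WAV."""
    wav = str(Path(out_dir) / "audio.wav")
    # -vn: no video, -ac 1: one channel, -ar: audio sample rate in Hz
    return ["ffmpeg", "-y", "-i", video, "-vn", "-ac", "1",
            "-ar", str(sample_rate), wav]


def extract(video: str, out_dir: str, fps: float = 1.0) -> tuple[list[Path], Path]:
    """Run both commands and return the frame paths plus the audio path."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    # check=True raises CalledProcessError if ffmpeg exits with an error
    subprocess.run(frame_cmd(video, out_dir, fps), check=True)
    subprocess.run(audio_cmd(video, out_dir), check=True)
    return sorted(out.glob("frame_*.jpg")), out / "audio.wav"
```

Calling `extract("clip.mp4", "out")` would leave the sampled frames and `audio.wav` in `out/`, ready to be passed to whichever multimodal model is chosen. Building the commands in separate functions keeps them inspectable without running `ffmpeg`.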
all the exact things that will need to be done to receive the bounty.
Precision is important; otherwise the bounty cannot be awarded.
/bounty 400
This is necessary because it matches user needs.
This issue is a response/reply to issue #1142.