[bounty] Video LLM for Search

Enable more powerful search using visual and audio context.

For example:
- [ ] Use Video LLMs like:
  - [ ] [Video-LLaMA](https://github.com/DAMO-NLP-SG/Video-LLaMA)
  - [ ] [Video-ChatGPT](https://github.com/OpenGVLab/Video-ChatGPT)
  - [ ] [MiniGPT-4 + CLIP](https://github.com/Vision-CAIR/MiniGPT-4)
- [ ] Convert video to frames + audio:
  - [ ] ffmpeg to extract frames/audio
- [ ] Send multimodal input to the LLM
- [ ] Output: searchable embeddings or semantic summaries
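The frame and audio extraction step in the checklist could look roughly like the sketch below. It only builds and runs `ffmpeg` command lines from Python. The function names (`frame_extract_cmd`, `audio_extract_cmd`, `extract`), the output layout, the 1 frame-per-second sampling rate, and the 16 kHz mono WAV format are all assumptions made for illustration, not requirements stated in this issue. It also assumes `ffmpeg` is available on `PATH`.

```python
import subprocess
from pathlib import Path


def frame_extract_cmd(video: str, out_dir: str, fps: float = 1.0) -> list[str]:
    """Build an ffmpeg command that samples `fps` frames per second as JPEGs."""
    # Hypothetical naming scheme: frame_000001.jpg, frame_000002.jpg, ...
    pattern = str(Path(out_dir) / "frame_%06d.jpg")
    return ["ffmpeg", "-y", "-i", video, "-vf", f"fps={fps}", pattern]


def audio_extract_cmd(video: str, out_wav: str, sample_rate: int = 16000) -> list[str]:
    """Build an ffmpeg command that drops video and writes mono 16-bit PCM audio."""
    return [
        "ffmpeg", "-y", "-i", video,
        "-vn",                    # ignore the video stream
        "-ac", "1",               # downmix to one channel
        "-ar", str(sample_rate),  # resample (16 kHz is an assumed default)
        "-c:a", "pcm_s16le",
        out_wav,
    ]


def extract(video: str, work_dir: str, fps: float = 1.0) -> tuple[Path, Path]:
    """Run both commands; returns (frames directory, audio file path)."""
    frames_dir = Path(work_dir) / "frames"
    frames_dir.mkdir(parents=True, exist_ok=True)
    audio_path = Path(work_dir) / "audio.wav"
    # check=True raises CalledProcessError if ffmpeg exits with an error.
    subprocess.run(frame_extract_cmd(video, str(frames_dir), fps), check=True)
    subprocess.run(audio_extract_cmd(video, str(audio_path)), check=True)
    return frames_dir, audio_path
```

Calling `extract("clip.mp4", "work")` would leave sampled frames under `work/frames/` and the audio track in `work/audio.wav`, ready to be passed to whichever Video LLM is chosen. The sampling rate is the main knob: a higher `fps` gives the model more visual context at the cost of more frames to process.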

All the exact things listed above will need to be done to receive the bounty.

Precision is important; otherwise the bounty cannot be awarded.

/bounty 400

This is necessary because it matches user needs.

This issue is a response/reply to issue #1142.


[bounty] Video LLM for Search #1857
