Skip to content

[bounty] Video LLM for Search #1857

@BenraouaneSoufiane

Description

@BenraouaneSoufiane

Enable more powerful search using visual and audio context.

eg

  • Use Video LLMs like:
  • Convert video to frames + audio:
    • ffmpeg to extract frames/audio
  • Send multimodal input to the LLM
  • Output: searchable embeddings or semantic summaries

all the exact things that will need to be done to receive the bounty.

precision is important otherwise the bounty cannot be awarded.

/bounty 400

This is neccesary as matches with user needs

This issue is a response/relied to this issue: #1142

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions