
Support for T5 Architecture #384

Open
niranjanakella opened this issue Jun 5, 2024 · 6 comments

Comments


niranjanakella commented Jun 5, 2024

Hello @EricLBuehler, opening this issue to track support for the T5 Seq2Seq model architecture in mistral.rs, as discussed.

Relates to: #156

@EricLBuehler added the "new feature" and "models" labels on Jun 5, 2024
EricLBuehler (Owner) commented

Hi @niranjanakella!

Thank you for opening this issue. Just to clarify, would this be a quantized or non-quantized implementation?

niranjanakella (Author) commented Jun 6, 2024

@EricLBuehler A non-quantized f16/f32 implementation takes precedence for now, but if possible I would like a quantized implementation as well.

I would also like to know whether LoRA adapters can be loaded at runtime without merging them into the model. That would be a huge game changer for most applications, since many developers train multiple adapters; being able to attach several adapters at runtime would be great.

EricLBuehler (Owner) commented Jun 6, 2024

> A non-quantized f16/f32 implementation takes precedence for now, but if possible I would like a quantized implementation as well.

Sounds great, I'll get started on an implementation.

> I would also like to know whether LoRA adapters can be loaded at runtime without merging them into the model. That would be a huge game changer for most applications, since many developers train multiple adapters; being able to attach several adapters at runtime would be great.

We actually have this feature already! There are two ways to do it:

  1. Activate adapters at runtime: preload a set of adapters and then send requests that activate the ones you want.
  2. Use per-request adapter specification for granular control (see the sketch below).

Docs: https://github.com/EricLBuehler/mistral.rs/blob/master/docs/ADAPTER_MODELS.md#adapter-model-dynamic-adapter-activation
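
For illustration, here is a minimal sketch of option 2 (per-request adapter specification) against the HTTP server. It assumes an OpenAI-compatible /v1/chat/completions route and a hypothetical "adapters" request field; the port, model id, and adapter name are placeholders as well, so check the linked docs for the actual request schema.

```python
# Minimal sketch of per-request adapter selection over the HTTP API.
# The endpoint path, "adapters" field, model id, and adapter name are
# assumptions for illustration; see docs/ADAPTER_MODELS.md for the real schema.
import requests

BASE_URL = "http://localhost:1234"  # wherever the mistral.rs server is listening

response = requests.post(
    f"{BASE_URL}/v1/chat/completions",
    json={
        "model": "t5-with-lora",  # hypothetical model id
        "messages": [
            {"role": "user", "content": "Summarize: LoRA adapters are small ..."}
        ],
        # Hypothetical field: name the preloaded adapter(s) to apply to this request only.
        "adapters": ["summarization-adapter"],
    },
    timeout=60,
)
print(response.json())
```

Swapping the adapters list from one request to the next would let a single server cover several task-specific adapters without ever merging them into the base weights.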

EricLBuehler (Owner) commented

Hi @niranjanakella! Sorry for the delay; I have been busy with the Idefics 2 implementation (#309). I should have a prototype ready tonight, though!

niranjanakella (Author) commented

@EricLBuehler No problem, sounds good. I am looking forward to trying it out soon.

EricLBuehler (Owner) commented

See: #432.
