Unsupported model type: jais #906

Closed
SherifElfadaly opened this issue Aug 28, 2024 · 10 comments
Labels: question (Further information is requested)

Comments

@SherifElfadaly

Question

System Info

macOS, node v20.10, @xenova/transformers 2.17.2

Environment/Platform

  • [ ] Website/web-app
  • [ ] Browser extension
  • [x] Server-side (e.g., Node.js, Deno, Bun)
  • [ ] Desktop app (e.g., Electron)
  • [ ] Other (e.g., VSCode extension)

Description

Error: Unsupported model type: jais
    at Function.from_pretrained (file:///node_modules/@xenova/transformers/src/models.js:5526:19)
    at async Promise.all (index 1)
    at loadItems (file:///node_modules/@xenova/transformers/src/pipelines.js:3279:5)
    at pipeline (file:///node_modules/@xenova/transformers/src/pipelines.js:3219:21)
    at SearchQueryParser.initializeModel (src/search-engine/query-parser/search-query-parser.ts:27:18)

Reproduction

import { Logger } from '@nestjs/common';

export class SearchQueryParser {
  private tokenizer: any;
  private model: any;
  private logger: Logger;
  private systemPrompt = '';

  constructor() {
    this.logger = new Logger('query parser');
    this.initializeModel();
  }

  private async initializeModel() {
    const { AutoTokenizer, pipeline } = await import('@xenova/transformers');
    this.tokenizer = await AutoTokenizer.from_pretrained(
      'omarabb315/Query-5KM-no_synonyms_noon_1',
      {
        progress_callback: (data) => {
          this.logger.verbose(
            `${data.status} ${data.file || ''} ${data.progress || ''}`,
          );
        },
      },
    );
    this.model = await pipeline(
      'text-generation',
      'omarabb315/Query-5KM-no_synonyms_noon_1',
    );
  }

  async parse(query: string): Promise<any> {
    if (!this.model) {
      await this.initializeModel();
    }

    const tokenizerResponse = this.tokenizer.apply_chat_template(
      [
        { role: 'system', content: this.systemPrompt },
        { role: 'user', content: query },
      ],
      {
        tokenize: false,
        add_generation_prompt: true,
      },
    );

    // The text-generation pipeline call is async, so its result must be awaited.
    const response = await this.model(tokenizerResponse.toString());

    const parsedQuery = response[0].generated_text;

    return parsedQuery;
  }
}
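
For reference, a minimal usage sketch of the class above (the query string is hypothetical):

const parser = new SearchQueryParser();
// parse() re-runs initializeModel() if the constructor's
// fire-and-forget initialization has not populated this.model yet.
const parsed = await parser.parse('red running shoes');
console.log(parsed);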
xenova added a commit that referenced this issue Aug 28, 2024
@xenova (Collaborator) commented Aug 28, 2024

The model you've linked to appears to be gated, so I added support for and tested with https://huggingface.co/onnx-community/tiny-random-jais. Once you convert yours to ONNX, it should work with Transformers.js v3 👍

@omarabb315 commented Aug 28, 2024

@xenova Thanks for your support. I've granted you access to the model. Can you convert it to ONNX, or tell me how? I tried the code snippet below, but it returned the error shown, so I wonder how you prepared the "tiny-random-jais" model:

Error:

ValueError: Trying to export a jais model, that is a custom or unsupported architecture, but no custom onnx configuration was passed as `custom_onnx_configs`. Please refer to https://huggingface.co/docs/optimum/main/en/exporters/onnx/usage_guides/export_a_model#custom-export-of-transformers-models for an example on how to export custom models. Please open an issue at https://github.com/huggingface/optimum/issues if you would like the model type jais to be supported natively in the ONNX export.

The code snippet:

from optimum.onnxruntime import ORTModelForSequenceClassification
from transformers import AutoTokenizer

model_checkpoint = "omarabb315/Query-5KM-no_synonyms_noon_1"
save_directory = "omarabb315/Query-5KM-no_synonyms_noon_onnx"

# Load a model from transformers and export it to ONNX
ort_model = ORTModelForSequenceClassification.from_pretrained(model_checkpoint, export=True, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(model_checkpoint)

# Save the onnx model and tokenizer
ort_model.push_to_hub(save_directory)
tokenizer.push_to_hub(save_directory)

@xenova (Collaborator) commented Aug 28, 2024

You can do it with our conversion script:

git clone -b v3 https://github.com/xenova/transformers.js.git
cd transformers.js
pip install -q -r scripts/requirements.txt
python -m scripts.convert \
  --quantize \
  --model_id katuni4ka/tiny-random-jais \
  --trust_remote_code \
  --skip_validation \
  --custom_onnx_configs '{"model":"gpt2"}' \
  --task text-generation-with-past
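
The --custom_onnx_configs '{"model":"gpt2"}' flag reuses the GPT-2 ONNX export config for the model component, presumably because the jais architecture is close enough to GPT-2 for the export to work. Once converted, the output can be loaded locally with Transformers.js v3. A minimal sketch, assuming the script writes its output under ./models/ (the output path and the options here are assumptions, not verified against this exact setup):

import { pipeline, env } from '@huggingface/transformers';

// Resolve models from the local ./models/ directory written by the
// conversion script instead of fetching from the Hugging Face Hub.
env.localModelPath = './models/';
env.allowRemoteModels = false;

const generator = await pipeline('text-generation', 'katuni4ka/tiny-random-jais');
const output = await generator('Hello', { max_new_tokens: 16 });
console.log(output);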

I can make PRs for your models, if you'd like 👍

@omarabb315

That is clear, thanks a lot. Yes please, a PR would be great.

@xenova (Collaborator) commented Aug 28, 2024

PR made 👍 I've also added fp16 and int8 quantized variants, which may help reduce computational load. I haven't been able to test them, though... hopefully they work! 🤞

I have tested https://huggingface.co/inceptionai/jais-family-590m-chat, and that seems to work fine 👍

import { pipeline } from '@huggingface/transformers';

const generator = await pipeline('text-generation', 'onnx-community/jais-family-590m-chat', {
    dtype: 'q8',
});

const question = "What is the capital of UAE?";
const prompt_eng = `### Instruction: Your name is 'Jais', and you are named after Jebel Jais, the highest mountain in UAE. You were made by 'Inception' in the UAE. You are a helpful, respectful, and honest assistant. Always answer as helpfully as possible, while being safe. Complete the conversation between [|Human|] and [|AI|]:\n### Input: [|Human|] ${question}\n[|AI|]\n### Response:`;

const result = await generator(prompt_eng, { max_new_tokens: 128, return_full_text: false });
console.log(result);

produces:

"The capital city of the United Arab Emirates, a federation of seven emirates on the eastern side of the Arabian peninsula, is Abu Dhabi. This bustling metropolis is not only the political, economic, and cultural hub of the UAE but also a global hub for tourism and trade. It is home to the world's largest oil refinery, the Abu Dhabi National Petroleum Company (ADNOC), and the world's largest seafaring company, the Abu Dhabi Ports Corporation (ADPOC). The city is also a major hub for the United Arab Emirates'"

@SherifElfadaly (Author)

@xenova The original error is now gone, but a new error was raised:

2024-08-28 18:03:45.956 node[14526:6341613] 2024-08-28 18:03:45.956927 [E:onnxruntime:, inference_session.cc:2105 operator()] Exception during initialization: filesystem error: in file_size: No such file or directory ["model.onnx_data"]

/node_modules/onnxruntime-node/lib/backend.ts:16
      this.#inferenceSession.loadModel(pathOrBuffer.buffer, pathOrBuffer.byteOffset, pathOrBuffer.byteLength, options);
                             ^
Error: Exception during initialization: filesystem error: in file_size: No such file or directory ["model.onnx_data"]
    at new OnnxruntimeSessionHandler (/node_modules/onnxruntime-node/lib/backend.ts:16:30)
    at Immediate.<anonymous> (/node_modules/onnxruntime-node/lib/backend.ts:61:19)
    at processImmediate (node:internal/timers:478:21)

Any idea what is missing?

@xenova (Collaborator) commented Aug 28, 2024

Yes, we are still adding the external data functionality (to load models > 2GB). For now, you can set dtype: 'q8' or dtype: 'fp16' to use a smaller model.
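
A minimal sketch of that workaround, assuming the PR added quantized ONNX weights to the model repo from this thread (the repo is gated, so this is untested here):

import { pipeline } from '@huggingface/transformers';

// 'q8' loads the int8-quantized weights, which fit in a single ONNX file
// under the 2GB protobuf limit; 'fp16' works the same way.
const generator = await pipeline('text-generation', 'omarabb315/Query-5KM-no_synonyms_noon_1', {
    dtype: 'q8',
});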

@SherifElfadaly (Author)

@xenova Thanks for your support; it seems to be working fine now.

@omarabb315 commented Aug 28, 2024

@xenova Thanks again for your effort. Lastly, can you please share the snippet you used to produce this PR, so we can use it for our future models?

@xenova (Collaborator) commented Aug 28, 2024

> can you please share with me the snippet used to produce this PR, in order to use it in our future models

See here for the commit that adds JAIS models to Transformers.js, here for the conversion code, and here for the code snippet. Is there anything else you'd like?
