Added an alternative way to initialize/load some models (for non-… #1325

shawl336 · 2024-09-07T14:47:06Z

Added an alternative way to initialize/load some models (for non-Android codes), tokens, hotwrods and keywords, that initializing/loading from memory buffers. The original usage of model initialization unchanged.
Basically, I just added additional "const char* $_buf_begin, const char* $_buf_end" vars ($ == encoder/decoder/joiner/model), if they are non-nullptr, sherpa will attempt to init the models from the memory starting from "$_buf_start" to "$_buf_end" in priority to the original filename strings.
Also, I added "tokens_buf_str, hotwords_buf_str and keywords_buf_str", if they are non-empty, sherpa will attempt to treat them as the content directly, and using istringstream rather than iftream.
the supported models are:
online models:
OnlineTransducerModel
OnlineParaformerModel
OnlineWenetCtcModel
OnlineZipformer2CtcModel
OnlineNeMoCtcModel

offline models:
  OfflineTransducerModel
  OfflineParaformerModel
  OfflineNemoEncDecCtcModel
  OfflineWhisperModel
  OfflineTdnnModel
  OfflineZipformerCtcModel
  OfflineWenetCtcModel
  OfflineSenseVoiceModel

keywordspotter models:
  KeywordSpotterTransducer

csukuangfj · 2024-09-07T15:02:08Z

Could you also add tests for your changes?

csukuangfj · 2024-09-07T15:03:38Z

sherpa-onnx/c-api/c-api.h

  const char *joiner;
+  const char *joiner_buf_begin, *joiner_buf_end;  // if non-null, loading the joiner from the buffer in prioriy


Please add new fields to the end of existing fields.

Please don't change the order of exsiting fields.

I suggest that you use

const char *encoder_buf; int32_t encoder_buf_len;

csukuangfj · 2024-09-07T15:12:08Z

By the way, I suggest that you change one model config per pull-request and add test for the changes.

Otherwise, the pull-request is very large and you need to write lots of tests to cover your changes.

csukuangfj · 2024-09-07T15:09:23Z

sherpa-onnx/csrc/online-transducer-model-config.h

@@ -12,15 +12,25 @@ namespace sherpa_onnx {

 struct OnlineTransducerModelConfig {
  std::string encoder;
+  const char *encoder_buf_begin, *encoder_buf_end;


Suggested change

const char *encoder_buf_begin, *encoder_buf_end;

const char *encoder_buf_begin = nullptr;

const char *encoder_buf_end = nullptr;

Please follow our existing code style to define one variable per line.

Please initialize it to nullptr.

csukuangfj · 2024-09-07T15:13:32Z

sherpa-onnx/c-api/c-api.cc

+  recognizer_config.model_config.transducer.encoder_buf_begin =
+      SHERPA_ONNX_OR(config->model_config.transducer.encoder_buf_begin, nullptr);


Suggested change

recognizer_config.model_config.transducer.encoder_buf_begin =

SHERPA_ONNX_OR(config->model_config.transducer.encoder_buf_begin, nullptr);

recognizer_config.model_config.transducer.encoder_buf_begin =

config->model_config.transducer.encoder_buf_begin;

And please change other places.

csukuangfj · 2024-09-07T15:32:16Z

sherpa-onnx/c-api/c-api.h

  const char *joiner;
+  const char *joiner_buf_begin, *joiner_buf_end;  // if non-null, loading the joiner from the buffer in prioriy


I suggest that you use

const char *encoder_buf; int32_t encoder_buf_len;

csukuangfj · 2024-09-07T15:33:43Z

sherpa-onnx/csrc/offline-nemo-enc-dec-ctc-model-config.h

@@ -12,11 +12,15 @@ namespace sherpa_onnx {

 struct OfflineNemoEncDecCtcModelConfig {
  std::string model;
+  const char *model_buf_begin, *model_buf_end;


Please update the Validate() method to check that
when model_buf_begin is not nullptr, then model must be empty and vice versa.

csukuangfj requested changes Sep 7, 2024

View reviewed changes

csukuangfj reviewed Sep 7, 2024

View reviewed changes

csukuangfj requested changes Sep 7, 2024

View reviewed changes

shawl336 closed this Sep 8, 2024

shawl336 force-pushed the master branch from 742fdb8 to 888f74b Compare September 8, 2024 04:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added an alternative way to initialize/load some models (for non-… #1325

Added an alternative way to initialize/load some models (for non-… #1325

shawl336 commented Sep 7, 2024

csukuangfj commented Sep 7, 2024

csukuangfj Sep 7, 2024

csukuangfj Sep 7, 2024 •

edited

Loading

csukuangfj commented Sep 7, 2024

csukuangfj Sep 7, 2024

csukuangfj Sep 7, 2024

csukuangfj Sep 7, 2024 •

edited

Loading

csukuangfj Sep 7, 2024

		const char *joiner;
		const char joiner_buf_begin, joiner_buf_end; // if non-null, loading the joiner from the buffer in prioriy

	const char encoder_buf_begin, encoder_buf_end;
	const char *encoder_buf_begin = nullptr;
	const char *encoder_buf_end = nullptr;

		recognizer_config.model_config.transducer.encoder_buf_begin =
		SHERPA_ONNX_OR(config->model_config.transducer.encoder_buf_begin, nullptr);

Added an alternative way to initialize/load some models (for non-… #1325

Added an alternative way to initialize/load some models (for non-… #1325

Conversation

shawl336 commented Sep 7, 2024

csukuangfj commented Sep 7, 2024

csukuangfj Sep 7, 2024

Choose a reason for hiding this comment

csukuangfj Sep 7, 2024 • edited Loading

Choose a reason for hiding this comment

csukuangfj commented Sep 7, 2024

csukuangfj Sep 7, 2024

Choose a reason for hiding this comment

csukuangfj Sep 7, 2024

Choose a reason for hiding this comment

csukuangfj Sep 7, 2024 • edited Loading

Choose a reason for hiding this comment

csukuangfj Sep 7, 2024

Choose a reason for hiding this comment

csukuangfj Sep 7, 2024 •

edited

Loading

csukuangfj Sep 7, 2024 •

edited

Loading