different results with fine-tuned model #14700

@ramil-gadirov

Hi,

I fine-tuned stt_en_fastconformer_hybrid_large_pc for a new language, Azerbaijani (az), using 100 h of data for 350k steps and reaching a WER of 7.4%. Transcription of ordinary microphone speech, audiobooks, movies, and similar material is very good. On telephone speech, however, the results are very poor, even though the recordings are good quality with no noise. All audio is 16 kHz, mono, 16-bit. Are there any parameters I should change for training, or is another NeMo model more accurate for phone recordings?
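One thing that may be worth checking, offered as an assumption rather than a diagnosis: telephone audio is often narrowband (roughly 300 to 3400 Hz) and only later resampled to 16 kHz, so it can look like 16 kHz audio while containing almost no energy above 4 kHz. The sketch below uses only NumPy and a hypothetical helper, `high_band_energy_ratio`, to measure what fraction of a signal's spectral energy lies above a cutoff. The synthetic tones stand in for real recordings, and the 4 kHz cutoff is an illustrative choice.

```python
import numpy as np


def high_band_energy_ratio(samples, sample_rate=16000, cutoff_hz=4000.0):
    """Return the fraction of spectral energy above cutoff_hz (0.0 to 1.0).

    Hypothetical helper for illustration; not part of NeMo.
    """
    samples = np.asarray(samples, dtype=np.float64)
    # Power spectrum of the whole signal and the frequency of each bin.
    power = np.abs(np.fft.rfft(samples)) ** 2
    freqs = np.fft.rfftfreq(len(samples), d=1.0 / sample_rate)
    total = power.sum()
    if total == 0.0:
        return 0.0  # silent input: no energy in any band
    return float(power[freqs > cutoff_hz].sum() / total)


# Synthetic stand-ins for real audio (assumption: one second at 16 kHz).
rate = 16000
t = np.arange(rate) / rate
# "Microphone-like" signal: content both below and above 4 kHz.
wideband = np.sin(2 * np.pi * 1000 * t) + np.sin(2 * np.pi * 6000 * t)
# "Phone-like" signal: content only below 4 kHz.
narrowband = np.sin(2 * np.pi * 1000 * t)

wide_ratio = high_band_energy_ratio(wideband, rate)
narrow_ratio = high_band_energy_ratio(narrowband, rate)
print(f"wideband ratio:   {wide_ratio:.3f}")
print(f"narrowband ratio: {narrow_ratio:.3f}")
```

If the phone recordings score near zero while the training audio does not, the two sets differ in bandwidth, which would be one candidate explanation for the accuracy gap. Whether that is the actual cause here is not established by this report.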
