-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[cli] support context biasing with ac automaton #2128
Conversation
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
# See the License for the specific language governing permissions and | ||
# limitations under the License. | ||
|
||
from wenet.dataset.processor import __tokenize_by_bpe_model |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
wenet/wenet/dataset/processor.py
Line 28 in 3c7f291
torchaudio.utils.sox_utils.set_buffer_size(16500) |
procesor.py 这一行代码和运行环境关系较大,不保证都能运行成功,是否考虑将
__tokenize_by_bpe_model
函数换个位置
We should aslo add context support in |
用 yapf 做过格式化了吗? |
@kaixunhuang0 The new AC automate will override the greedy search before. |
paraformer 支持热词容易吗, 只考虑greedy search 就行 ,字数对齐到text了 没有prefix , 会简单些, 有时间支持下吗@cdliang11 |
格式化了 |
好的,我瞧瞧 |
热词列表:
@pkufool, 这里我们搬运了你的代码。