Carnegie Mellon University
- Pittsburgh, PA
- https://pyf98.github.io
- in/yifan-peng
- @pengyf21
- https://scholar.google.com/citations?user=wH2FALMAAAAJ&hl=en
Highlights
- Pro
NeMo_VoiceTextBlender Public
NAACL 2025: "VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning"
open_asr_leaderboard Public
Forked from huggingface/open_asr_leaderboard
espnet Public
Forked from espnet/espnet
End-to-End Speech Processing Toolkit
shinjiwlab.github.io Public
Forked from shinjiwlab/shinjiwlab.github.io
JavaScript MIT License Updated Dec 6, 2024
pyf98.github.io.old Public
Forked from academicpages/academicpages.github.io
Personal webpage
NeMo Public
Forked from NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Python Apache License 2.0 Updated Aug 23, 2024
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
DPHuBERT Public
INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"
speech-model-compression Public
A collection of papers related to speech model compression
audio Public
Forked from pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Python BSD 2-Clause "Simplified" License Updated Mar 29, 2023
espnet_model_zoo Public
Forked from espnet/espnet_model_zoo
ESPnet Model Zoo
Python Apache License 2.0 Updated Jan 20, 2023
Traffic-Accident-Detection Public archive
IDL course project: Traffic Accident Detection via Deep Learning.