Skip to content

This repository contains a machine learning-based system to detect deepfake (synthetic) voices. It leverages audio feature extraction techniques such a Yamnet , and deep learning models (ANN,CNN, RNN) to distinguish real and fake audio. The project includes data preprocessing, model training, evaluation, and a simple API for inference.

License

Notifications You must be signed in to change notification settings

KaushiML3/Deepfake-voice-detection_Yamnet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Deepfake-voice_detection_Yamnet

This repository contains a machine learning-based system to detect deepfake (synthetic) voices. The system utilizes audio feature extraction techniques such as YAMNet and deep learning models (ANN, CNN, RNN) to differentiate between real and fake audio.

🚀 Features

  • Audio Preprocessing: Converts raw audio into meaningful features.
  • Feature Extraction: Uses YAMNet to extract embeddings from audio signals.
  • Dataset : Uses the kaggle "In The Wild (audio Deepfake)" dataset for training the model.Link
  • Deep Learning Models: Implements ANN, CNN, and RNN architectures for classification.
  • Training & Evaluation: Trains models with labeled datasets and evaluates accuracy.
  • Inference API: Provides an API to classify input audio as real or deepfake.
    1. image
    2. image

Model Performance

  1. ANN model atchitecture
  • Model training stat: Accuracy and Val_lose

    1. image
  • Evaluation Metrics: Accuracy, Precision, Recall, F1-score 2. image

                      precision    recall  f1-score   support
              fake       0.97      0.95      0.96       446
              real       0.96      0.98      0.97       554
    
          accuracy                           0.97      1000
         macro avg       0.97      0.97      0.97      1000
      weighted avg       0.97      0.97      0.97      1000
    
  1. CNN model atchitecture
  • Model training stat: Accuracy and Val_lose

    1. image
  • Evaluation Metrics: Accuracy, Precision, Recall, F1-score 2. image

                      precision    recall  f1-score   support
              fake       0.97      0.96      0.97       446
              real       0.97      0.97      0.97       554
    
          accuracy                           0.97      1000
         macro avg       0.97      0.97      0.97      1000
      weighted avg       0.97      0.97      0.97      1000
    
  1. RNN model atchitecture
  • Model training stat: Accuracy and Val_lose

    1. image
  • Evaluation Metrics: Accuracy, Precision, Recall, F1-score 2. image

                      precision    recall  f1-score   support
              fake       0.94      0.93      0.94       446
              real       0.94      0.95      0.95       554
    
          accuracy                           0.94      1000
         macro avg       0.94      0.94      0.94      1000
      weighted avg       0.94      0.94      0.94      1000
    

Best Model Achieved: ANN with 97% accuracy

🛠️ Setup & Installation

1.Clone the repository:

git clone https://github.com/KaushiML3/Deepfake-voice-detection_Yamnet.git
cd Deepfake-voice-detection_Yamnet

2.Install dependencies:

pip install -r requirements.txt

3.Run inference

  • change the direction for API folder
python main.py

About

This repository contains a machine learning-based system to detect deepfake (synthetic) voices. It leverages audio feature extraction techniques such a Yamnet , and deep learning models (ANN,CNN, RNN) to distinguish real and fake audio. The project includes data preprocessing, model training, evaluation, and a simple API for inference.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages