Skip to content
View shangeth's full-sized avatar
🏠
Working from home
🏠
Working from home

Organizations

@SforAiDl

Block or report shangeth

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shangeth/README.md

Visitors Repos Badge

Hi there, I'm Shangeth 👋

Researcher, Developer!

  • Previously,
  • Research Interests 🤓
    • Multi-Modal LLM(Speech)
    • Spoken Dialogue Systems
    • Speech Representations
    • Unsupervised/Semi-Supervised Representation Learning
    • Deep Reinforcement Learning

Check out my recent open-source release of multi-modal LLM for speech understanding.

Connect with me:

mail me shangeth.com twitter | Twitter linkedin | LinkedIn Google Scholar | Google Scholar



Shangeth's GitHub stats

Pinned Loading

  1. wavencoder wavencoder Public

    WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

    Python 89 14

  2. AccentRecognition AccentRecognition Public

    Identification of accent of an english speaker with their speech signal.

    Python 8

  3. SpeakerProfiling SpeakerProfiling Public

    Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf

    Python 64 22

  4. NLTK-Twitter-Sentiment-Analysis NLTK-Twitter-Sentiment-Analysis Public

    Search Tweets with Sentiment/Factual filters

    JavaScript 21 19

  5. Facial-Emotion-Recognition-PyTorch-ONNX Facial-Emotion-Recognition-PyTorch-ONNX Public

    Python 40 13

  6. skit-ai/slu-prosody skit-ai/slu-prosody Public

    Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 2023.

    Jupyter Notebook 23 3