This project addresses the challenge of accurately counting speakers in meeting recordings where speech may overlap. This is essential for improving the accuracy of automated meeting transcriptions. To generate realistic training data, a simulator was developed that combines clean speech (LibriSpeech-clean-100) with noise and reverberation effects (Open-RIR dataset).
Two established speaker recognition models (x-vector and ECAPA-TDNN) were tested alongside a novel approach that combines a pretrained Wav2Vec 2.0 encoder with an x-vector model and a linear classifier. The system analyzes short audio segments and reports, for each segment, its timestamps and the detected number of speakers.
In evaluation, the Wav2Vec 2.0 hybrid model significantly outperformed both baseline approaches, suggesting that self-supervised speech representations cope well with overlapping speech in complex meeting environments. The work advances speaker counting and contributes a practical tool to the SpeechBrain project, benefiting a wide range of speech-related applications.
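The segment-wise inference described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the `encode` stub and the untrained linear head stand in for the real pretrained Wav2Vec 2.0 encoder and trained classifier, and all dimensions (768-dim embeddings, 2-second segments, up to 4 speakers) are assumptions for the sketch.

```python
import numpy as np

# Hypothetical sketch of the hybrid counter's inference loop: a frozen
# encoder (stubbed here) maps each short segment to frame embeddings,
# and a linear head predicts a speaker count from 0..MAX_SPEAKERS.

EMB_DIM = 768        # Wav2Vec 2.0 base hidden size (assumed)
MAX_SPEAKERS = 4     # illustrative upper bound on speakers per segment
SEG_SECONDS = 2.0    # illustrative segment length
SR = 16000           # sample rate expected by Wav2Vec 2.0

rng = np.random.default_rng(0)
# Untrained linear head (placeholder for the trained classifier).
W = rng.standard_normal((EMB_DIM, MAX_SPEAKERS + 1)) * 0.01
b = np.zeros(MAX_SPEAKERS + 1)

def encode(segment):
    """Stand-in for a pretrained Wav2Vec 2.0 encoder: frame embeddings."""
    n_frames = max(1, len(segment) // 320)   # ~20 ms hop at 16 kHz
    return rng.standard_normal((n_frames, EMB_DIM))

def count_speakers(audio, sr=SR):
    """Return (start_s, end_s, predicted_count) for each segment."""
    hop = int(SEG_SECONDS * sr)
    results = []
    for start in range(0, len(audio), hop):
        seg = audio[start:start + hop]
        pooled = encode(seg).mean(axis=0)    # temporal mean pooling
        logits = pooled @ W + b              # linear classification head
        results.append((start / sr,
                        min(start + hop, len(audio)) / sr,
                        int(logits.argmax())))
    return results

audio = rng.standard_normal(SR * 5)          # 5 s of dummy audio
preds = count_speakers(audio)
```

In the real system, `encode` would be the pretrained Wav2Vec 2.0 model (kept frozen or fine-tuned) and the head would be trained on simulated overlapping-speech mixtures; only the segment/pool/classify structure is carried over here.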
A learning project exploring how different neural network architectures and hyperparameter choices perform on a dataset. Related to COMP 6721 (Applied AI).
A mini version of the patent, reflecting an initial stride towards it :)
A project built while learning exploratory data analysis and dashboarding with Flask and Python.
A short video made in Unity, featuring basic level design, Cinemachine camera work, and particle-system animation. It's not the best, but you can watch it here :)
An attempt to extract features from, classify, and interpret pathology images.
A collection of educational Jupyter Notebook exercises focused on deep learning concepts using PyTorch. The labs progress from foundational deep learning topics through advanced concepts, including integration with HuggingFace transformers for state-of-the-art NLP and machine learning applications.
An educational collection of hands-on lab notebooks (ConversationalAI-Labs) focused on speech and audio processing using the SpeechBrain framework. The materials progress from fundamental concepts like audio classification and CNNs through advanced techniques including transformers, speaker identification, and pre-trained models for speech recognition and generative language models.