Shrishti Saha Shetu

Research Associate, Machine Learning for Acoustic Frontend
Fraunhofer IIS

Graduate Studies: Master's in Communication and Multimedia Engineering (FAU Erlangen Nuremberg, Germany, class of 2020)

Undergraduate Studies: B.Tech in Electrical and Electronics Engineering (National Institute of Technology, Jamshedpur, India, class of 2017)

Areas of interest entail: Machine Learning, Multimedia Signal Processing, Computational Neuroscience, AI for Health and Biomedical Research.

Papers

This section covers all my published conference and journal papers.

An Empirical Study of Visual Features for DNN Based Audio-Visual Speech Enhancement in Multi-Talker Environments.

Projects

This section covers all the projects I have completed and some demos of those projects.

Deep Learning Based Noise Suppression

In this project we present a novel deep learning method to suppress any kind of noises present in a speech signal. The Deep learning model used for this project use a covulational recurrent neural network and the model was trained with 500 hours data provided by Microsoft DNS challenge

Visit the project page for more details and listen to the enhanced audio samples.

An Empirical Study of Visual Features for Deep Learning based Audio-Visual Speech Enhancement

This project was the part of my master thesis work. In this project we reimplemented the AV speech separation model presented in Google Looking to Listen Project . Furthermore, we also proposed a raw lip images based AV speech enhancement model and compared the results with Google model. This project also covers a study of other visual features, such as, lip embedding and also the the effect of visual features on speech denoising tasks.

Visit the project page for more details and listen to the enhanced audio-visual samples.

Blind Estimation of the Subband Reverberation Time

The research work for this project was done at Multimedia Communication and Signal Processing department at FAU Erlangen as a part of my research internship. In this project, we studied many novel approaches for subband T60s estimation. We also proposed two diffrent methods for blind subband and fullband T60 estimation. Please visit the projcet page for more details.

Lanes Detection for Autonomus Driving

The difficulty with the road-lane markings is that there is no labeled dataset of them and creating such dataset would cost millions of dollars. In this project, we solved this problem by creating a dataset of simulated images and then intermixed with a dataset of real images that contain no road.

Acoustic Echo Cancellation with Double-Talk Detection

The need for acoustic echo cancellation arises whenever a loud speaker and a microphone are placed together in nearby vicinity of each other and as a result the electroacoustic circuit may become unstable and produce undesirable howling. So to solve this problem, in this project we use the regularized Normalized LeastMean Square(NLMS) algorithmto generate the coefficients of the adaptive filter and to have a control over the filter adaption we couple it with a double talk detector usingMagnitude Squared Coherence(MSC) to extract the local speech signal.

Overfitting and Regularization for Wireless Resource Management.

In this work, we address both the theoretical and practical aspects of overfitting and regularization with applications to wireless resource management. We first discover the theoretical reasoning of overfitting and how regularization helps and then on implementation side we show the effects of regularization on a practical DNN-based approximatin

My Skills: Experience and Learning

This section highlights my courses and experience and gives a general overview on the skills I have gathered over the recent past.

Experience
Framework
- Keras
- TensorFlow
- PyTorch
Courses
- Machine learning in signal processing
- Seminar on overfitting and regularization for machine learning in wirless communication

Language
- Python
- Matlab
- Java
- C++
- HTML
- CSound
Framework
- Docker
- Kubernetes
- Github
- Bash
Operating System
- Linux
- Unix
- Windows

-->

Testimonials and Awards

This section has excerpts and snapshots of recommendations, testimonials and awards from people I have interacted with in my academic and professional journey.

He is quick to learn new tasks. Within a short time, he was very successful in applying the acquired professional knowledge to his work with high degree of dedication at all times We were always very satisfied with the results of his work.

Prof. Dr. ir. Emanuël Habets

Associate Professor for Perception-based Spatial Audio Signal Processing

Mr.Shetu impressed us with his expertise in the field of deep neural networks that was valuable to the entire team. He was a dedicated internee who has always accomplished his assignments with utmost commitment.

Prof.Dr.Dr. Ulrich Hoppe

Department head, Universitätsklinikum Erlangen

Mr. Shetu has varied expertise in MATLAB and Python, which he successfully applied in practice. His work was characterized by high reliability and displayed a high level of responsibility

Prof. Dr. Meinard Muller

Professor for Semantic Audio Signal Processing

His quality of relaizing and analyzing a topic is impressive. I believe he has a very good potential to pursue his career to doctorate level.

Prof. Rabindra Nath Mahanty

Porfessor and HOD, EEE, NIT, Jamshedpur.

He stood out among his peers for his egarness to engage in the process of learning and discovery.

Prof. Dr. Niranjan Kumar

Professor, EEE, NIT, Jamshedpur.

He distinguished himself as one of the best students in the class and laboratory, with a keen grasp of basic facts coupled with mathematical knowledge, which he applies efficiently to problem solving.

Dr. Madhu Singh

Associate Professor, EEE, NIT, Jamshedpur.

What am I currently learning and doing?

Coursework

Projects

Realtime speech denoising for streaming systems, IoT devices and hearing aids
Online audio-visual speech enhancement
Personal Project:Improving the implementation of Deep-Channel uses deep neural networks to detect single-molecule events from patch-clamp data

Contact Me

Address

Wichernstrasse 18, Zi-518, Erlangen-91052, Bayern, Germany

Phone Number

+49 17636661445

Email

shetu.nitjsr13@gmail.com

Your message has been sent. Thank you!

Shrishti Saha Shetu

Varied expertise for developing, optimizing and deploying cross platform DNN based speech enhancement models for resource constrained embedded devices for Real time VOIP and ASR applications.

Research Associate, Machine Learning for Acoustic Frontend
Fraunhofer IIS

Papers

An Empirical Study of Visual Features for DNN Based Audio-Visual Speech Enhancement in Multi-Talker Environments.

Projects

Deep Learning Based Noise Suppression

An Empirical Study of Visual Features for Deep Learning based Audio-Visual Speech Enhancement

Blind Estimation of the Subband Reverberation Time

Lanes Detection for Autonomus Driving

Acoustic Echo Cancellation with Double-Talk Detection

Overfitting and Regularization for Wireless Resource Management.

My Skills: Experience and Learning

Machine Learning

Programming

Testimonials and Awards

Prof. Dr. ir. Emanuël Habets

Associate Professor for Perception-based Spatial Audio Signal Processing

Prof.Dr.Dr. Ulrich Hoppe

Department head, Universitätsklinikum Erlangen

Prof. Dr. Meinard Muller

Professor for Semantic Audio Signal Processing

Prof. Rabindra Nath Mahanty

Porfessor and HOD, EEE, NIT, Jamshedpur.

Prof. Dr. Niranjan Kumar

Professor, EEE, NIT, Jamshedpur.

Dr. Madhu Singh

Associate Professor, EEE, NIT, Jamshedpur.

What am I currently learning and doing?

Coursework

Projects

Contact Me

Address

Phone Number

Email

Varied expertise for developing, optimizing and deploying cross platform DNN based speech enhancement models for resource constrained embedded devices for Real time VOIP and ASR applications.

Research Associate, Machine Learning for Acoustic Frontend Fraunhofer IIS

Papers

Projects

My Skills: Experience and Learning

Machine Learning

Programming

Academics

Software

Testimonials and Awards

Associate Professor for Perception-based Spatial Audio Signal Processing

Department head, Universitätsklinikum Erlangen

Professor for Semantic Audio Signal Processing

Porfessor and HOD, EEE, NIT, Jamshedpur.

Professor, EEE, NIT, Jamshedpur.

Associate Professor, EEE, NIT, Jamshedpur.

What am I currently learning and doing?

Coursework

Projects

Contact Me

Address

Phone Number

Email

Research Associate, Machine Learning for Acoustic Frontend
Fraunhofer IIS