Faizan Amin - Machine Learning Engineer & Full Stack Developer

It is a web based portal where a user can upload a video or audio file from a class room recording or any other session. It is then process by SpeechToText models to get text from that file and then get different insights. Here is list of insights that we get from audio file.

Basic Meta

We extract basic insights from text like sentences and then split into Sentences and questions. We also extract speaker information to get info about speakers and their occurences.

Blooms and Emotions

We have trained a Natural Language Processing algorithm that provides insights for each sentence or a given paragraph that what is users emotion in that text. We also have a model for blooms classification with very good accuracy and can easily extract blooms data. Then we show data to user.

NER

We extract Named Entities from text like names, places and datetime etc and show user an overlayed text where user can toggle between raw and overlayed transcription using NLP.

Other Meta

We also get other metadata like Paragraphs, Summaries, Topics and also display WordCloud with most occuring words for easy understanding of text from speech.

Blooms App

Basic Meta

Blooms and Emotions

NER

Other Meta