It is a web based portal where a user can upload a video or audio file from a class room recording or any other session. It is then process by SpeechToText models to get text from that file and then get different insights. Here is list of insights that we get from audio file.
Basic Meta
We extract basic insights from text like sentences and then split into Sentences and questions. We also extract speaker information to get info about speakers and their occurences.
Blooms and Emotions
We have trained a Natural Language Processing algorithm that provides insights for each sentence or a given paragraph that what is users emotion in that text. We also have a model for blooms classification with very good accuracy and can easily extract blooms data. Then we show data to user.
NER
We extract Named Entities from text like names, places and datetime etc and show user an overlayed text where user can toggle between raw and overlayed transcription using NLP.
Other Meta
We also get other metadata like Paragraphs, Summaries, Topics and also display WordCloud with most occuring words for easy understanding of text from speech.