Gokul Karthik Kumar, a graduate of Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI), has made significant contributions to the field of artificial intelligence (AI). His notable projects include developing a system to detect hate speech in memes, creating an AI-human interactive dubbing platform, and conducting research on deep learning-based text-to-speech systems for 13 Indian languages.
During his time at MBZUAI, Kumar had the opportunity to explore various aspects of AI, particularly focusing on natural language processing. He credits the university for providing him with a transformative experience and preparing him for a career in AI research and development. Under the guidance of knowledgeable professors, Kumar honed his research skills, from problem identification to research proposal and presentation.
With a strong background in machine learning across different domains, including text, image, speech, and time series, Kumar has worked with esteemed organizations such as Microsoft Research India, TCS Research, MBZUAI, and IIT Madras. He has achieved success in several hackathons, both national and international, and holds a US patent.
Kumar has co-authored articles published in prestigious conferences like IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Association for Computational Linguistics (ACL 2022) Workshop, Empirical Methods in Natural Language Processing (EMNLP 2022) Workshop, and International Joint Conference on Neural Networks (IJCNN).
After the commencement ceremony, Kumar will present his research paper titled “Towards Building Text-To-Speech Systems for the Next Billion Users” at ICASSP 2023 in Greece. This project, initiated during his summer internship at Microsoft Research India, involved collaborating with co-author Praveen from IIT Madras. Their work focused on evaluating design choices for text-to-speech systems, leading to the development of state-of-the-art models for 13 Indian languages. By extending open-source text-to-speech capabilities to local languages, Kumar aims to reach a wider audience, especially those who may not be proficient in reading English.