Sonu Dixit

About Me

Hello! I like to apply maths and neural networks to solve real-world problems. I have a strong foundation in AI and hands-on experience across diverse domains. I have worked on LLM fine-tuning, large-scale information retrieval, question-answering, and reinforcement learning. Please refer to the experience section for more details on my projects.

I earned my Master's degree in Artificial Intelligence from the Indian Institute of Science in 2019. During my Masters, I worked on multi-agent reinforcement learning for traffic signal control. In the industry, I have contributed to [24]7.ai and Disney Star India Pvt Ltd.

Outside of work, I enjoy swimming, running, and spending time in nature through long walks and treks.

Sonu Dixit

Industry Experience

LLama 3 Finetuing for Customer Chat Conversation

Intent Classification

Information Retrieval and Conditional Generation

LLM as Co-Pilot

Compliance Evaluation and Early Warning System - Agent-Customer Chat

Named Entity Recognition by Question Answering

Content Analysis and Storyline Extraction

Content Analysis Project Image
  • Quantified character attributes using text, image, and audio data, correlating features with viewership metrics.
  • Episode vector learning, social relationship, face recognition. Website

Entity extraction from Semi-Structured Documents

Publications

Publication 1 Image

Zero-Shot Generalization using Intrinsically Motivated Compositional Emergent Protocols

Authors: R Hazra*, Sonu Dixit* — NAACL 2021 Workshop (ViGIL)

Latest Preprint:Intrinsically Motivated Compositional Language Emergence

We argue intrinsic rewards increase compositionality in emergent communication between two Agents. The improved compositionality increases zero shot generalisation in the downstream tasks. website

Publication 2 Image

gComm: Environment for Investigating Generalization in Grounded Language Acquisition

Authors: R Hazra, Sonu Dixit, S Sen — NAACL 2021 Workshop (ViGIL)

This work introduces gComm, an environment designed to evaluate grounded language acquisition. We focus on how agents generalize language understanding in complex, multi-agent scenarios.

M.Tech Thesis: Adaptive Traffic Signal Control using Multi Agent RL

M.Tech Thesis Image
  • Implemented multi-agent reinforcement learning (MARL) to dynamically adjust traffic signal durations based on congestion.
  • Algorithm: Proximal Policy Optimization (PPO) with Advantage Actor-Critic. Report
  • Simulated traffic data using PTV Vissim and demonstrated MARL’s superiority over fixed-time algorithms in terms of average speed, delay, and lane occupancy. Results
  • Each Signal is represented as a Neural Network. During training multiple small neural networks are trained at same time. The networks are connected via the reward that they get.
  • Advisor - Prof Shalabh Bhatnagar

Contact

Email: sonudixit2k@gmail.com

LinkedIn: linkedin.com/in/sonudixit