I work on AI systems that can align with human intent, reason across multimodal information, and behave reliably in real-world settings.
My research sits at the intersection of vision-language models, medical AI, trustworthy machine learning, hallucination evaluation, and agentic AI systems. I am especially interested in building and evaluating AI systems that are not only capable, but also reliable, interpretable, and deployment-ready.
Currently, I work on:
- π§ Multimodal AI β vision-language models, medical VQA, image/video understanding
- π₯ Medical AI β trustworthy VLMs for clinical and biomedical applications
- π‘οΈ Trustworthy AI β hallucination detection, uncertainty, safety evaluation, benchmarking
- π€ Agentic AI Systems β evaluation, reasoning workflows, and reliable AI agents
- βοΈ Applied ML Engineering β LLM/VLM fine-tuning, deployment, scalable AI platforms
Beyond research, I have experience leading engineering teams, supervising students, building production ML systems, and collaborating across academia and industry.
Multimodal AI Β· Vision-Language Models Β· Medical AI Β· Trustworthy ML Β· AI Safety Β· Hallucination Evaluation Β· Agentic AI Β· LLM/VLM Fine-Tuning Β· Applied Deep Learning
Python Β· PyTorch Β· Transformers Β· vLLM Β· Hugging Face Β· OpenAI APIs Β· Docker Β· FastAPI Β· PostgreSQL Β· Git Β· LaTeX Β· Jupyter
- π Currently working on multimodal, medical, and trustworthy AI systems
- π± Exploring agentic AI evaluation, alignment, reasoning, and safety
- π― Open to research collaborations, applied AI projects, and industry-facing AI systems
- π¬ Ask me about VLMs, medical AI, hallucination detection, LLM/VLM fine-tuning, and deployment
- π« Reach me through sushant.info.np
- π Pronouns: he/him
- π± Vegetarian
From: 23 May 2026 - To: 30 May 2026
Total Time: 6 hrs 8 mins
Other 26 hrs 50 mins βββββββββββββββββββββββββ 81.36 %
LaTeX 4 hrs 38 mins βββββββββββββββββββββββββ 14.09 %
Documentation 1 hr 30 mins βββββββββββββββββββββββββ 04.55 %




