Research themes
What I focus onResponsible AI and safety, interpretable reasoning, structured knowledge extraction, and evaluation methods that catch “silent failures.”
I build systems that ▍
My research goal is to combine neural learning with explicit structure and principled evaluation. Across foundation model safety, neuro-symbolic reasoning, structured knowledge extraction, and scientific ML, I’ve seen strong models fail in subtle, opaque ways under deployment and stress. I focus on auditing behavior, understanding representations, and enforcing semantic + reasoning consistency instead of relying only on output-level metrics.
I like systems that are simple on the surface and rigorous underneath: clear baselines, measurable progress, and explanations that survive real usage.
Responsible AI and safety, interpretable reasoning, structured knowledge extraction, and evaluation methods that catch “silent failures.”
Practical LLM/KG systems, evaluation harnesses, and reproducible research code. Comfortable across Python/Java/SQL and modern ML stacks.
Research engineering + production ML systems.
Java, Python, C/C++, SQL, JavaScript, HTML/CSS, R
PyTorch, TensorFlow, Transformers, LangChain, ChromaDB, XFormers, NumPy, Pandas, Scikit-learn, Statsmodels, Keras, Matplotlib, Seaborn, PyPDF
AWS (EC2, S3, IAM, Lambda, SageMaker, CloudWatch), Docker, Kubernetes, Linux, Git, CI/CD, REST APIs, Nginx
FastAPI, Flask, ML pipelines, experiment tracking, model versioning, data validation, monitoring & logging, distributed training, GPU optimization, batch & real-time inference
ETL pipelines, SQL optimization, schema design, data cleaning & transformation, time-series preprocessing, vector databases (ChromaDB), caching strategies
Independent study: NeuroSymbolic system for analyzing cognitive decline risk from long-term AI usage.
Multimodal safety + bias mitigation across generative and language models.
Satellite navigation forecasting pipeline focused on reliability and real-world integration.
Knowledge graphs from LLMs and a retrieval-augmented medical assistant.
Selected publications and preprints.
LSTM-based clock-bias forecasting with strong accuracy gains and a practical preprocessing workflow for navigation reliability.
KG generation directly from unstructured text, evaluated with structural + semantic metrics for GraphRAG readiness.
Retrieval-augmented medical chatbot using modern LLM tooling to answer questions grounded in medical literature PDFs.
Critical evaluation of TSFMs for anomaly detection/prediction: when they help, when classic methods win, and why.
Selected builds from my recent work.
Self-improving research agent that discovers papers, builds structured research memory, generates hypotheses, and adapts strategy across iterative research cycles.
Electron desktop meeting copilot that captures system audio, transcribes speech, and generates context-aware AI responses with optional screenshots and file grounding.
NeuroSymbolic framework for cognitive-risk analysis with rules, temporal logic, and an interactive demo.
Benchmarking time series foundation models for anomaly detection under realistic constraints.
KG extraction + evaluation pipeline for structured reasoning and GraphRAG readiness checks.
Clock-bias forecasting pipeline with robust preprocessing and deployment-minded evaluation.
Note: This project cannot be open sourced due to security restrictions by the Indian Space Research Organization.
RAG chatbot grounded in medical PDFs with efficient inference tooling.
Interactive visualization of classic searching + sorting algorithms for learning and demos.
For research collaborations, internships, or talks, send a short note with context + one link.