I'm a Data Scientist & AI Engineer with expertise in machine learning, deep learning, and large-scale data systems. My work focuses on building scalable, end-to-end ML solutions that solve real-world problems, from data preprocessing to deployment.
I enjoy working at the intersection of AI research and practical applications, turning complex data into insights and intelligent systems. I aim to contribute to cutting-edge AI innovation—LLMs, generative AI, optimization-driven ML—while driving measurable business impact.
Download ResumeAI · ML · Deep Learning · Data Engineering · Statistics
M.S. Data Science — Stony Brook · B.E. ECE — Andhra University
Two merged PRs: (1) Fixed llama-stack integration for llama-stack>=0.4.0 — updated api_base_url and CI workflow (PR #2805). (2) Stopped CI from running tests for archived Google integrations — removed workflow files and labeler refs for google-ai-haystack and google-vertex-haystack (PR #2802).
Contributed documentation improvements (merged PR #9660 — clarified `Gamma` `loglike_obs` and `weights` parameterization) and currently implementing rotated copula support for enhanced tail-dependence modeling.
Authored two contributions — one improving documentation and example clarity (merged), and another proposing a new `select_list` / `is_list` selector feature for list-like columns (closed after backend discussions). Both contributions deepened library understanding and informed future selector design.
Added parametrized Transformers smoke test for tokenizer robustness (merged PR #1814). Tests initialization and constrained generation across multiple Hugging Face checkpoints to surface tokenizer-related edge cases across models.
Published under AI Mind. Covers the shift from prompt engineering to prompt design as a disciplined creative process.
10-minute technical deep dive explaining why chunking strategies determine retrieval quality in RAG pipelines.
27-minute, 6.6 K-word guide on LLM evaluation metrics — 16 claps and 4 reader highlights.