Data Scientist
State Street
01/01/2025 - Present
• Designed predictive liquidity risk models using XGBoost, LSTM time-series networks, and Bloomberg B-PIPE, forecasting portfolio liquidity stress up to 30 days ahead, reducing liquidity risk incidents by 32% across multi-asset portfolios.
• Implemented real-time anomaly detection pipelines using Apache Kafka, AWS MSK, Isolation Forests, and Autoencoders, processing 10M+ market events daily, cutting risk detection latency by 45% with sub-200ms response times.
• Evaluated and deployed explainable AI frameworks using SHAP, LIME, and Snowflake SQL, providing transparent risk drivers, successfully passing SEC and internal audits with zero model governance findings.
• Architected end-to-end MLOps workflows using MLflow, Kubeflow, Docker, Kubernetes on AWS EKS, automating model deployment and monitoring, improving forecasting accuracy by 28% and deployment reliability by 40%.
• Developed self-service analytics dashboards using Tableau, Power BI, Plotly Dash, and FIS/SimCorp integrations, enabling portfolio managers to identify stress scenarios 2–4 weeks earlier.
Data Scientist
Persistent Systems
01/05/2021 - 01/07/2023
• Engineered ML-based credit scoring models using Logistic Regression, Random Forest, Gradient Boosting, and CIBIL/Experian/Equifax APIs, processing 1M+ loan applications per month, reducing loan default rates by 27% and improving approval speed by 60%.
• Developed advanced behavioral feature engineering pipelines using Python, Pandas, PySpark, UPI transaction analytics, and Account Aggregator (AA) Framework, increasing approval rates for new-to-credit customers by 22%.
• Implemented explainable AI models using SHAP, StatsModels, and Python, ensuring 100% RBI compliance for model transparency and credit risk reporting.
• Designed low-latency fraud detection pipelines using Isolation Forest, Autoencoders, Apache Kafka, Spark Streaming, and NPCI/Payment Gateway APIs, detecting 92% of fraudulent transactions in milliseconds while reducing false positives by 35%.
• Orchestrated cloud-native ML workflows using AWS S3, EC2, Lambda, Docker, Kubernetes, MLflow, and Jenkins, reducing end-to-end model deployment time by 50% and supporting 20M+ transactions/day.