Build and operate end-to-end ML/AI pipelines (data → training → deployment → monitoring).
Automate CI/CD for ML/AI with Jenkins, and integrate MLflow for experiment tracking and the model registry.
Deploy scalable batch and online inference systems using Docker and Kubernetes/ECS.
Implement observability for models.
Manage streaming pipelines with Kafka and Flink.
Ensure secure, high-quality code execution for user-submitted scripts.
Lead Agent & Foundational Model Operations: integrate APIs (OpenAI, Gemini, Anthropic), manage RAG pipelines and vector stores, and handle prompt/tool management, caching, routing, and guardrails.
Collaborate with product, legal, and security teams to ensure compliance and responsible model use.