Required skills and qualifications Exp- 7-12 Years · Experience: Proven experience in technical support or engineering, preferably in AI/ML/GenAI environments. · Technical Proficiency: Expertise in GenAI models (e.g., GPT, Claude, PaLM2, Llama2) and frameworks (e.g., RAG, Agents, COT). · Cloud Platform and DevOps: Hands-on experience with cloud platforms (Azure, AWS, GCP) and DevOps tools. · Database knowledge: SQL/Sybase/Mongo DB, any data warehouse (Snowflake, Databricks) experience. · Scripting and Automation: Strong proficiency in Python, Shell scripting, and other relevant programming and UI languages like Java, Angular, · Monitoring tools knowledge like Splunk, AppDynamics, Autosys, Grafana/ Loki/ Prometheus · ITIL application support management processes: Incident/Problem/Service/Jira management. · Kubernetes and Containerization: Familiarity with containerization technologies like Docker, Loki and orchestration tools like Kubernetes (preferably EKS or OpenShift). · Problem-solving and Analytical Skills: Excellent problem-solving, analytical, and troubleshooting skills with strong attention to detail. · Communication and Collaboration: Strong command and control with good communication and interpersonal skills to collaborate effectively with diverse teams and stakeholders across global teams. · Educational Background: Bachelor's degree in Computer Science, Engineering, or a related field. Preferred qualifications · Experience with natural language processing (NLP) and machine learning (ML) models. · Familiarity with large language models (LLMs) such as GPT-3.5 Turbo, GPT-4.0, and GPT-4-O. · Experience with OpenAI technologies and managing GenAI services in cloud environments. · Understanding of MLOps practices and model lifecycle management. · Familiarity with application monitoring solutions like Dynatrace and Splunk.
Job Title
Site Reliability Engineer