IT Cloud and AI Ops Lead
NXP Semiconductors
Job Responsibilities:
· Platform Development: Design, develop, and implement cloud-based solutions, leveraging AI/ML to automate and enhance IT operations.
· Intelligent Automation: Drive the automation of routine operational tasks, incident response, and remediation workflows using AI-driven agents and orchestration tools, minimizing manual intervention and improving operational efficiency.
· Incident Management and Remediation: Integrate AIOps insights with existing incident management systems, providing real-time intelligence to rapidly identify, diagnose, and resolve IT issues, leading to proactive issue resolution and reduced mean time to recovery (MTTR).
· Predictive Analysis and Performance Optimization: Lead the implementation of AIOps models for proactive anomaly detection, root cause analysis, and predictive insights to fine-tune IT systems for peak operational efficiency, capacity planning, and resource optimization.
· Technical Acumen: Provide technical support to peers, promoting architectural excellence, innovation, and best practices in Cloud and AIOps development and operations.
· Cross-Functional Collaboration: Partner with cloud architects, platform engineers, functional leads, and IT operations teams to integrate AI agents into the platform and ensure solutions align with business needs and deliver measurable ROI.
· Innovation & Research: Actively research and evaluate emerging Cloud and AIOps technologies, generative AI, and advanced retrieval-augmented generation (RAG) techniques, bringing promising innovations into production through POCs and long-term architectural evolution.
Job Qualifications:
· 4+ years of experience building and deploying cloud solutions and managing multi-cloud environments (AWS, Azure, Google Cloud).
· Extensive knowledge of and hands-on experience with cloud technologies and services such as EC2, EBS, S3, Terraform, CLI, API, CI/CD pipelines, FinOps, VPC, RDS, and Resource Manager.
· Experience building and using cloud-native and custom AI tools and services.
· Experience developing full-stack applications and services is preferred.
· Experience managing cloud-based operations and processes such as alerting, monitoring, logging, and incident management.
· Demonstrated ability to translate business requirements and operational challenges into technical specifications and deliver robust AIOps solutions.
· Strong communication and interpersonal skills, with a track record of influencing technical and cross-functional stakeholders.
· Experience with the software development life cycle (SDLC) and agile/iterative methodologies, along with programming and scripting experience in relevant languages and formats such as Python, Terraform, CloudFormation, and JSON.
· Familiarity with containerization (Docker, Kubernetes).
· Bachelor’s degree in Computer Science, Information Technology, or a related field.
· Certifications in cloud and development technologies are a plus.
Key Competencies:
A highly analytical, hands-on Cloud and AIOps lead who is passionate about leveraging AI to drive operational excellence and resilience in a fast-paced environment, who bridges the gap between AI development and IT operations, and who is committed to building intelligent, self-healing systems that power a world-class experience.
· Programming Languages: Python, Java, JavaScript, Node.js
· Operating Systems: Windows, Linux, macOS
· Web Languages: HTML, CSS, JavaScript
· Development Frameworks and Other: Node.js, React, API Automation, Postman
· Version Control: Git, GitHub
· Cloud Services: AWS, Azure, GCP