476 Devops Engineers jobs in Hyderabad
DevOps Engineers
Posted today
Job Viewed
Job Description
Role and Responsibilities
Skills and Experience
Site Reliability Engineer
Posted 8 days ago
Job Viewed
Job Description
At Amgen, if you feel like you're part of something bigger, it's because you are. Our shared mission-to serve patients living with serious illnesses-drives all that we do.
Since 1980, we've helped pioneer the world of biotech in our fight against the world's toughest diseases. With our focus on four therapeutic areas -Oncology, Inflammation, General Medicine, and Rare Disease- we reach millions of patients each year. As a member of the Amgen team, you'll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller happier lives.
Our award-winning culture is collaborative, innovative, and science based. If you have a passion for challenges and the opportunities that lay within them, you'll thrive as part of the Amgen team. Join us and transform the lives of patients while transforming your career.
Site Reliability Engineer
**What you will do**
Let's do this. Let's change the world. In this vital role you will responsible for the reliability, stability, performance, scalability, and security of platforms that support Amgen's digital products and engineering teams. This hands-on role focuses on supporting cloud-based infrastructure, automating operations, maintaining observability, and improving platform reliability through code.
You'll work closely with senior engineers and cross-functional teams to support CI/CD workflows, container platforms, incident response, and enterprise tooling-all while adopting modern SRE principles and practices.
This role is ideal for engineers who have foundational site reliability experience and are looking to expand their skills in a cloud-native, enterprise-scale environment.
**Roles & Responsibilities:**
**Infrastructure & Platform Support**
+ Provision and manage cloud infrastructure using Infrastructure as Code (IaC)
+ Support container orchestration platforms, ensuring availability, access control, and resource management
+ Assist in configuring and maintaining CI/CD pipelines and environments
**Monitoring & Incident Response**
+ Set up and maintain observability tools to track system health and performance
+ Participate in alert tuning, incident resolution, and root cause analysis
+ Support integration of observability platforms with incident response workflows
**Automation & Platform Operations**
+ Automate routine platform tasks such as provisioning, patching, and configuration
+ Write scripts to improve platform reliability, reduce manual work, and enforce compliance
+ Participate in platform upgrades, maintenance windows, and service validation efforts
**AI Enablement & Intelligence**
+ Support the adoption of AI-assisted operational tools for log analysis, anomaly detection, and predictive alerts
+ Collaborate with senior engineers to evaluate AI/ML-based observability and automation platforms
+ Assist in integrating AI-driven insights into dashboards, alerts, or incident workflows
+ Stay current with emerging AI trends in infrastructure and site reliability, and contribute to tool evaluations and pilots
**Collaboration & Enablement**
+ Work with development, QA, and security teams to ensure reliable and secure deployments
+ Document operational procedures, playbooks, and system runbooks
+ Learn and support enterprise collaboration platforms and internal tooling
+ Participate in Agile and SAFe delivery processes-including sprint planning, stand-ups, retrospectives, and PI planning-to ensure security and platform reliability are embedded across development cycles.
**What we expect of you**
We are all different, yet we all use our unique contributions to serve patients. The (vital attribute) professional we seek is a (type of person) with these qualifications.
**Basic Qualifications:**
+ Master's degree / Bachelor's degree and 5 to 9 years in Computer Science, IT or related field
+ 4 years of hands-on related experience in site reliability, DevOps, or platform engineering roles
+ Hands-on experience with cloud platforms preferably AWS
+ Familiarity with Kubernetes or container orchestration technologies
+ Exposure to CI/CD practices and pipeline automation
+ Experience troubleshooting Linux systems, processes, and services
**Preferred Qualifications:**
**Must-Have Skills:**
+ Practical experience with **cloud platforms** (e.g., AWS, Azure, or GCP), including compute, networking, IAM, and storage services
+ Familiarity with **container orchestration platforms** (e.g., Kubernetes, Docker), including basic workload deployment and troubleshooting
+ Experience using **Infrastructure as Code (IaC)** tools such as **Terraform** or **CloudFormation**
+ Working knowledge of **Linux administration** , including system services, package management, and file system structures
+ Hands-on exposure to **CI/CD platforms** (e.g., GitLab CI, Jenkins, GitHub Actions) and pipeline troubleshooting
+ Proficiency in **scripting or automation languages** like **Python** , **Bash** , or **Go**
+ Exposure to **observability tooling** (e.g., **Dynatrace** , **Prometheus** , or **Grafana** ) for monitoring and alerting
+ Familiarity with **incident management practices** and tools (e.g., runbooks, escalation workflows, basic alert tuning)
+ Version control skills using **Git** and understanding of branching strategies
+ Experience supporting or integrating **enterprise collaboration platforms** (e.g., Jira, Confluence, ServiceNow)
+ Interest and basic understanding of **AI/ML tools** used in infrastructure and operations (e.g., anomaly detection, intelligent alerting, log analysis)
**Good-to-Have Skills:**
+ Experience using Infrastructure as Code (IaC) tools like Terraform or CloudFormation
+ Familiarity with IT incident response workflows and ticketing platforms
+ Knowledge of secrets management, configuration management tools (e.g., Ansible), or logging frameworks
+ Exposure to **AI-assisted tooling** (e.g., AIOps platforms, AI-enhanced alerting, anomaly detection)
**Professional Certifications (Preferred)**
+ Cloud DevOps Certification (AWS/Azure/GCP)
+ Certified Kubernetes Administrator (CKA) or Security Specialist (CKS)
+ CI/CD Platform Certification
+ ITIL Foundation or equivalent service management certification
**Soft Skills:**
+ Strong analytical and troubleshooting skills
+ Collaborative and proactive mindset
+ Effective communication and documentation practices
+ Curiosity and willingness to adopt new tools and methods, including AI integrations
+ Ability to manage time and prioritize tasks in dynamic environments
**Shift Information:** This position is an onsite role and may require working during later hours to align with business hours. Candidates must be willing and able to work outside of standard hours as required to meet business needs.
**What you can expect of us**
As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we'll support your journey every step of the way.
In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards.
**Apply now and make a lasting impact with the Amgen team.**
**careers.amgen.com**
As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease.
Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
Site Reliability Engineer
Posted 10 days ago
Job Viewed
Job Description
Site Reliability Engineer
Experience: 7+ Years
Location: Hyderabad
Hybrid 4-day office and 1 Day remote
Skills for Principal:
- Strong leadership and people management skills.
- Exceptional technical proficiency in Pearson's technology stack.
- Advanced project management capabilities.
- Excellent communication and collaboration skills.
- Adept at risk assessment and crisis management.
- Strategic thinking with a focus on long-term operational excellence.
- Champion operational excellence by directing initiatives that elevate system reliability, availability, and overall efficiency.
- Function as the diplomatic link that binds the SRE team to other organizational units, harmonizing goals, and facilitating collaboration for mutual success.
- Cultivate an environment of excellence, propelling the development of SRE engineers, and Sr. SRE engineers.
- 7+ years of professional work experience as described above.
Site Reliability Engineer
Posted 3 days ago
Job Viewed
Job Description
Site Reliability Engineer
Experience: 7+ Years
Location: Hyderabad
Hybrid 4-day office and 1 Day remote
Skills for Principal:
- Strong leadership and people management skills.
- Exceptional technical proficiency in Pearson's technology stack.
- Advanced project management capabilities.
- Excellent communication and collaboration skills.
- Adept at risk assessment and crisis management.
- Strategic thinking with a focus on long-term operational excellence.
- Champion operational excellence by directing initiatives that elevate system reliability, availability, and overall efficiency.
- Function as the diplomatic link that binds the SRE team to other organizational units, harmonizing goals, and facilitating collaboration for mutual success.
- Cultivate an environment of excellence, propelling the development of SRE engineers, and Sr. SRE engineers.
- 7+ years of professional work experience as described above.
Site Reliability Engineer
Posted today
Job Viewed
Job Description
Site Reliability Engineer
Experience: 7+ Years
Location: Hyderabad
Hybrid 4-day office and 1 Day remote
Skills for Principal:
- Strong leadership and people management skills.
- Exceptional technical proficiency in Pearson's technology stack.
- Advanced project management capabilities.
- Excellent communication and collaboration skills.
- Adept at risk assessment and crisis management.
- Strategic thinking with a focus on long-term operational excellence.
- Champion operational excellence by directing initiatives that elevate system reliability, availability, and overall efficiency.
- Function as the diplomatic link that binds the SRE team to other organizational units, harmonizing goals, and facilitating collaboration for mutual success.
- Cultivate an environment of excellence, propelling the development of SRE engineers, and Sr. SRE engineers.
- 7+ years of professional work experience as described above.
Site Reliability Engineer
Posted today
Job Viewed
Job Description
Mentor teammates on SRE best practices and guide technical direction
Work closely with the product engineering team to rapidly deliver capabilities
Automate and optimize developer pipelines
Build monitoring to assess system and pipeline health
Qualifications:
Proficiency in Python, Go, Ruby, or Java is a plus
Expertise in Linux administration, configuration, and networking protocols
Experience managing and automating cloud infrastructure in AWS, Google Cloud Platform, and/or Azure
Exemplary experience with tools such as Kubernetes, Terraform, Ansible, Puppet, and Chef
5+ years experience in Site Reliability Engineering or DevOps
Site Reliability Engineer
Posted today
Job Viewed
Job Description
Driven by the passion to improve quality of people’s lives, WS Audiology continues to grow as market leader in the hearing aid industry. With our commitment to increase penetration in an underserved hearing care market, we want to accelerate our business transformation in order to reach more people, more effectively.
We are looking for Site Reliability Engineers (SREs) with domain expertise in at least one of the following fields: containers, public clouds, and cloud-native workloads.
As an SRE you will be responsible for ensuring the reliability, performance, and security of the operational backbone of a partly medical cloud-based product suite.
What you will do
What you bring
What we offer
Personal competencies
If you are a highly motivated and skilled SRE who is enthusiastic about ensuring the reliability and performance of complex software systems, we encourage you to apply.
And beyond your professional qualifications, we are looking for the following:
You take initiative and ownership when something is not working. Furthermore, you are the type of profile, that gives inputs and suggestions for, how processes can be improved.
You are able to both give and receive constructive feedback and people will see you as a structured problem solver.
Last but not least, we would expect you be thriving in being part of a team and having the ability to exchange knowledge and expertise, internally as well as externally.
Be The First To Know
About the latest Devops engineers Jobs in Hyderabad !
Site Reliability Engineer
Posted today
Job Viewed
Job Description
- Experience with supporting Java (J2EE/Spring Boot) based multi-tier applications with complex upstream downstream interactions having expertise in understanding the application request flow and analysing application logs for investigating and troubleshooting issues and application break .
- Ability to work in a dynamic environment with ability to self-organize and plan and prioritise the work in an environment where multiple issues compete for attention.
- Contribute in developing and implementing automated CI/CD capability for our application .
- Contribute in our continuous improvement and continuous delivery while increasing maturity of DevOps practices.
- Get involved in the discussions and provide inputs in designing a fully automated, robust and secure infrastructure.
- Collaborate closely with other internal SRE and Dev teams/business users in investigating, testing and deployments
- Responsible for handling Release Management, raising Change Request and scheduling for the implementation of fixes and enhancements.
- Work effectively in collaboration with different teams either local or remote.
- Work towards high availability of our applications by putting in right Observability in place.
- Support our production environment with strong performance tuning, end-to-end troubleshooting, networking fundamentals skills.
- Willingness to in rotational shifts/On-Call rosters as part of 24x7 teams supporting critical applications.
Requirements
- Minimum 5-7 years’ experience as a Site Reliability engineer supporting different application and application infrastructure in a Hybrid-cloud platforms with mix of On-Prem and AWS/GCP
- Ability to support Java (J2EE/Spring Boot) or .NET applications and manage Incident and support recovery of the application and drive root cause analysis, management communication and client relationship management in partnership with Infrastructure Service Support team members .
- Ensures all production changes are made in accordance with life-cycle methodology and risk guidelines
- Application Support, Deployment of Release, patches & fixes on Platform
- Analyse application performance, perform tuning and ensure high availability & stability of platform.
- Knowledge of Batch Processing systems and tools
- Knowledge of Unix/Linux system and containerization and container orchestration platforms and platforms (viz., Docker, Cloud Foundry, OpenShift, Kubernetes) etc.
- Strong scripting skills ability automate manual tasks which could be easily converted to a script - shell, Python or PowerShell.
- Familiarity with usage of Observability tools like Grafana, Kibana, AppDynamics etc.
- Experienced in AWS/GCP Public cloud services
- Hands on experience any of the CI/CD tools viz., Jenkins, Circle-CI, GitHub Actions and ability to understand and define different deployment strategies.
- Hands-on experience with GIT. Managing deployment and branching with in GIT