DevOps Engineers

Hyderabad, Andhra Pradesh GSPANN

Posted today

Job Viewed

Tap Again To Close

Job Description

Description GSPANN is hiring a DevOps Engineer for its Pune or Hyderabad location. This full-time role involves managing cloud infrastructure, automating deployments, and building scalable CI/CD pipelines using tools like Azure, Kubernetes, Terraform, and Python.

Role and Responsibilities

  • Manage cloud-based production environments, with a strong preference for Microsoft Azure.
  • Script automation tasks efficiently using Python (preferred).
  • Deploy infrastructure using tools like Ansible, Terraform, and Azure DevOps.
  • Orchestrate containers with a deep understanding of Kubernetes and Docker.
  • Apply configuration management using tools such as Chef, Ansible, and AWS CodeDeploy.
  • Work with Continuous Integration/Continuous Deployment (CI/CD) tools including GitLab, Jenkins, Bamboo, Travis CI, and CircleCI.
  • Troubleshoot complex technical issues and deliver reliable and scalable solutions.
  • Operate independently in high-paced environments while demonstrating ownership and accountability.
  • Design and implement CI/CD pipelines that enhance software delivery speed and reliability.
  • Leverage Infrastructure as Code (IaC) practices using Terraform, Ansible, and Azure DevOps for scalable infrastructure management.
  • Improve system consistency and automation through effective use of configuration management tools.
  • Showcase strong soft skills including ownership, collaboration, and analytical problem-solving.
  • Skills and Experience

  • Bachelor's degree in Computer Science, Information Science, Engineering, or a related field.
  • 3-8 years of experience in a DevOps role with hands-on involvement in deployment and automation processes.
  • Maintain high availability, low latency, and peak performance of global e-commerce platforms.
  • Strengthen operational excellence by embedding observability best practices and advanced monitoring solutions.
  • Collaborate across engineering, operations, and product teams to boost system reliability and deployment workflows.
  • Build and manage robust CI/CD pipelines to ensure efficient and secure software releases.
  • Create automation tools that streamline incident response and application deployments.
  • Apply DevOps methodologies to reduce system downtime and enhance performance.
  • Track error budgets, meet defined Service Level Objectives (SLOs), and uphold critical service uptime.
  • Automate infrastructure provisioning and scaling to manage resource efficiency and traffic fluctuations.
  • Detect and resolve performance bottlenecks and system inefficiencies before they escalate.
  • This advertiser has chosen not to accept applicants from your region.

    Site Reliability Engineer

    Hyderabad, Andhra Pradesh Amgen

    Posted 8 days ago

    Job Viewed

    Tap Again To Close

    Job Description

    **Join Amgen's Mission of Serving Patients**
    At Amgen, if you feel like you're part of something bigger, it's because you are. Our shared mission-to serve patients living with serious illnesses-drives all that we do.
    Since 1980, we've helped pioneer the world of biotech in our fight against the world's toughest diseases. With our focus on four therapeutic areas -Oncology, Inflammation, General Medicine, and Rare Disease- we reach millions of patients each year. As a member of the Amgen team, you'll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller happier lives.
    Our award-winning culture is collaborative, innovative, and science based. If you have a passion for challenges and the opportunities that lay within them, you'll thrive as part of the Amgen team. Join us and transform the lives of patients while transforming your career.
    Site Reliability Engineer
    **What you will do**
    Let's do this. Let's change the world. In this vital role you will responsible for the reliability, stability, performance, scalability, and security of platforms that support Amgen's digital products and engineering teams. This hands-on role focuses on supporting cloud-based infrastructure, automating operations, maintaining observability, and improving platform reliability through code.
    You'll work closely with senior engineers and cross-functional teams to support CI/CD workflows, container platforms, incident response, and enterprise tooling-all while adopting modern SRE principles and practices.
    This role is ideal for engineers who have foundational site reliability experience and are looking to expand their skills in a cloud-native, enterprise-scale environment.
    **Roles & Responsibilities:**
    **Infrastructure & Platform Support**
    + Provision and manage cloud infrastructure using Infrastructure as Code (IaC)
    + Support container orchestration platforms, ensuring availability, access control, and resource management
    + Assist in configuring and maintaining CI/CD pipelines and environments
    **Monitoring & Incident Response**
    + Set up and maintain observability tools to track system health and performance
    + Participate in alert tuning, incident resolution, and root cause analysis
    + Support integration of observability platforms with incident response workflows
    **Automation & Platform Operations**
    + Automate routine platform tasks such as provisioning, patching, and configuration
    + Write scripts to improve platform reliability, reduce manual work, and enforce compliance
    + Participate in platform upgrades, maintenance windows, and service validation efforts
    **AI Enablement & Intelligence**
    + Support the adoption of AI-assisted operational tools for log analysis, anomaly detection, and predictive alerts
    + Collaborate with senior engineers to evaluate AI/ML-based observability and automation platforms
    + Assist in integrating AI-driven insights into dashboards, alerts, or incident workflows
    + Stay current with emerging AI trends in infrastructure and site reliability, and contribute to tool evaluations and pilots
    **Collaboration & Enablement**
    + Work with development, QA, and security teams to ensure reliable and secure deployments
    + Document operational procedures, playbooks, and system runbooks
    + Learn and support enterprise collaboration platforms and internal tooling
    + Participate in Agile and SAFe delivery processes-including sprint planning, stand-ups, retrospectives, and PI planning-to ensure security and platform reliability are embedded across development cycles.
    **What we expect of you**
    We are all different, yet we all use our unique contributions to serve patients. The (vital attribute) professional we seek is a (type of person) with these qualifications.
    **Basic Qualifications:**
    + Master's degree / Bachelor's degree and 5 to 9 years in Computer Science, IT or related field
    + 4 years of hands-on related experience in site reliability, DevOps, or platform engineering roles
    + Hands-on experience with cloud platforms preferably AWS
    + Familiarity with Kubernetes or container orchestration technologies
    + Exposure to CI/CD practices and pipeline automation
    + Experience troubleshooting Linux systems, processes, and services
    **Preferred Qualifications:**
    **Must-Have Skills:**
    + Practical experience with **cloud platforms** (e.g., AWS, Azure, or GCP), including compute, networking, IAM, and storage services
    + Familiarity with **container orchestration platforms** (e.g., Kubernetes, Docker), including basic workload deployment and troubleshooting
    + Experience using **Infrastructure as Code (IaC)** tools such as **Terraform** or **CloudFormation**
    + Working knowledge of **Linux administration** , including system services, package management, and file system structures
    + Hands-on exposure to **CI/CD platforms** (e.g., GitLab CI, Jenkins, GitHub Actions) and pipeline troubleshooting
    + Proficiency in **scripting or automation languages** like **Python** , **Bash** , or **Go**
    + Exposure to **observability tooling** (e.g., **Dynatrace** , **Prometheus** , or **Grafana** ) for monitoring and alerting
    + Familiarity with **incident management practices** and tools (e.g., runbooks, escalation workflows, basic alert tuning)
    + Version control skills using **Git** and understanding of branching strategies
    + Experience supporting or integrating **enterprise collaboration platforms** (e.g., Jira, Confluence, ServiceNow)
    + Interest and basic understanding of **AI/ML tools** used in infrastructure and operations (e.g., anomaly detection, intelligent alerting, log analysis)
    **Good-to-Have Skills:**
    + Experience using Infrastructure as Code (IaC) tools like Terraform or CloudFormation
    + Familiarity with IT incident response workflows and ticketing platforms
    + Knowledge of secrets management, configuration management tools (e.g., Ansible), or logging frameworks
    + Exposure to **AI-assisted tooling** (e.g., AIOps platforms, AI-enhanced alerting, anomaly detection)
    **Professional Certifications (Preferred)**
    + Cloud DevOps Certification (AWS/Azure/GCP)
    + Certified Kubernetes Administrator (CKA) or Security Specialist (CKS)
    + CI/CD Platform Certification
    + ITIL Foundation or equivalent service management certification
    **Soft Skills:**
    + Strong analytical and troubleshooting skills
    + Collaborative and proactive mindset
    + Effective communication and documentation practices
    + Curiosity and willingness to adopt new tools and methods, including AI integrations
    + Ability to manage time and prioritize tasks in dynamic environments
    **Shift Information:** This position is an onsite role and may require working during later hours to align with business hours. Candidates must be willing and able to work outside of standard hours as required to meet business needs.
    **What you can expect of us**
    As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we'll support your journey every step of the way.
    In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards.
    **Apply now and make a lasting impact with the Amgen team.**
    **careers.amgen.com**
    As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease.
    Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law.
    We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
    This advertiser has chosen not to accept applicants from your region.

    Site Reliability Engineer

    Hyderabad, Andhra Pradesh IntraEdge

    Posted 10 days ago

    Job Viewed

    Tap Again To Close

    Job Description

    Site Reliability Engineer

    Experience: 7+ Years

    Location: Hyderabad

    Hybrid 4-day office and 1 Day remote


    Skills for Principal:

    • Strong leadership and people management skills.
    • Exceptional technical proficiency in Pearson's technology stack.
    • Advanced project management capabilities.
    • Excellent communication and collaboration skills.
    • Adept at risk assessment and crisis management.
    • Strategic thinking with a focus on long-term operational excellence.
    • Champion operational excellence by directing initiatives that elevate system reliability, availability, and overall efficiency.
    • Function as the diplomatic link that binds the SRE team to other organizational units, harmonizing goals, and facilitating collaboration for mutual success.
    • Cultivate an environment of excellence, propelling the development of SRE engineers, and Sr. SRE engineers.
    • 7+ years of professional work experience as described above.
    This advertiser has chosen not to accept applicants from your region.

    Site Reliability Engineer

    Hyderabad, Andhra Pradesh IntraEdge

    Posted 3 days ago

    Job Viewed

    Tap Again To Close

    Job Description

    Site Reliability Engineer

    Experience: 7+ Years

    Location: Hyderabad

    Hybrid 4-day office and 1 Day remote

    Skills for Principal:

    • Strong leadership and people management skills.
    • Exceptional technical proficiency in Pearson's technology stack.
    • Advanced project management capabilities.
    • Excellent communication and collaboration skills.
    • Adept at risk assessment and crisis management.
    • Strategic thinking with a focus on long-term operational excellence.
    • Champion operational excellence by directing initiatives that elevate system reliability, availability, and overall efficiency.
    • Function as the diplomatic link that binds the SRE team to other organizational units, harmonizing goals, and facilitating collaboration for mutual success.
    • Cultivate an environment of excellence, propelling the development of SRE engineers, and Sr. SRE engineers.
    • 7+ years of professional work experience as described above.
    This advertiser has chosen not to accept applicants from your region.

    Site Reliability Engineer

    Hyderabad, Andhra Pradesh IntraEdge

    Posted today

    Job Viewed

    Tap Again To Close

    Job Description

    Site Reliability Engineer

    Experience: 7+ Years

    Location: Hyderabad

    Hybrid 4-day office and 1 Day remote

    Skills for Principal:

    • Strong leadership and people management skills.
    • Exceptional technical proficiency in Pearson's technology stack.
    • Advanced project management capabilities.
    • Excellent communication and collaboration skills.
    • Adept at risk assessment and crisis management.
    • Strategic thinking with a focus on long-term operational excellence.
    • Champion operational excellence by directing initiatives that elevate system reliability, availability, and overall efficiency.
    • Function as the diplomatic link that binds the SRE team to other organizational units, harmonizing goals, and facilitating collaboration for mutual success.
    • Cultivate an environment of excellence, propelling the development of SRE engineers, and Sr. SRE engineers.
    • 7+ years of professional work experience as described above.

    This advertiser has chosen not to accept applicants from your region.

    Site Reliability Engineer

    Hyderabad, Andhra Pradesh Anicalls (Pty) Ltd

    Posted today

    Job Viewed

    Tap Again To Close

    Job Description

    The Role
    Mentor teammates on SRE best practices and guide technical direction
    Work closely with the product engineering team to rapidly deliver capabilities
    Automate and optimize developer pipelines
    Build monitoring to assess system and pipeline health


    Qualifications:
    Proficiency in Python, Go, Ruby, or Java is a plus
    Expertise in Linux administration, configuration, and networking protocols
    Experience managing and automating cloud infrastructure in AWS, Google Cloud Platform, and/or Azure
    Exemplary experience with tools such as Kubernetes, Terraform, Ansible, Puppet, and Chef
    5+ years experience in Site Reliability Engineering or DevOps

    This advertiser has chosen not to accept applicants from your region.

    Site Reliability Engineer

    Hyderabad, Andhra Pradesh WSAudiology

    Posted today

    Job Viewed

    Tap Again To Close

    Job Description

    Driven by the passion to improve quality of people’s lives, WS Audiology continues to grow as market leader in the hearing aid industry. With our commitment to increase penetration in an underserved hearing care market, we want to accelerate our business transformation in order to reach more people, more effectively.

    We are looking for Site Reliability Engineers (SREs) with domain expertise in at least one of the following fields: containers, public clouds, and cloud-native workloads.

    As an SRE you will be responsible for ensuring the reliability, performance, and security of the operational backbone of a partly medical cloud-based product suite.

    What you will do

  • Join our innovative team and build awareness about how our hearing aids and service solutions can change customers’ lives by making wonderful sound part of everyone's life.
  • Design and implement future-proof, scalable, and highly available software systems, services and applications.
  • Monitor system performance and identify potential issues before they become problems.
  • Troubleshoot and resolve operational issues quickly and efficiently, utilising a data-based approach.
  • Automate repetitive tasks and implement continuous improvement processes to increase efficiency and reliability.
  • Influence architectural decisions with a focus on security, scalability and high-performance
  • Participate in a shared and compensated OnCall rotation (approx. 1 week every 6–8 weeks)
  • Support a structured incident management process with automated alerting and clear escalation paths
  • Ensure compliance with security and regulatory requirements.
  • Provide technical guidance and support to SaaS product development teams, other IT staff and stakeholders.
  • Continuously evaluate and adopt new technologies and tools to improve system performance and reliability.
  • What you bring

  • 3 - 5 years of experience building and maintaining SaaS infrastructure
  • Expert skills with networking, storage and virtualization automation with tools like Kubernetes, Bicep, Terraform
  • Setting up and supporting CI/ CD
  • In-depth knowledge of distributed systems, networking, operating systems and databases
  • Experience with cloud computing platforms (e.g. Azure, AWS or GCP).
  • Proficiency in a high-level programming language like .Net Core/ C#, Python or Java
  • What we offer

  • Opportunities for career growth through certifications (e.g., Kubernetes, Azure), mentorship and internal mobility
  • Budget for learning & development (technical training, conferences, certifications
  • Personal competencies

  • You are proactive and eager to take ownership while collaborating respectfully in a global teamHigh communication and collaboration skills in an international environment
  • Ability to work effectively in a fast-paced, dynamic environment.
  • You are a continuous learner and enjoy mentoring others or being mentored
  • If you are a highly motivated and skilled SRE who is enthusiastic about ensuring the reliability and performance of complex software systems, we encourage you to apply.

    And beyond your professional qualifications, we are looking for the following:

    You take initiative and ownership when something is not working. Furthermore, you are the type of profile, that gives inputs and suggestions for, how processes can be improved.

    You are able to both give and receive constructive feedback and people will see you as a structured problem solver.

    Last but not least, we would expect you be thriving in being part of a team and having the ability to exchange knowledge and expertise, internally as well as externally.

    This advertiser has chosen not to accept applicants from your region.
    Be The First To Know

    About the latest Devops engineers Jobs in Hyderabad !

    Site Reliability Engineer

    Hyderabad, Andhra Pradesh Unison Consulting Pte Ltd

    Posted today

    Job Viewed

    Tap Again To Close

    Job Description

    • Experience with supporting Java (J2EE/Spring Boot) based multi-tier applications with complex upstream downstream interactions having expertise in understanding the application request flow and analysing application logs for investigating and troubleshooting issues and application break .
    • Ability to work in a dynamic environment with ability to self-organize and plan and prioritise the work in an environment where multiple issues compete for attention.
    • Contribute in developing and implementing automated CI/CD capability for our application .
    • Contribute in our continuous improvement and continuous delivery while increasing maturity of DevOps practices.
    • Get involved in the discussions and provide inputs in designing a fully automated, robust and secure infrastructure.
    • Collaborate closely with other internal SRE and Dev teams/business users in investigating, testing and deployments
    • Responsible for handling Release Management, raising Change Request and scheduling for the implementation of fixes and enhancements.
    • Work effectively in collaboration with different teams either local or remote.
    • Work towards high availability of our applications by putting in right Observability in place.
    • Support our production environment with strong performance tuning, end-to-end troubleshooting, networking fundamentals skills.
    • Willingness to in rotational shifts/On-Call rosters as part of 24x7 teams supporting critical applications.

    Requirements

    • Minimum 5-7 years’ experience as a Site Reliability engineer supporting different application and application infrastructure in a Hybrid-cloud platforms with mix of On-Prem and AWS/GCP
    • Ability to support Java (J2EE/Spring Boot) or .NET applications and manage Incident and support recovery of the application and drive root cause analysis, management communication and client relationship management in partnership with Infrastructure Service Support team members .
    • Ensures all production changes are made in accordance with life-cycle methodology and risk guidelines
    • Application Support, Deployment of Release, patches & fixes on Platform
    • Analyse application performance, perform tuning and ensure high availability & stability of platform.
    • Knowledge of Batch Processing systems and tools
    • Knowledge of Unix/Linux system and containerization and container orchestration platforms and platforms (viz., Docker, Cloud Foundry, OpenShift, Kubernetes) etc.
    • Strong scripting skills ability automate manual tasks which could be easily converted to a script - shell, Python or PowerShell.
    • Familiarity with usage of Observability tools like Grafana, Kibana, AppDynamics etc.
    • Experienced in AWS/GCP Public cloud services
    • Hands on experience any of the CI/CD tools viz., Jenkins, Circle-CI, GitHub Actions and ability to understand and define different deployment strategies.
    • Hands-on experience with GIT. Managing deployment and branching with in GIT
    This advertiser has chosen not to accept applicants from your region.
     

    Nearby Locations

    Other Jobs Near Me

    Industry

    1. request_quote Accounting
    2. work Administrative
    3. eco Agriculture Forestry
    4. smart_toy AI & Emerging Technologies
    5. school Apprenticeships & Trainee
    6. apartment Architecture
    7. palette Arts & Entertainment
    8. directions_car Automotive
    9. flight_takeoff Aviation
    10. account_balance Banking & Finance
    11. local_florist Beauty & Wellness
    12. restaurant Catering
    13. volunteer_activism Charity & Voluntary
    14. science Chemical Engineering
    15. child_friendly Childcare
    16. foundation Civil Engineering
    17. clean_hands Cleaning & Sanitation
    18. diversity_3 Community & Social Care
    19. construction Construction
    20. brush Creative & Digital
    21. currency_bitcoin Crypto & Blockchain
    22. support_agent Customer Service & Helpdesk
    23. medical_services Dental
    24. medical_services Driving & Transport
    25. medical_services E Commerce & Social Media
    26. school Education & Teaching
    27. electrical_services Electrical Engineering
    28. bolt Energy
    29. local_mall Fmcg
    30. gavel Government & Non Profit
    31. emoji_events Graduate
    32. health_and_safety Healthcare
    33. beach_access Hospitality & Tourism
    34. groups Human Resources
    35. precision_manufacturing Industrial Engineering
    36. security Information Security
    37. handyman Installation & Maintenance
    38. policy Insurance
    39. code IT & Software
    40. gavel Legal
    41. sports_soccer Leisure & Sports
    42. inventory_2 Logistics & Warehousing
    43. supervisor_account Management
    44. supervisor_account Management Consultancy
    45. supervisor_account Manufacturing & Production
    46. campaign Marketing
    47. build Mechanical Engineering
    48. perm_media Media & PR
    49. local_hospital Medical
    50. local_hospital Military & Public Safety
    51. local_hospital Mining
    52. medical_services Nursing
    53. local_gas_station Oil & Gas
    54. biotech Pharmaceutical
    55. checklist_rtl Project Management
    56. shopping_bag Purchasing
    57. home_work Real Estate
    58. person_search Recruitment Consultancy
    59. store Retail
    60. point_of_sale Sales
    61. science Scientific Research & Development
    62. wifi Telecoms
    63. psychology Therapy
    64. pets Veterinary
    View All Devops Engineers Jobs View All Jobs in Hyderabad