1,365 Site Reliability Engineer jobs in India

Site Reliability Engineer

Bengaluru, Karnataka Oracle

Posted today

Job Viewed

Tap Again To Close

Job Description

**Job Description**
Looking for a DevOps Senior Engineer in the Data Engineering team who can help us support next-generation Analytics applications over Oracle cloud.
This posting is for DevOps Senior Engineer in the Oracle Analytics Warehouse product development organization. Fully handled Cloud service that provides customers a turn-key enterprise warehouse on the cloud for Fusion Applications. The service is being built on a sophisticated technology stack demonstrating a brand-new data integration platform and the industry's most sophisticated analytical business analytics platform.
are looking for senior engineer with experience in supporting data warehousing products. As a member of the Product development organization, focus will be on working with development teams, providing timely support to customers and identify/implementing process automation, for cloud BI product.
· BE or equivalent experience or higher degree in Computer Science / Engineering or equivalent from top university
· validated experience, supporting business customers on any Cloud/On-premise BI Application
· Proficiency in using Python to interact with the Apache Spark framework for big data processing
· Experience in SQL/PL-SQL and excellent de-bugging skills
· Experience in Diagnosing network latency and intermittent issues, Reading and analyzing log files
· Good Functional Knowledge in domains like ERP, HCM, SCP and CX
· Working experience with any ERP/in-demand application such as Oracle EBS, Fusion is helpful
· Good programming skills in Python/Java
· Exposure to cloud infrastructure, Oracle Cloud Infrastructure (OCI) is helpful
· Experience in performance tuning SQL and understanding ETL pipelines
· Build, Configure, Manage and Coordinate all Build and Release engineering activities
· Strong logical/critical thinking and problem resolution skill
· Excellent interpersonal skills
**Responsibilities**
Roles and Responsibilities:
· As member of Pipeline Production Operations, you will address customer issues and tickets within defined SLA's
· Proactively identify and resolve potential problems in an effort to prevent them from occurring and improve the overall customer experiences
· You will approach each case with a goal of ensuring Oracle Analytics products are performing at an efficient level by addressing any underlying or additional problems uncovered during each
Customer engagement.
· Co-ordinate and connect with different team members to formulate the solutions to customer issues
· You will ensure full understanding of the issue, including impact to customer.
You will recommend solutions to customers and follow through to resolution or escalate the case in a timely manner if no resolution can be found.
· Bring together logs, configuration details and attempt to reproduce the reported issues.
· Develop and improve Knowledge base for the issues and their solutions.
Participate in knowledge sharing via involvement in technical discussions and Knowledge Base documentation.
Prioritize workload based on severity and demonstrate a sense of urgency when handling cases.
Find opportunities for process improvements and automation through building right utilities/tools
Willing to be working in Shifts and weekends based on support rota.
Career Level - IC3
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing or by calling in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer

Hyderabad, Andhra Pradesh Oracle

Posted today

Job Viewed

Tap Again To Close

Job Description

**Job Description**
Looking for a DevOps Senior Engineer in the Data Engineering team who can help us support next-generation Analytics applications over Oracle cloud.
This posting is for DevOps Senior Engineer in the Oracle Analytics Warehouse product development organization. Fully handled Cloud service that provides customers a turn-key enterprise warehouse on the cloud for Fusion Applications. The service is being built on a sophisticated technology stack demonstrating a brand-new data integration platform and the industry's most sophisticated analytical business analytics platform.
are looking for senior engineer with experience in supporting data warehousing products. As a member of the Product development organization, focus will be on working with development teams, providing timely support to customers and identify/implementing process automation, for cloud BI product.
· BE or equivalent experience or higher degree in Computer Science / Engineering or equivalent from top university
· validated experience, supporting business customers on any Cloud/On-premise BI Application
· Proficiency in using Python to interact with the Apache Spark framework for big data processing
· Experience in SQL/PL-SQL and excellent de-bugging skills
· Experience in Diagnosing network latency and intermittent issues, Reading and analyzing log files
· Good Functional Knowledge in domains like ERP, HCM, SCP and CX
· Working experience with any ERP/in-demand application such as Oracle EBS, Fusion is helpful
· Good programming skills in Python/Java
· Exposure to cloud infrastructure, Oracle Cloud Infrastructure (OCI) is helpful
· Experience in performance tuning SQL and understanding ETL pipelines
· Build, Configure, Manage and Coordinate all Build and Release engineering activities
· Strong logical/critical thinking and problem resolution skill
· Excellent interpersonal skills
**Responsibilities**
Roles and Responsibilities:
· As member of Pipeline Production Operations, you will address customer issues and tickets within defined SLA's
· Proactively identify and resolve potential problems in an effort to prevent them from occurring and improve the overall customer experiences
· You will approach each case with a goal of ensuring Oracle Analytics products are performing at an efficient level by addressing any underlying or additional problems uncovered during each
Customer engagement.
· Co-ordinate and connect with different team members to formulate the solutions to customer issues
· You will ensure full understanding of the issue, including impact to customer.
You will recommend solutions to customers and follow through to resolution or escalate the case in a timely manner if no resolution can be found.
· Bring together logs, configuration details and attempt to reproduce the reported issues.
· Develop and improve Knowledge base for the issues and their solutions.
Participate in knowledge sharing via involvement in technical discussions and Knowledge Base documentation.
Prioritize workload based on severity and demonstrate a sense of urgency when handling cases.
Find opportunities for process improvements and automation through building right utilities/tools
Willing to be working in Shifts and weekends based on support rota.
Career Level - IC3
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing or by calling in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer

Bengaluru, Karnataka Autodesk

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

**Job Requisition ID #**
25WD89004
**Position Overview**
Are you excited about shaping the future of platform development through cutting-edge technologies and cloud services?
Autodesk is searching for a skilled Platform Development Engineer to join our Platform Services and Emerging Technologies team. In this role, you will play a pivotal role in designing, developing, and optimizing our cloud-based platform, with a focus on serverless computing, virtual machines, and container orchestration services in both AWS and Azure environments. If you are passionate about leveraging cloud services to build scalable and resilient platforms while driving innovation and collaboration, this is the perfect opportunity for you.
**Minimum Qualifications**
+ Bachelor's degree in Computer Science, Computer Engineering, or a related field, with a minimum of 2 years of experience in platform development or a similar role.
+ 2+ years of hands-on software development experience in Python, Java, Go, NodeJS, or .NET.
+ Proficiency in serverless computing services such as AWS Lambda and Azure Functions.
+ Hands-on experience with virtual machine services like AWS EC2 and Azure Virtual Machines.
+ Familiarity with container orchestration services such as AWS ECS, Azure Container Instances (ACI), and Azure Kubernetes Service (AKS).
+ Strong understanding of Infrastructure as Code (IaaC) principles and tools like Terraform and CloudFormation.
+ Experience designing and implementing scalable, high-performance platform solutions.
+ Experience implementing unit and integration tests.
**Preferred Qualifications**
+ AWS certifications such as AWS Certified Solutions Architect or AWS Certified Developer.
+ Azure certifications such as Azure Solutions Architect or Azure Developer Associate.
+ Proficiency in CI/CD pipelines for automated deployment and testing.
+ Experience with monitoring and logging tools such as CloudWatch, Azure Monitor, and ELK Stack.
**The Ideal Candidate**
+ You have a strong background in platform development, with expertise in serverless computing, virtual machines, and container orchestration services in both AWS and Azure environments.
+ You are well-versed in Infrastructure as Code (IaaC) principles and tools.
+ You thrive in a collaborative environment and are passionate about leveraging cloud services to build scalable and resilient platforms.
+ You stay updated with the latest trends and best practices in cloud computing and platform development.
#LI-AK1
**Learn More**
**About Autodesk**
Welcome to Autodesk! Amazing things are created every day with our software - from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.
We take great pride in our culture here at Autodesk - it's at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.
When you're an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!
**Salary transparency**
Salary is one part of Autodesk's competitive compensation package. Offers are based on the candidate's experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.
**Diversity & Belonging**
We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: you an existing contractor or consultant with Autodesk?**
Please search for open jobs and apply internally (not on this external site).
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer

Chennai, Tamil Nadu UPS

Posted 4 days ago

Job Viewed

Tap Again To Close

Job Description

**Avant de postuler à un emploi, sélectionnez votre langue de préférence parmi les options disponibles en haut à droite de cette page.**
Découvrez votre prochaine opportunité au sein d'une organisation qui compte parmi les 500 plus importantes entreprises mondiales. Envisagez des opportunités innovantes, découvrez notre culture enrichissante et travaillez avec des équipes talentueuses qui vous poussent à vous développer chaque jour. Nous savons ce qu'il faut faire pour diriger UPS vers l'avenir : des personnes passionnées dotées d'une combinaison unique de compétences. Si vous avez les qualités, de la motivation, de l'autonomie ou le leadership pour diriger des équipes, il existe des postes adaptés à vos aspirations et à vos compétences d'aujourd'hui et de demain.
**Fiche de poste :**
**Job Summary:**
We are seeking a skilled and proactive **Site Reliability Engineer (SRE)** with 5-8 years of experience and deep expertise in **Google Cloud Platform (GCP)** . The ideal candidate will be responsible for the reliability, availability, and performance of cloud-based applications and infrastructure. You will collaborate with development, operations, and security teams to build and maintain scalable, secure, and highly available systems.
**Key Responsibilities:**
+ Design, develop, and maintain **reliable, scalable, and highly available systems** on GCP.
+ Build and manage **CI/CD pipelines** , infrastructure as code (IaC), and monitoring solutions.
+ Proactively monitor and manage **system performance, uptime, and capacity** using observability tools.
+ Troubleshoot and resolve **infrastructure and application-level issues** in real-time.
+ Implement and maintain **disaster recovery** , **failover mechanisms** , and **backup strategies** .
+ Automate repetitive tasks and processes to improve **efficiency and reduce toil** .
+ Participate in **on-call rotations** , incident management, and root cause analysis (RCA).
+ Ensure compliance with **security standards, privacy regulations, and governance policies** .
+ Collaborate with cross-functional teams to support **DevOps and SRE best practices** .
+ Drive improvements in **SLAs, SLOs, and error budgets** through data-driven insights.
**Required Qualifications:**
+ 5-8 years of relevant experience as an SRE, DevOps Engineer, or Cloud Infrastructure Engineer.
+ Strong hands-on experience with **Google Cloud Platform (GCP)** - Compute Engine, GKE, Cloud Functions, Cloud Storage, IAM, BigQuery, etc.
+ Proficiency in **Infrastructure as Code** tools like **Terraform** , **Deployment Manager** , or **CloudFormation** .
+ Experience with **Kubernetes** , **Docker** , and container orchestration.
+ Proficiency in scripting languages like **Python** , **Shell** , or **Go** .
+ Deep understanding of **monitoring and logging tools** such as **Prometheus** , **Grafana** , **Stackdriver** , or **Datadog** .
+ Knowledge of **CI/CD tools** such as Jenkins, GitLab CI, or Cloud Build.
+ Experience with **incident response** , **postmortem analysis** , and **site reliability principles** .
+ Strong problem-solving and communication skills.
**Preferred Qualifications:**
+ GCP certifications (e.g., **Professional Cloud DevOps Engineer** , **Cloud Architect** ).
+ Exposure to **multi-cloud environments** or hybrid cloud infrastructure.
+ Familiarity with **Agile** and **ITIL** frameworks.
+ Experience working in regulated environments with compliance standards (e.g., ISO, SOC2).
**Type de contrat:**
en CDI
_Chez UPS, égalité des chances, traitement équitable et environnement de travail inclusif sont des valeurs clefs auxquelles nous sommes attachés._
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer

Chennai, Tamil Nadu UPS

Posted 4 days ago

Job Viewed

Tap Again To Close

Job Description

**Before you apply to a job, select your language preference from the options available at the top right of this page.**
Explore your next opportunity at a Fortune Global 500 organization. Envision innovative possibilities, experience our rewarding culture, and work with talented teams that help you become better every day. We know what it takes to lead UPS into tomorrow-people with a unique combination of skill + passion. If you have the qualities and drive to lead yourself or teams, there are roles ready to cultivate your skills and take you to the next level.
**Job Description:**
**Job Summary:**
We are seeking a skilled and proactive **Site Reliability Engineer (SRE)** with 5-8 years of experience and deep expertise in **Google Cloud Platform (GCP)** . The ideal candidate will be responsible for the reliability, availability, and performance of cloud-based applications and infrastructure. You will collaborate with development, operations, and security teams to build and maintain scalable, secure, and highly available systems.
**Key Responsibilities:**
+ Design, develop, and maintain **reliable, scalable, and highly available systems** on GCP.
+ Build and manage **CI/CD pipelines** , infrastructure as code (IaC), and monitoring solutions.
+ Proactively monitor and manage **system performance, uptime, and capacity** using observability tools.
+ Troubleshoot and resolve **infrastructure and application-level issues** in real-time.
+ Implement and maintain **disaster recovery** , **failover mechanisms** , and **backup strategies** .
+ Automate repetitive tasks and processes to improve **efficiency and reduce toil** .
+ Participate in **on-call rotations** , incident management, and root cause analysis (RCA).
+ Ensure compliance with **security standards, privacy regulations, and governance policies** .
+ Collaborate with cross-functional teams to support **DevOps and SRE best practices** .
+ Drive improvements in **SLAs, SLOs, and error budgets** through data-driven insights.
**Required Qualifications:**
+ 5-8 years of relevant experience as an SRE, DevOps Engineer, or Cloud Infrastructure Engineer.
+ Strong hands-on experience with **Google Cloud Platform (GCP)** - Compute Engine, GKE, Cloud Functions, Cloud Storage, IAM, BigQuery, etc.
+ Proficiency in **Infrastructure as Code** tools like **Terraform** , **Deployment Manager** , or **CloudFormation** .
+ Experience with **Kubernetes** , **Docker** , and container orchestration.
+ Proficiency in scripting languages like **Python** , **Shell** , or **Go** .
+ Deep understanding of **monitoring and logging tools** such as **Prometheus** , **Grafana** , **Stackdriver** , or **Datadog** .
+ Knowledge of **CI/CD tools** such as Jenkins, GitLab CI, or Cloud Build.
+ Experience with **incident response** , **postmortem analysis** , and **site reliability principles** .
+ Strong problem-solving and communication skills.
**Preferred Qualifications:**
+ GCP certifications (e.g., **Professional Cloud DevOps Engineer** , **Cloud Architect** ).
+ Exposure to **multi-cloud environments** or hybrid cloud infrastructure.
+ Familiarity with **Agile** and **ITIL** frameworks.
+ Experience working in regulated environments with compliance standards (e.g., ISO, SOC2).
**Employee Type:**
Permanent
UPS is committed to providing a workplace free of discrimination, harassment, and retaliation.
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer

Gurgaon, Haryana S&P Global

Posted 4 days ago

Job Viewed

Tap Again To Close

Job Description

**About the Role:**
**OSTTRA India**
**The Role: Site Reliability Engineer**
**The Team:** SRE is a global team that provides technical support across the suite of OSTTRA products. The SRE team works closely with a highly competent Technical Operation Centre (TOC), Development and Infrastructure teams to deliver proactive tasks to improve the supportability of our platforms. Our work helps to ensure that OSTTRA provides a high-quality service and maintains client satisfaction.
**The Impact:** Together, we build, support, protect and manage high-performance, resilient platforms that process more than 100 million messages a day. Our services are vital to automated trade processing around the globe, managing peak volumes and working with our customers and regulators to ensure the efficient settlement of trades and effective operation of global capital markets.
**What's in it for you:**
OSTTRA is seeking a Site Reliability Engineer professional to join the SRE Team. The role will be specialised into the designated platforms provisioning 2nd line technical support to TOC as well as integration support for our Trade Processing applications. This person will report directly to the regional SRE manager and work closely with an experienced global team to contribute to the quality of our support.
You will have 6-10 years' experience of roles like Site Reliability Engineer or Application Support with Project Management tasks to meet the needs of our expanding portfolio of Financial Services clients.
This role presents an excellent opportunity to be part of an agile team based out of India, collaborating with colleagues across multiple regions globally, with a strong focus on delivering value through self-service.
**Responsibilities:**
+ Your duties will include Capacity Management, Operational Support Design, Audit Preparation, Incident Escalation, Problem Management Engagement, DR Design and Execution and ad hoc High Profile Client Engagement for your designated platform(s) in our full suite of OTC Derivative products and FX for post-trade confirmation processing.
+ You will need to demonstrate excellent communication skills and have a natural ability to learn with a keen interest in technology. You must be a team player and enjoy working in a high-performance collaborative environment with multiple teams.
+ The successful candidate will need to be able to apply strong technical skills and good business knowledge, together with investigative techniques and problem-solving skills to identify gaps and improve overall estate to bring resilience and stability to the platform(s).
+ Liaising with other teams across Product, Development and particularly the infrastructure teams as required for 3rd line escalation. Technical advisory will be required at times by Product and business or clients for solution delivery.
+ Working closely with Development and Infrastructure team, to understand and ensure supportability of platforms and liaising with delivery teams to ensure readiness for new platform releases. Based in our Gurgaon office, you will be responsible for handling, identifying and communicating technical resolutions in English.
**What We're Looking For:**
+ University graduate or equivalent with background of bachelor's in computer science
+ Experience or having high motivation in managing the capacity, performance throughput and EOS/EOL of platform from infrastructure to software
+ Experience in troubleshooting of issues, defining supportability, soaking in software development life cycle SDLC process streamlining application delivery from Dev/QA to UAT/Production
+ Good understanding of Site Reliable Engineer as well as Application Support processes, supporting of incidents and execute/design disaster recovery
+ Strong ability to understand application architecture, able to effectively navigate to the problem area, and identify proactive measures around resiliency, recovery design
+ Ability to apply analytical methodology, such as trending, distribution etc., to get insight from application data to help troubleshooting and analysing best approach
+ Ability to understand business workflow and tie to technical implementation
+ Experience in reading and tracing Java, C++, Python and/or scripting languages
+ Experience of databases including SQL scripting, preferably but not limited to Oracle
**Good to Have:**
+ Understanding of networking principles, its practical uses and basic troubleshooting.
+ Possess the understanding of Cloud (AWS, GCP or Azure), PAAS and implementation with Kubernetes, OpenShift, Windows and Linux
+ Experience in handling client issues and expectation management
+ Good understanding of messaging platforms and protocols like XML, XSLT, IBM MQ, AMQ etc
+ Knowledge of financial messaging protocols like FIX, FPmL, TOF etc
+ Experience security protocols related to connectivity encryption utilizing SSL and TLS
+ Have experience of working in the Finance Industry
+ Knowledge of the Financial OTC Derivative and FX products
+ Awareness of Derivatives products and post trade processing (desirable)
**The Location: Gurgaon, India**
**About Company Statement:**
OSTTRA is a market leader in derivatives post-trade processing, bringing innovation, expertise, processes and networks together to solve the post-trade challenges of global financial markets. OSTTRA operates cross-asset post-trade processing networks, providing a proven suite of Credit Risk, Trade Workflow and Optimization services. Together these solutions streamline post-trade workflows, enabling firms to connect to counterparties and utilities, manage credit risk, reduce operational risk and optimize processing to drive post-trade efficiencies.
OSTTRA was formed in 2021 through the combination of four businesses that have been at the heart of post trade evolution and innovation for the last 20+ years: MarkitServ, Traiana, TriOptima and Reset. These businesses have an exemplary track record of developing and supporting critical market infrastructure and bring together an established community of market participants comprising all trading relationships and paradigms, connected using powerful integration and transformation capabilities.
**About OSTTRA**
_Candidates should note that OSTTRA is an_ _independent firm,_ _jointly owned by S&P Global and CME Group. As part of the joint venture, S&P Global_ _provides recruitment services_ _to OSTTRA - however, successful candidates will be interviewed and directly employed by OSTTRA, joining our global team of more than 1,200 post trade experts._
OSTTRA was formed in 2021 through the combination of four businesses that have been at the heart of post trade evolution and innovation for the last 20+ years: MarkitServ, Traiana, TriOptima and Reset. OSTTRA is a joint venture, owned 50/50 by S&P Global and CME Group.
With an outstanding track record of developing and supporting critical market infrastructure, our combined network connects thousands of market participants to streamline end to end workflows - from trade capture at the point of execution, through portfolio optimization, to clearing and settlement.
Joining the OSTTRA team is a unique opportunity to help build a bold new business with an outstanding heritage in financial technology, playing a central role in supporting global financial markets.
Learn more at .
**What's In It For** **You?**
**Benefits:**
We take care of you, so you can take care of business. We care about our people. That's why we provide everything you-and your career-need to thrive at S&P Global.
Our benefits include:
+ Health & Wellness: Health care coverage designed for the mind and body.
+ Flexible Downtime: Generous time off helps keep you energized for your time on.
+ Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills.
+ Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs.
+ Family Friendly Perks: It's not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families.
+ Beyond the Basics: From retail discounts to referral incentive awards-small perks can make a big difference.
For more information on benefits by country visit: Opportunity Employer**
S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment.
If you need an accommodation during the application process due to a disability, please send an email to:   and your request will be forwarded to the appropriate person. 
**US Candidates Only:** The EEO is the Law Poster   describes discrimination protections under federal law. Pay Transparency Nondiscrimination Provision - - Professional (EEO-2 Job Categories-United States of America), BSMGMT203 - Entry Professional (EEO Job Group)
**Job ID:**
**Posted On:**
**Location:** Gurgaon, Haryana, India
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer

Gurgaon, Haryana S&P Global

Posted 4 days ago

Job Viewed

Tap Again To Close

Job Description

**About the Role:**
**OSTTRA India**
**The Role: Site Reliability Engineer**
**The Team:** SRE is a global team that provides technical support across the suite of OSTTRA products. The SRE team works closely with a highly competent Technical Operation Centre (TOC), Development and Infrastructure teams to deliver proactive tasks to improve the supportability of our platforms. Our work helps to ensure that OSTTRA provides a high-quality service and maintains client satisfaction.
**The Impact:** Together, we build, support, protect and manage high-performance, resilient platforms that process more than 100 million messages a day. Our services are vital to automated trade processing around the globe, managing peak volumes and working with our customers and regulators to ensure the efficient settlement of trades and effective operation of global capital markets.
**What's in it for you:**
OSTTRA is seeking a Site Reliability Engineer professional to join the SRE Team. The role will be specialised into the designated platforms provisioning 2nd line technical support to TOC as well as integration support for our Trade Processing applications. This person will report directly to the regional SRE manager and work closely with an experienced global team to contribute to the quality of our support.
You will have 6-10 years' experience of roles like Site Reliability Engineer or Application Support with Project Management tasks to meet the needs of our expanding portfolio of Financial Services clients.
This role presents an excellent opportunity to be part of an agile team based out of India, collaborating with colleagues across multiple regions globally, with a strong focus on delivering value through self-service.
**Responsibilities:**
+ Your duties will include Capacity Management, Operational Support Design, Audit Preparation, Incident Escalation, Problem Management Engagement, DR Design and Execution and ad hoc High Profile Client Engagement for your designated platform(s) in our full suite of OTC Derivative products and FX for post-trade confirmation processing.
+ You will need to demonstrate excellent communication skills and have a natural ability to learn with a keen interest in technology. You must be a team player and enjoy working in a high-performance collaborative environment with multiple teams.
+ The successful candidate will need to be able to apply strong technical skills and good business knowledge, together with investigative techniques and problem-solving skills to identify gaps and improve overall estate to bring resilience and stability to the platform(s).
+ Liaising with other teams across Product, Development and particularly the infrastructure teams as required for 3rd line escalation. Technical advisory will be required at times by Product and business or clients for solution delivery.
+ Working closely with Development and Infrastructure team, to understand and ensure supportability of platforms and liaising with delivery teams to ensure readiness for new platform releases. Based in our Gurgaon office, you will be responsible for handling, identifying and communicating technical resolutions in English.
**What We're Looking For:**
+ University graduate or equivalent with background of bachelor's in computer science
+ Experience or having high motivation in managing the capacity, performance throughput and EOS/EOL of platform from infrastructure to software
+ Experience in troubleshooting of issues, defining supportability, soaking in software development life cycle SDLC process streamlining application delivery from Dev/QA to UAT/Production
+ Good understanding of Site Reliable Engineer as well as Application Support processes, supporting of incidents and execute/design disaster recovery
+ Strong ability to understand application architecture, able to effectively navigate to the problem area, and identify proactive measures around resiliency, recovery design
+ Ability to apply analytical methodology, such as trending, distribution etc., to get insight from application data to help troubleshooting and analysing best approach
+ Ability to understand business workflow and tie to technical implementation
+ Experience in reading and tracing Java, C++, Python and/or scripting languages
+ Experience of databases including SQL scripting, preferably but not limited to Oracle
**Good to Have:**
+ Understanding of networking principles, its practical uses and basic troubleshooting.
+ Possess the understanding of Cloud (AWS, GCP or Azure), PAAS and implementation with Kubernetes, OpenShift, Windows and Linux
+ Experience in handling client issues and expectation management
+ Good understanding of messaging platforms and protocols like XML, XSLT, IBM MQ, AMQ etc
+ Knowledge of financial messaging protocols like FIX, FPmL, TOF etc
+ Experience security protocols related to connectivity encryption utilizing SSL and TLS
+ Have experience of working in the Finance Industry
+ Knowledge of the Financial OTC Derivative and FX products
+ Awareness of Derivatives products and post trade processing (desirable)
**The Location: Gurgaon, India**
**About Company Statement:**
OSTTRA is a market leader in derivatives post-trade processing, bringing innovation, expertise, processes and networks together to solve the post-trade challenges of global financial markets. OSTTRA operates cross-asset post-trade processing networks, providing a proven suite of Credit Risk, Trade Workflow and Optimization services. Together these solutions streamline post-trade workflows, enabling firms to connect to counterparties and utilities, manage credit risk, reduce operational risk and optimize processing to drive post-trade efficiencies.
OSTTRA was formed in 2021 through the combination of four businesses that have been at the heart of post trade evolution and innovation for the last 20+ years: MarkitServ, Traiana, TriOptima and Reset. These businesses have an exemplary track record of developing and supporting critical market infrastructure and bring together an established community of market participants comprising all trading relationships and paradigms, connected using powerful integration and transformation capabilities.
**About OSTTRA**
_Candidates should note that OSTTRA is an_ _independent firm,_ _jointly owned by S&P Global and CME Group. As part of the joint venture, S&P Global_ _provides recruitment services_ _to OSTTRA - however, successful candidates will be interviewed and directly employed by OSTTRA, joining our global team of more than 1,200 post trade experts._
OSTTRA was formed in 2021 through the combination of four businesses that have been at the heart of post trade evolution and innovation for the last 20+ years: MarkitServ, Traiana, TriOptima and Reset. OSTTRA is a joint venture, owned 50/50 by S&P Global and CME Group.
With an outstanding track record of developing and supporting critical market infrastructure, our combined network connects thousands of market participants to streamline end to end workflows - from trade capture at the point of execution, through portfolio optimization, to clearing and settlement.
Joining the OSTTRA team is a unique opportunity to help build a bold new business with an outstanding heritage in financial technology, playing a central role in supporting global financial markets.
Learn more at .
**What's In It For** **You?**
**Benefits:**
We take care of you, so you can take care of business. We care about our people. That's why we provide everything you-and your career-need to thrive at S&P Global.
Our benefits include:
+ Health & Wellness: Health care coverage designed for the mind and body.
+ Flexible Downtime: Generous time off helps keep you energized for your time on.
+ Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills.
+ Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs.
+ Family Friendly Perks: It's not just about you. S&P Global has perks for your partners and little ones, too, with some best-in class benefits for families.
+ Beyond the Basics: From retail discounts to referral incentive awards-small perks can make a big difference.
For more information on benefits by country visit: Opportunity Employer**
S&P Global is an equal opportunity employer and all qualified candidates will receive consideration for employment without regard to race/ethnicity, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, marital status, military veteran status, unemployment status, or any other status protected by law. Only electronic job submissions will be considered for employment.
If you need an accommodation during the application process due to a disability, please send an email to:   and your request will be forwarded to the appropriate person. 
**US Candidates Only:** The EEO is the Law Poster   describes discrimination protections under federal law. Pay Transparency Nondiscrimination Provision - - Professional (EEO-2 Job Categories-United States of America), BSMGMT203 - Entry Professional (EEO Job Group)
**Job ID:**
**Posted On:**
**Location:** Gurgaon, Haryana, India
This advertiser has chosen not to accept applicants from your region.
Be The First To Know

About the latest Site reliability engineer Jobs in India !

Site Reliability Engineer

Hyderabad, Andhra Pradesh Amgen

Posted 4 days ago

Job Viewed

Tap Again To Close

Job Description

**Join Amgen's Mission of Serving Patients**
At Amgen, if you feel like you're part of something bigger, it's because you are. Our shared mission-to serve patients living with serious illnesses-drives all that we do.
Since 1980, we've helped pioneer the world of biotech in our fight against the world's toughest diseases. With our focus on four therapeutic areas -Oncology, Inflammation, General Medicine, and Rare Disease- we reach millions of patients each year. As a member of the Amgen team, you'll help make a lasting impact on the lives of patients as we research, manufacture, and deliver innovative medicines to help people live longer, fuller happier lives.
Our award-winning culture is collaborative, innovative, and science based. If you have a passion for challenges and the opportunities that lay within them, you'll thrive as part of the Amgen team. Join us and transform the lives of patients while transforming your career.
Site Reliability Engineer
**What you will do**
Let's do this. Let's change the world. In this vital role you will responsible for the reliability, stability, performance, scalability, and security of platforms that support Amgen's digital products and engineering teams. This hands-on role focuses on supporting cloud-based infrastructure, automating operations, maintaining observability, and improving platform reliability through code.
You'll work closely with senior engineers and cross-functional teams to support CI/CD workflows, container platforms, incident response, and enterprise tooling-all while adopting modern SRE principles and practices.
This role is ideal for engineers who have foundational site reliability experience and are looking to expand their skills in a cloud-native, enterprise-scale environment.
**Roles & Responsibilities:**
**Infrastructure & Platform Support**
+ Provision and manage cloud infrastructure using Infrastructure as Code (IaC)
+ Support container orchestration platforms, ensuring availability, access control, and resource management
+ Assist in configuring and maintaining CI/CD pipelines and environments
**Monitoring & Incident Response**
+ Set up and maintain observability tools to track system health and performance
+ Participate in alert tuning, incident resolution, and root cause analysis
+ Support integration of observability platforms with incident response workflows
**Automation & Platform Operations**
+ Automate routine platform tasks such as provisioning, patching, and configuration
+ Write scripts to improve platform reliability, reduce manual work, and enforce compliance
+ Participate in platform upgrades, maintenance windows, and service validation efforts
**AI Enablement & Intelligence**
+ Support the adoption of AI-assisted operational tools for log analysis, anomaly detection, and predictive alerts
+ Collaborate with senior engineers to evaluate AI/ML-based observability and automation platforms
+ Assist in integrating AI-driven insights into dashboards, alerts, or incident workflows
+ Stay current with emerging AI trends in infrastructure and site reliability, and contribute to tool evaluations and pilots
**Collaboration & Enablement**
+ Work with development, QA, and security teams to ensure reliable and secure deployments
+ Document operational procedures, playbooks, and system runbooks
+ Learn and support enterprise collaboration platforms and internal tooling
+ Participate in Agile and SAFe delivery processes-including sprint planning, stand-ups, retrospectives, and PI planning-to ensure security and platform reliability are embedded across development cycles.
**What we expect of you**
We are all different, yet we all use our unique contributions to serve patients. The (vital attribute) professional we seek is a (type of person) with these qualifications.
**Basic Qualifications:**
+ Master's degree / Bachelor's degree and 5 to 9 years in Computer Science, IT or related field
+ 4 years of hands-on related experience in site reliability, DevOps, or platform engineering roles
+ Hands-on experience with cloud platforms preferably AWS
+ Familiarity with Kubernetes or container orchestration technologies
+ Exposure to CI/CD practices and pipeline automation
+ Experience troubleshooting Linux systems, processes, and services
**Preferred Qualifications:**
**Must-Have Skills:**
+ Practical experience with **cloud platforms** (e.g., AWS, Azure, or GCP), including compute, networking, IAM, and storage services
+ Familiarity with **container orchestration platforms** (e.g., Kubernetes, Docker), including basic workload deployment and troubleshooting
+ Experience using **Infrastructure as Code (IaC)** tools such as **Terraform** or **CloudFormation**
+ Working knowledge of **Linux administration** , including system services, package management, and file system structures
+ Hands-on exposure to **CI/CD platforms** (e.g., GitLab CI, Jenkins, GitHub Actions) and pipeline troubleshooting
+ Proficiency in **scripting or automation languages** like **Python** , **Bash** , or **Go**
+ Exposure to **observability tooling** (e.g., **Dynatrace** , **Prometheus** , or **Grafana** ) for monitoring and alerting
+ Familiarity with **incident management practices** and tools (e.g., runbooks, escalation workflows, basic alert tuning)
+ Version control skills using **Git** and understanding of branching strategies
+ Experience supporting or integrating **enterprise collaboration platforms** (e.g., Jira, Confluence, ServiceNow)
+ Interest and basic understanding of **AI/ML tools** used in infrastructure and operations (e.g., anomaly detection, intelligent alerting, log analysis)
**Good-to-Have Skills:**
+ Experience using Infrastructure as Code (IaC) tools like Terraform or CloudFormation
+ Familiarity with IT incident response workflows and ticketing platforms
+ Knowledge of secrets management, configuration management tools (e.g., Ansible), or logging frameworks
+ Exposure to **AI-assisted tooling** (e.g., AIOps platforms, AI-enhanced alerting, anomaly detection)
**Professional Certifications (Preferred)**
+ Cloud DevOps Certification (AWS/Azure/GCP)
+ Certified Kubernetes Administrator (CKA) or Security Specialist (CKS)
+ CI/CD Platform Certification
+ ITIL Foundation or equivalent service management certification
**Soft Skills:**
+ Strong analytical and troubleshooting skills
+ Collaborative and proactive mindset
+ Effective communication and documentation practices
+ Curiosity and willingness to adopt new tools and methods, including AI integrations
+ Ability to manage time and prioritize tasks in dynamic environments
**Shift Information:** This position is an onsite role and may require working during later hours to align with business hours. Candidates must be willing and able to work outside of standard hours as required to meet business needs.
**What you can expect of us**
As we work to develop treatments that take care of others, we also work to care for your professional and personal growth and well-being. From our competitive benefits to our collaborative culture, we'll support your journey every step of the way.
In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards.
**Apply now and make a lasting impact with the Amgen team.**
**careers.amgen.com**
As an organization dedicated to improving the quality of life for people around the world, Amgen fosters an inclusive environment of diverse, ethical, committed and highly accomplished people who respect each other and live the Amgen values to continue advancing science to serve patients. Together, we compete in the fight against serious disease.
Amgen is an Equal Opportunity employer and will consider all qualified applicants for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, or any other basis protected by applicable law.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer

Chennai, Tamil Nadu ADP

Posted 4 days ago

Job Viewed

Tap Again To Close

Job Description

**_ADP is hiring Site Reliability Engineer_**
_In ADP, we're building the next generation of technologies. Our mission is simple: Create powerful solutions that are efficient, intuitive, beautiful, and responsive. As a Site Reliability Engineer, you are responsible for availability, performance, efficiency, change management, monitoring, emergency response, and capacity planning. He or She will be responsible to deliver automations which makes the MNC systems and platforms more reliable and efficient resulting in the Improved Client Experience._
**_What you'll do:_**
+ Engage in and improve the whole lifecycle of services-from inception and design, through deployment, operation and refinement.
+ Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
+ Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
+ May work together with other staff engaged in similar functions.
+ Building software to improve DevOps, ITOps, and support processes.
**_Qualifications you'll need:_**
Education: Bachelor's degree (Mandatory) preferably in Computer Science or Information Technology
Experience:
+ Overall 3+ years in Devops.
+ Experience working under a Scrum methodology.
+ Ability to analyze and resolve problems through effective customer interface and communication.
+ Ability to prioritize workload.
+ Deep knowledge of version control.
+ CI/CD implementation expertise.
+ Good knowledge on cloud native applications (AWS).
+ Experience on infrastructure as a code (preferable CloudFormation and/or Ansible/ Terraform).
+ Familiar with programming languages like Phyton and PowerShell.
+ Windows technologies, Networking and Security knowledge.
**A little about ADP:** We are a comprehensive global provider of cloud-based human capital management (HCM) solutions that unite HR, payroll, talent, time, tax and benefits administration and a leader in business outsourcing services, analytics, and compliance expertise. We believe our people make all the difference in cultivating a down-to-earth culture that embraces our core values, welcomes ideas, encourages innovation, and values belonging. We've received recognition for our work by many esteemed organizations, learn more at ADP Awards and Recognition ( .
**Diversity, Equity, Inclusion & Equal Employment Opportunity at ADP:** ADP is committed to an inclusive, diverse and equitable workplace, and is further committed to providing equal employment opportunities regardless of any protected characteristic including: race, color, genetic information, creed, national origin, religion, sex, affectional or sexual orientation, gender identity or expression, lawful alien status, ancestry, age, marital status, protected veteran status or disability. Hiring decisions are based upon ADP's operating needs, and applicant merit including, but not limited to, qualifications, experience, ability, availability, cooperation, and job performance.
**Ethics at ADP:** ADP has a long, proud history of conducting business with the highest ethical standards and full compliance with all applicable laws. We also expect our people to uphold our values with the highest level of integrity and behave in a manner that fosters an honest and respectful workplace. Click to learn more about ADP's culture and our full set of values.
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer

Bengaluru, Karnataka HDFC Limited

Posted today

Job Viewed

Tap Again To Close

Job Description

Hiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore Location

Experience - 8 - 14 Years

Job Purpose

  • Analysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance.


Job Responsibilities:


  • Help build a Site Reliability Engineering culture by sharing the best practices, approaches, documentation, and code with other engineering teams
  • Apply automation and software to any tasks or parts of the system which are performed manually
  • Able to troubleshoot complicated, cross platform issues handling OS, Networking, Database in a cloud-based SaaS environment and handle live production incidents
  • Monitor application performance take steps to improve overall application performance and stability and follow through with implementation



Key Skills:

  • Experience in monitoring and analyzing infrastructure performance using standard performance monitoring tools
  • Demonstrable experience in Containerization-Docker and orchestration (Kubernetes)
  • Experience with Infrastructure As Code (Terraform, Cloud Formation, Ansible)
  • Knowledge and proven hands-on experience in large-scale databases and distributed technologies, such as Kafka and Confluent Platform Kafka
  • Basic programming and scripting skills
This advertiser has chosen not to accept applicants from your region.
 

Nearby Locations

Other Jobs Near Me

Industry

  1. request_quote Accounting
  2. work Administrative
  3. eco Agriculture Forestry
  4. smart_toy AI & Emerging Technologies
  5. school Apprenticeships & Trainee
  6. apartment Architecture
  7. palette Arts & Entertainment
  8. directions_car Automotive
  9. flight_takeoff Aviation
  10. account_balance Banking & Finance
  11. local_florist Beauty & Wellness
  12. restaurant Catering
  13. volunteer_activism Charity & Voluntary
  14. science Chemical Engineering
  15. child_friendly Childcare
  16. foundation Civil Engineering
  17. clean_hands Cleaning & Sanitation
  18. diversity_3 Community & Social Care
  19. construction Construction
  20. brush Creative & Digital
  21. currency_bitcoin Crypto & Blockchain
  22. support_agent Customer Service & Helpdesk
  23. medical_services Dental
  24. medical_services Driving & Transport
  25. medical_services E Commerce & Social Media
  26. school Education & Teaching
  27. electrical_services Electrical Engineering
  28. bolt Energy
  29. local_mall Fmcg
  30. gavel Government & Non Profit
  31. emoji_events Graduate
  32. health_and_safety Healthcare
  33. beach_access Hospitality & Tourism
  34. groups Human Resources
  35. precision_manufacturing Industrial Engineering
  36. security Information Security
  37. handyman Installation & Maintenance
  38. policy Insurance
  39. code IT & Software
  40. gavel Legal
  41. sports_soccer Leisure & Sports
  42. inventory_2 Logistics & Warehousing
  43. supervisor_account Management
  44. supervisor_account Management Consultancy
  45. supervisor_account Manufacturing & Production
  46. campaign Marketing
  47. build Mechanical Engineering
  48. perm_media Media & PR
  49. local_hospital Medical
  50. local_hospital Military & Public Safety
  51. local_hospital Mining
  52. medical_services Nursing
  53. local_gas_station Oil & Gas
  54. biotech Pharmaceutical
  55. checklist_rtl Project Management
  56. shopping_bag Purchasing
  57. home_work Real Estate
  58. person_search Recruitment Consultancy
  59. store Retail
  60. point_of_sale Sales
  61. science Scientific Research & Development
  62. wifi Telecoms
  63. psychology Therapy
  64. pets Veterinary
View All Site Reliability Engineer Jobs