590 Software Reliability jobs in Hyderabad
Site Reliability Engineering Lead
Posted today
Job Description
Role and Responsibilities
Skills and Experience
Manager Data Reliability Engineering
Posted today
Job Description
Responsibilities:
Skills:
Experience and Qualifications:
Site Reliability Engineering Advisor
Posted today
Job Description
This is a critical Site Reliability Engineering Advisor position on the FedEx Office Omni Channel team, supporting the current and future vision of FedEx Retail online services. If the position is not filled, the execution of our vision of framework stability and resilience, as well as the print omni-channel runway, will be at risk.
JOB SPECIFIC INFORMATION
Skills/Knowledge Considered a Plus:
Knowledge of the API First Philosophy
Education: Bachelor's degree or equivalent in Computer Science, Electrical / Electronics Engineering, MIS or related discipline
Experience: Five (5) years of work experience in managing reliability and performance of infrastructure, applications and systems at scale.
Knowledge, Skills and Abilities
• Fluency in English
• Problem Solving Skills
• Communication
• Collaboration
• Adaptability
• Ability to operate in a 24x7 environment encompassing global timezones
Preferred Qualifications:
Pay Transparency:
Pay:
Additional Details:
FedEx was built on a philosophy that puts people first, one we take seriously. We are an equal opportunity/affirmative action employer and we are committed to a diverse, equitable, and inclusive workforce in which we enforce fair treatment, and provide growth opportunities for everyone.
All qualified applicants will receive consideration for employment regardless of age, race, color, national origin, genetics, religion, gender, marital status, pregnancy (including childbirth or a related medical condition), physical or mental disability, or any other characteristic protected by applicable laws, regulations, and ordinances.
Our Company
FedEx is one of the world's largest express transportation companies and has consistently been selected as one of the top 10 World’s Most Admired Companies by "Fortune" magazine. Every day FedEx delivers for its customers with transportation and business solutions, serving more than 220 countries and territories around the globe. We can serve this global network due to our outstanding team of FedEx team members, who are tasked with making every FedEx experience outstanding.
Our Philosophy
The People-Service-Profit philosophy (P-S-P) describes the principles that govern every FedEx decision, policy, or activity. FedEx takes care of our people; they, in turn, deliver the impeccable service demanded by our customers, who reward us with the profitability necessary to secure our future. The essential element in making the People-Service-Profit philosophy such a positive force for the company is where we close the circle, and return these profits back into the business, and invest back in our people. Our success in the industry is attributed to our people. Through our P-S-P philosophy, we have a work environment that encourages team members to be innovative in delivering the highest possible quality of service to our customers. We care for their well-being, and value their contributions to the company.
Our Culture
Our culture is important for many reasons, and we intentionally bring it to life through our behaviors, actions, and activities in every part of the world. The FedEx culture and values have been a cornerstone of our success and growth since we began in the early 1970s. While other companies can copy our systems, infrastructure, and processes, our culture makes us unique and is often a differentiating factor as we compete and grow in today’s global marketplace.
Software Engineer (Site Reliability)

Posted 17 days ago
Job Description
HYDERABAD OFFICE INDIA
Job Description
We are seeking a motivated Software/Platform Engineer with experience in Databricks observability to join our dynamic team in D&A Platforms SRE. The ideal candidate will play a role in maintaining the reliability, availability, and performance of our data infrastructure and applications, using Databricks to ensure flawless operations and efficient performance. You will collaborate closely with development, operations, and data teams to implement best practices in observability and monitoring, enabling a proactive approach to incident management and system optimization.
Key Responsibilities
Reliability and Performance:
+ Design, implement, and maintain scalable and reliable systems and services.
+ Monitor system performance, availability, and reliability, proactively identifying and resolving issues.
Observability Implementation:
+ Apply Databricks observability tools to develop and maintain dashboards, alerts, and reporting mechanisms that provide insights into system performance and usage.
+ Establish and improve observability frameworks to monitor key performance indicators (KPIs) and service-level objectives (SLOs).
Incident Management:
+ Respond to and fix production incidents, performing root cause analysis and implementing corrective actions to prevent future occurrences.
+ Collaborate with multi-functional teams to ensure effective incident response processes and documentation.
Automation and Efficiency:
+ Develop automation scripts and tools to streamline operational tasks, improve deployment processes, and enhance system reliability.
+ Contribute to the continuous improvement of deployment pipelines and infrastructure as code (IaC) practices.
Collaboration and Documentation:
+ Work closely with development teams to understand application architectures and contribute to system design discussions.
+ Document processes, best practices, and system architecture to facilitate knowledge sharing and onboarding.
Performance Optimization:
+ Analyze system performance and application usage patterns to recommend and implement optimizations that improve efficiency and reduce costs.
Job Qualifications
+ Education:
+ Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
+ Experience:
+ 3 to 6 years of experience in Platform Engineering, DevOps, or a related field.
+ Experience with Databricks, including its observability and monitoring features.
+ Experience with the Grafana observability platform
+ Familiarity with cloud platforms (Azure)
+ Technical Skills:
+ Programming language skills: Python, Scala.
+ SQL knowledge for data extraction and transformation
+ Experience in Power BI development, both semantic models & visualizations
+ Experience in Grafana visualizations
+ Soft Skills:
+ Problem-solving skills and the ability to work in a fast-paced, collaborative environment.
+ Good communication skills, with the ability to convey sophisticated technical concepts to non-technical collaborators.
+ A proactive attitude with a focus on continuous improvement and learning.
+ Open to exploring and experimenting with new SRE processes and tools to support the technical requirements of the D&A platform.
+ Willing to proactively seek new opportunities to learn and adopt new knowledge into practice.
About us
We produce globally recognized brands and we grow the best business leaders in the industry. With a portfolio of trusted brands as diverse as ours, it is paramount our leaders are able to lead with courage the vast array of brands, categories and functions. We serve consumers around the world with one of the strongest portfolios of trusted, quality, leadership brands, including Always®, Ariel®, Gillette®, Head & Shoulders®, Herbal Essences®, Oral-B®, Pampers®, Pantene®, Tampax® and more. Our community includes operations in approximately 70 countries worldwide. Visit to know more.
We are an equal opportunity employer and value diversity at our company. We do not discriminate against individuals on the basis of race, color, gender, age, national origin, religion, sexual orientation, gender identity or expression, marital status, citizenship, disability, HIV/AIDS status, or any other legally protected factor.
"At P&G, the hiring journey is personalized every step of the way, thereby ensuring equal opportunities for all, with a strong foundation of Ethics & Corporate Responsibility guiding everything we do.
All the available job opportunities are posted either on our website - pgcareers.com, or on our official social media pages, for the convenience of prospective candidates, and do not require them to pay any kind of fees towards their application."
Job Schedule
Full time
Job Number
R
Job Segmentation
Experienced Professionals (Job Segmentation)
Software Engineer (Site Reliability)
Posted today
Job Description
Description
We are seeking a motivated Software/Platform Engineer with experience in Databricks observability to join our dynamic team in D&A Platforms SRE. The ideal candidate will play a role in maintaining the reliability, availability, and performance of our data infrastructure and applications, using Databricks to ensure flawless operations and efficient performance. You will collaborate closely with development, operations, and data teams to implement best practices in observability and monitoring, enabling a proactive approach to incident management and system optimization.
Key Responsibilities
Reliability and Performance:
Observability Implementation:
Incident Management:
Automation and Efficiency:
Collaboration and Documentation:
Performance Optimization:
Job Qualifications
Job Schedule
Full time
Job Number
R
Job Segmentation
Experienced Professionals (Job Segmentation)
Azure Data Engineers - Site Reliability Engineering
Posted today
Job Description
Role and Responsibilities
Skills and Experience
Software Engineering
Posted 9 days ago
Job Description
SharePoint helps millions of people work better together and empowers the biggest companies in the world to solve mission critical problems. We create global scale services to store, secure and manage some of the most sensitive data on the planet.
We have fantastic opportunities and are on the front-line of making many of our next generation architecture investments to deliver multi-geo content store, amazing performance/scale/reliability, and security capabilities using scalable cloud distributed systems.
Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
**Responsibilities**
Towards this vision, we are seeking a solid and highly motivated **Software Engineer** to disrupt and build the next generation of products and take them to the next level:
+ Driving complex problem solving and the design and development of software, and ensuring its quality.
+ Defining new components with a complete understanding of service interdependencies and limitations.
+ Possessing knowledge of, and curiosity to learn more about, performance, scalability, enterprise system architecture, and engineering best practices.
+ Creating prototypes and proofs of concept for iterative development.
+ Working effectively with product development and engineering teams.
+ Being self-driven, curious to learn, proactive, and result-oriented.
Join a team of builders and innovators that think outside the box. A team that's committed to a low operational burden by designing for it. A team that puts work-life balance, personal and professional growth as a principle, not just a goal. If you enjoy working in a dynamic environment to deliver world class mission critical systems, this may be the career opportunity for you!
**Qualifications**
**Required Qualifications:**
+ Bachelor's/Master's Degree in Computer Science OR related technical field AND 2+ year(s) of technical engineering experience with coding in languages, **preferably C#**, including but not limited to C, C++, Java, JavaScript, Python, etc.
+ Strong CS fundamentals and exceptional coding skills.
+ Good communication and cross group collaboration skills.
+ Experience in Azure, Exchange, or other cloud and distributed systems is a big plus.
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Software Engineering Lead
Posted 2 days ago
Job Description
**Primary Responsibilities:**
+ Working with the rest of the team to deploy, maintain, and run a highly available, multi-tenant distributed system
+ Automating both the infrastructure creation and the application deployment to that environment
+ Supporting Linux systems internals and administration
+ Building and maintaining different flavors of Kubernetes clusters on-prem and in the public cloud
+ Support application teams to address the issues faced
+ Updating, enhancing, and creating tools using the Python or Golang programming languages
+ Using modern AI tools and agents to create new tools to enhance our products and services
+ Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regard to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so
**Required Qualifications:**
+ Full time graduation degree
+ Experience with or understanding of life cycle management of Linux servers, including server provisioning, build, Kickstart, decommissioning, and configuring servers using configuration management tools such as Ansible, Chef, or Terraform
+ Experience in supporting On-call and addressing war rooms, P1 and P2 tickets
+ Working experience on applying SRE concepts
+ Working experience with the use of AI tools and agents
+ Experience with scaling, monitoring, and troubleshooting actively running systems
+ Understanding of the DevOps model (end to end) and experience in automating the software development, test, or deployment lifecycle with continuous integration and continuous deployment
+ Working knowledge of managing different flavors of Kubernetes clusters on-prem and in public clouds (AWS, Azure, and GCP)
+ Expert in working with Docker and Kubernetes
+ Expert in tools such as Monit, ELK, Splunk, Prometheus, Grafana etc.
+ Expert in scripting tools like shell scripting
+ Expertise in leveraging Open Source Software technologies to solve business problems
+ Proficient in using Python or Golang
+ Proven skills to recommend solutions to host an application or designing the application hosting strategy
+ Proven development skills in building auto-healing for system alerts
+ Demonstrated ability to address different customers over email or chat-ops and in one-on-one meetings or calls
+ Proven solid cross-organizational collaboration skills
+ Proven solid hands-on experience with DevOps tool chains and practices
_At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone - of every race, gender, sexuality, age, location and income - deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission._