9,468 DevOps Engineer jobs in India
Senior DevOps Engineer
Posted 1 day ago
Job Description
Glowingbud is a rapidly growing eSIM services platform that simplifies connectivity with powerful APIs, robust B2B and B2C interfaces, and seamless integrations with Telna. Our platform enables global eSIM lifecycle management, user onboarding, secure payment systems, and scalable deployments. Recently acquired by Telna, we are expanding our product offerings and team to meet increasing demand and innovation goals.
Job Summary
We are seeking a highly experienced Senior DevOps Engineer with 10+ years of expertise in cloud infrastructure, automation, and system reliability. The ideal candidate will be responsible for maintaining scalable AWS-based environments, implementing robust CI/CD pipelines, optimizing system performance, and ensuring high availability of critical applications. This role requires deep expertise in Docker, Kubernetes, Infrastructure as Code (IaC), and system monitoring. The candidate will also be responsible for documenting system architecture, setting SLAs, and leading DevOps best practices across teams. If you thrive in a fast-paced, collaborative environment and are passionate about DevOps, we'd love to hear from you!
Key Responsibilities:
- Infrastructure Management: Design, implement, and maintain scalable cloud infrastructure using AWS services.
- System Documentation & Diagrams: Maintain up-to-date system diagrams, architecture documentation, and operational procedures.
- Containerization & Orchestration: Deploy and manage containerized applications using Docker and Kubernetes.
- System Maintenance & Optimization: Ensure high availability, performance tuning, and cost optimization of cloud and on-premises infrastructure.
- Monitoring & Observability: Implement detailed system monitoring, logging, and alerting using tools like Datadog, Prometheus, Grafana, ELK stack, or AWS CloudWatch.
- Security & Compliance: Enforce security best practices, conduct regular audits, and ensure adherence to compliance standards.
- CI/CD Pipeline Management: Build and maintain automated deployment pipelines for seamless application releases.
- Incident Response & SLA Management: Define SLAs, monitor system performance, and establish an efficient incident response strategy.
- Collaboration & Leadership: Work closely with development, QA, and operations teams to improve reliability, scalability, and efficiency.
Qualifications:
- 7+ years of experience in DevOps, Site Reliability Engineering (SRE), or Cloud Infrastructure roles.
- Expert knowledge of AWS services and related tooling (EC2, ECS, S3, RDS, MongoDB Atlas, Lambda, VPC, ALB, Gateway, Cognito, WAF, IAM, Amplify, CloudFormation, etc.).
- Strong experience with Docker & Kubernetes for container orchestration and management.
- Hands-on experience with infrastructure as code (IaC) tools like Terraform, CloudFormation, or Pulumi.
- Expertise in system monitoring and logging tools (Prometheus, Grafana, ELK Stack, Datadog, AWS CloudWatch).
- Proficiency in scripting languages (Bash, Python, or Go) for automation and infrastructure management.
- Experience with CI/CD pipelines using Jenkins, AWS CodePipeline, or GitHub Actions.
- Knowledge of networking, security best practices, and system performance tuning.
- Experience with setting and enforcing SLAs for DevOps teams.
- Strong problem-solving skills and ability to work in a fast-paced environment.
Preferred Skills:
- Thorough experience with AWS infrastructure.
- Knowledge of serverless architectures and event-driven computing.
- Experience with configuration management tools (Ansible, Chef, Puppet).
- Background in database administration (PostgreSQL, MySQL, or NoSQL databases).
Senior DevOps Engineer
Posted today
Job Description
99.999% availability for 25+ services across 200+ servers.
Summary
We are building the fastest, most reliable & intelligent trading platform. That requires highly available, scalable & performant systems. And you will be playing one of the most crucial roles in making this happen.
You will be leading our efforts in designing, automating, deploying, scaling and monitoring all our core products.
Tech Facts so Far
1. 8+ services deployed on 50+ servers
2. 35K+ concurrent users on average
3. 1M+ algorithms run every min
4. 100M+ messages/min
5. We are a 4-member backend team with 1 DevOps Engineer. Yes! This is all done by this incredible lean team.
Big Challenges for You
1. Manage 25+ services on 200+ servers
2. Achieve 99.999% (5 Nines) availability
3. Make 1-minute automated deployments possible
If you like to work on extreme scale, complexity & availability, then you will love it here.
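As a rough illustration of what the 99.999% target above implies, the following Python sketch converts an availability percentage into a downtime budget. The function name `downtime_budget_seconds` and the period lengths (1, 30, and 365 days) are assumptions made for this example and are not specified by the posting.

```python
# Downtime budget implied by an availability target (illustrative arithmetic only).
SECONDS_PER_DAY = 24 * 60 * 60


def downtime_budget_seconds(availability_percent: float, period_days: float) -> float:
    """Return the maximum downtime, in seconds, allowed over the given period."""
    unavailable_fraction = 1.0 - availability_percent / 100.0
    return unavailable_fraction * period_days * SECONDS_PER_DAY


if __name__ == "__main__":
    # Hypothetical reporting periods; pick whatever window the SLA actually uses.
    for label, days in (("day", 1), ("30-day month", 30), ("365-day year", 365)):
        budget = downtime_budget_seconds(99.999, days)
        print(f"99.999% over one {label}: {budget:.1f} s of downtime allowed")
```

Under these assumptions, five nines leaves about 315 seconds (roughly 5.26 minutes) of total downtime per 365-day year, which is why the 1-minute automated deployments listed above matter for staying within budget.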
Key Objectives for You
- Spearhead system & network architecture
- CI, CD & Automated Deployments
- Achieve 99.999% availability
- Ensure in-depth & real-time monitoring, alerting & analytics
- Enable faster root cause analysis with improved visibility
- Ensure high level of security
Possible Growth Paths for You
- Be our Lead DevOps Engineer
- Be a Performance & Security Expert
Perks
- Challenges that will push you beyond your limits
- A democratic place where everyone is heard & aware
- No hierarchy, politics, bosses, managers or anything like that
- And most importantly, Happy Vibes!