740 Production Operations jobs in India
Production Operations Engineer
Posted 5 days ago
Job Description
Key Responsibilities
● Monitor production systems and job pipelines; respond promptly to alerts and anomalies
● Troubleshoot operational issues in collaboration with the development team
● Investigate incidents using logs, metrics, and observability tools (e.g., Grafana, Kibana)
● Perform recovery actions such as restarting pods, rerunning jobs, or applying known mitigations
● Operate in Kubernetes environments to inspect, debug, and manage components
● Support deployment activities through post-release validations and basic checks
● Validate data quality and flag anomalies to the relevant engineering teams
● Maintain clear documentation of incidents, actions taken, and resolution outcomes
● Communicate effectively with remote teams for operational handoffs and follow-ups
Required Qualifications
● Experience in production operations, system support, or DevOps roles
● Solid Linux skills (e.g., file system navigation, log analysis, process/network troubleshooting)
● Hands-on experience with Kubernetes and Docker in production environments
● Familiarity with observability tools (e.g., Grafana, Kibana, Prometheus)
● English proficiency for reading, writing, and asynchronous communication
● Strong execution discipline and ability to follow structured operational procedures
Preferred Qualifications
● Scripting ability (Python or Shell) for log parsing and automation
● Basic SQL skills for data verification or debugging
● Experience with Hadoop and Flink pipelines for batch and stream processing is a strong plus
● Experience with large-scale distributed data systems or job scheduling frameworks