179 Tcs jobs in New Delhi

TCS Hiring for Observability Tools Tech Lead_PAN India

Noida, Uttar Pradesh Tata Consultancy Services

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

TCS Hiring for Observability Tools Tech Lead_PAN India

Experience: 8 to 12 Years Only

Job Location: PAN India


TCS Hiring for Observability Tools Tech Lead_PAN India


Required Technical Skill Set:


Core Responsibilities:

  1. Designing and Implementing Observability Solutions:
  • This involves selecting, configuring, and deploying tools and platforms for collecting, processing, and analyzing telemetry data (logs, metrics, traces).
  1. Developing and Maintaining Monitoring and Alerting Systems:
  • Creating dashboards, setting up alerts based on key performance indicators (KPIs), and ensuring timely notification of issues.
  1. Instrumenting Applications and Infrastructure:
  • Working with development teams to add instrumentation code to applications to generate meaningful telemetry data. This often involves using open standards like Open Telemetry.
  1. Analyzing and Troubleshooting System Performance:
  • Investigating performance bottlenecks, identifying root causes of issues, and collaborating with development teams to resolve them.
  1. Defining and Tracking Service Level Objectives (SLOs) and Service Level Indicators (SLIs):
  • Working with stakeholders to define acceptable levels of performance and reliability and tracking these metrics.
  1. Improving Incident Response and Post-Mortem Processes:
  • Using observability data to understand incidents, identify contributing factors, and implement preventative measures.
  1. Collaborating with Development, Operations, and SRE Teams:
  • Working closely with other teams to ensure observability practices are integrated throughout the software development lifecycle.
  1. Educating and Mentoring Teams on Observability Best Practices:
  • Promoting a culture of observability within the organization.
  1. Managing and Optimizing Observability Infrastructure Costs:
  • Ensuring the cost-effectiveness of observability tools and platforms.
  1. Staying Up to Date with Observability Trends and Technologies:
  • Continuously learning about new tools, techniques, and best practices.




Key Skills:

  1. Strong Understanding of Observability Principles:
  • Deep knowledge of logs, metrics, and traces and how they contribute to understanding system behavior.
  1. Proficiency with Observability Tools and Platforms:
  • Experience with tools like:
  • Logging: Elasticsearch, Splunk, Fluentd, Logstash, etc.,
  • Metrics: Prometheus, Grafana, InfluxDB, Graphite, etc.,
  • Tracing: OpenTelemetry, DataDog APM, etc.,
  • APM (Application Performance Monitoring): DataDog, New Relic, AppDynamics, etc,
  1. Programming and Scripting Skills:
  • Proficiency in languages like Python, Go, Java, or scripting languages like Bash for automation and tool integration.
  1. Experience with Cloud Platforms:
  • Familiarity with cloud providers like AWS, Azure, or GCP and their monitoring and logging services.
  1. Understanding of Distributed Systems:
  • Knowledge of how distributed systems work and the challenges of monitoring and troubleshooting them.
  1. Troubleshooting and Problem-Solving Skills:
  • Strong analytical skills to identify and resolve complex issues.
  1. Communication and Collaboration Skills:
  • Ability to effectively communicate technical concepts to different audiences and work collaboratively with other teams.
  1. Knowledge of DevOps and SRE Practices:
  • Understanding of continuous integration/continuous delivery (CI/CD), infrastructure as code, and site reliability engineering principles.
  1. Data Analysis and Visualization Skills:
  • Ability to analyze telemetry data and create meaningful dashboards and reports.
  1. Experience with Containerization and Orchestration:
  • Familiarity with Docker, Kubernetes, and related technologies.


Kind Regards,

Priyankha M

This advertiser has chosen not to accept applicants from your region.

TCS Hiring for Observability Tools Tech Lead_PAN India

Noida, Uttar Pradesh Tata Consultancy Services

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

TCS Hiring for Observability Tools Tech Lead_PAN India
Experience: 8 to 12 Years Only
Job Location: PAN India

TCS Hiring for Observability Tools Tech Lead_PAN India

Required Technical Skill Set:

Core Responsibilities:
Designing and Implementing Observability Solutions:
This involves selecting, configuring, and deploying tools and platforms for collecting, processing, and analyzing telemetry data (logs, metrics, traces).
Developing and Maintaining Monitoring and Alerting Systems:
Creating dashboards, setting up alerts based on key performance indicators (KPIs), and ensuring timely notification of issues.
Instrumenting Applications and Infrastructure:
Working with development teams to add instrumentation code to applications to generate meaningful telemetry data. This often involves using open standards like Open Telemetry.
Analyzing and Troubleshooting System Performance:
Investigating performance bottlenecks, identifying root causes of issues, and collaborating with development teams to resolve them.
Defining and Tracking Service Level Objectives (SLOs) and Service Level Indicators (SLIs):
Working with stakeholders to define acceptable levels of performance and reliability and tracking these metrics.
Improving Incident Response and Post-Mortem Processes:
Using observability data to understand incidents, identify contributing factors, and implement preventative measures.
Collaborating with Development, Operations, and SRE Teams:
Working closely with other teams to ensure observability practices are integrated throughout the software development lifecycle.
Educating and Mentoring Teams on Observability Best Practices:
Promoting a culture of observability within the organization.
Managing and Optimizing Observability Infrastructure Costs:
Ensuring the cost-effectiveness of observability tools and platforms.
Staying Up to Date with Observability Trends and Technologies:
Continuously learning about new tools, techniques, and best practices.

Key Skills:
Strong Understanding of Observability Principles:
Deep knowledge of logs, metrics, and traces and how they contribute to understanding system behavior.
Proficiency with Observability Tools and Platforms:
Experience with tools like:
Logging: Elasticsearch, Splunk, Fluentd, Logstash, etc.,
Metrics: Prometheus, Grafana, InfluxDB, Graphite, etc.,
Tracing: OpenTelemetry, DataDog APM, etc.,
APM (Application Performance Monitoring): DataDog, New Relic, AppDynamics, etc,
Programming and Scripting Skills:
Proficiency in languages like Python, Go, Java, or scripting languages like Bash for automation and tool integration.
Experience with Cloud Platforms:
Familiarity with cloud providers like AWS, Azure, or GCP and their monitoring and logging services.
Understanding of Distributed Systems:
Knowledge of how distributed systems work and the challenges of monitoring and troubleshooting them.
Troubleshooting and Problem-Solving Skills:
Strong analytical skills to identify and resolve complex issues.
Communication and Collaboration Skills:
Ability to effectively communicate technical concepts to different audiences and work collaboratively with other teams.
Knowledge of DevOps and SRE Practices:
Understanding of continuous integration/continuous delivery (CI/CD), infrastructure as code, and site reliability engineering principles.
Data Analysis and Visualization Skills:
Ability to analyze telemetry data and create meaningful dashboards and reports.
Experience with Containerization and Orchestration:
Familiarity with Docker, Kubernetes, and related technologies.

Kind Regards,
Priyankha M
This advertiser has chosen not to accept applicants from your region.

TCS Hiring for Observability Tools Tech Lead_PAN India

Delhi, Delhi Tata Consultancy Services

Posted 7 days ago

Job Viewed

Tap Again To Close

Job Description

TCS Hiring for Observability Tools Tech Lead_PAN India

Experience: 8 to 12 Years Only

Job Location: PAN India


TCS Hiring for Observability Tools Tech Lead_PAN India


Required Technical Skill Set:


Core Responsibilities:

  • Designing and Implementing Observability Solutions:
  • This involves selecting, configuring, and deploying tools and platforms for collecting, processing, and analyzing telemetry data (logs, metrics, traces).
  • Developing and Maintaining Monitoring and Alerting Systems:
  • Creating dashboards, setting up alerts based on key performance indicators (KPIs), and ensuring timely notification of issues.
  • Instrumenting Applications and Infrastructure:
  • Working with development teams to add instrumentation code to applications to generate meaningful telemetry data. This often involves using open standards like Open Telemetry.
  • Analyzing and Troubleshooting System Performance:
  • Investigating performance bottlenecks, identifying root causes of issues, and collaborating with development teams to resolve them.
  • Defining and Tracking Service Level Objectives (SLOs) and Service Level Indicators (SLIs):
  • Working with stakeholders to define acceptable levels of performance and reliability and tracking these metrics.
  • Improving Incident Response and Post-Mortem Processes:
  • Using observability data to understand incidents, identify contributing factors, and implement preventative measures.
  • Collaborating with Development, Operations, and SRE Teams:
  • Working closely with other teams to ensure observability practices are integrated throughout the software development lifecycle.
  • Educating and Mentoring Teams on Observability Best Practices:
  • Promoting a culture of observability within the organization.
  • Managing and Optimizing Observability Infrastructure Costs:
  • Ensuring the cost-effectiveness of observability tools and platforms.
  • Staying Up to Date with Observability Trends and Technologies:
  • Continuously learning about new tools, techniques, and best practices.




Key Skills:

  • Strong Understanding of Observability Principles:
  • Deep knowledge of logs, metrics, and traces and how they contribute to understanding system behavior.
  • Proficiency with Observability Tools and Platforms:
  • Experience with tools like:
  • Logging: Elasticsearch, Splunk, Fluentd, Logstash, etc.,
  • Metrics: Prometheus, Grafana, InfluxDB, Graphite, etc.,
  • Tracing: OpenTelemetry, DataDog APM, etc.,
  • APM (Application Performance Monitoring): DataDog, New Relic, AppDynamics, etc,
  • Programming and Scripting Skills:
  • Proficiency in languages like Python, Go, Java, or scripting languages like Bash for automation and tool integration.
  • Experience with Cloud Platforms:
  • Familiarity with cloud providers like AWS, Azure, or GCP and their monitoring and logging services.
  • Understanding of Distributed Systems:
  • Knowledge of how distributed systems work and the challenges of monitoring and troubleshooting them.
  • Troubleshooting and Problem-Solving Skills:
  • Strong analytical skills to identify and resolve complex issues.
  • Communication and Collaboration Skills:
  • Ability to effectively communicate technical concepts to different audiences and work collaboratively with other teams.
  • Knowledge of DevOps and SRE Practices:
  • Understanding of continuous integration/continuous delivery (CI/CD), infrastructure as code, and site reliability engineering principles.
  • Data Analysis and Visualization Skills:
  • Ability to analyze telemetry data and create meaningful dashboards and reports.
  • Experience with Containerization and Orchestration:
  • Familiarity with Docker, Kubernetes, and related technologies.


Kind Regards,

Priyankha M

This advertiser has chosen not to accept applicants from your region.

TCS Hiring for Observability Tools Tech Lead_PAN India

Noida, Uttar Pradesh Tata Consultancy Services

Posted 7 days ago

Job Viewed

Tap Again To Close

Job Description

TCS Hiring for Observability Tools Tech Lead_PAN India

Experience: 8 to 12 Years Only

Job Location: PAN India


TCS Hiring for Observability Tools Tech Lead_PAN India


Required Technical Skill Set:


Core Responsibilities:

  • Designing and Implementing Observability Solutions:
  • This involves selecting, configuring, and deploying tools and platforms for collecting, processing, and analyzing telemetry data (logs, metrics, traces).
  • Developing and Maintaining Monitoring and Alerting Systems:
  • Creating dashboards, setting up alerts based on key performance indicators (KPIs), and ensuring timely notification of issues.
  • Instrumenting Applications and Infrastructure:
  • Working with development teams to add instrumentation code to applications to generate meaningful telemetry data. This often involves using open standards like Open Telemetry.
  • Analyzing and Troubleshooting System Performance:
  • Investigating performance bottlenecks, identifying root causes of issues, and collaborating with development teams to resolve them.
  • Defining and Tracking Service Level Objectives (SLOs) and Service Level Indicators (SLIs):
  • Working with stakeholders to define acceptable levels of performance and reliability and tracking these metrics.
  • Improving Incident Response and Post-Mortem Processes:
  • Using observability data to understand incidents, identify contributing factors, and implement preventative measures.
  • Collaborating with Development, Operations, and SRE Teams:
  • Working closely with other teams to ensure observability practices are integrated throughout the software development lifecycle.
  • Educating and Mentoring Teams on Observability Best Practices:
  • Promoting a culture of observability within the organization.
  • Managing and Optimizing Observability Infrastructure Costs:
  • Ensuring the cost-effectiveness of observability tools and platforms.
  • Staying Up to Date with Observability Trends and Technologies:
  • Continuously learning about new tools, techniques, and best practices.




Key Skills:

  • Strong Understanding of Observability Principles:
  • Deep knowledge of logs, metrics, and traces and how they contribute to understanding system behavior.
  • Proficiency with Observability Tools and Platforms:
  • Experience with tools like:
  • Logging: Elasticsearch, Splunk, Fluentd, Logstash, etc.,
  • Metrics: Prometheus, Grafana, InfluxDB, Graphite, etc.,
  • Tracing: OpenTelemetry, DataDog APM, etc.,
  • APM (Application Performance Monitoring): DataDog, New Relic, AppDynamics, etc,
  • Programming and Scripting Skills:
  • Proficiency in languages like Python, Go, Java, or scripting languages like Bash for automation and tool integration.
  • Experience with Cloud Platforms:
  • Familiarity with cloud providers like AWS, Azure, or GCP and their monitoring and logging services.
  • Understanding of Distributed Systems:
  • Knowledge of how distributed systems work and the challenges of monitoring and troubleshooting them.
  • Troubleshooting and Problem-Solving Skills:
  • Strong analytical skills to identify and resolve complex issues.
  • Communication and Collaboration Skills:
  • Ability to effectively communicate technical concepts to different audiences and work collaboratively with other teams.
  • Knowledge of DevOps and SRE Practices:
  • Understanding of continuous integration/continuous delivery (CI/CD), infrastructure as code, and site reliability engineering principles.
  • Data Analysis and Visualization Skills:
  • Ability to analyze telemetry data and create meaningful dashboards and reports.
  • Experience with Containerization and Orchestration:
  • Familiarity with Docker, Kubernetes, and related technologies.


Kind Regards,

Priyankha M

This advertiser has chosen not to accept applicants from your region.

TCS Hiring for Observability Tools Tech Lead_PAN India

Ghaziabad, Uttar Pradesh Tata Consultancy Services

Posted 7 days ago

Job Viewed

Tap Again To Close

Job Description

TCS Hiring for Observability Tools Tech Lead_PAN India

Experience: 8 to 12 Years Only

Job Location: PAN India


TCS Hiring for Observability Tools Tech Lead_PAN India


Required Technical Skill Set:


Core Responsibilities:

  • Designing and Implementing Observability Solutions:
  • This involves selecting, configuring, and deploying tools and platforms for collecting, processing, and analyzing telemetry data (logs, metrics, traces).
  • Developing and Maintaining Monitoring and Alerting Systems:
  • Creating dashboards, setting up alerts based on key performance indicators (KPIs), and ensuring timely notification of issues.
  • Instrumenting Applications and Infrastructure:
  • Working with development teams to add instrumentation code to applications to generate meaningful telemetry data. This often involves using open standards like Open Telemetry.
  • Analyzing and Troubleshooting System Performance:
  • Investigating performance bottlenecks, identifying root causes of issues, and collaborating with development teams to resolve them.
  • Defining and Tracking Service Level Objectives (SLOs) and Service Level Indicators (SLIs):
  • Working with stakeholders to define acceptable levels of performance and reliability and tracking these metrics.
  • Improving Incident Response and Post-Mortem Processes:
  • Using observability data to understand incidents, identify contributing factors, and implement preventative measures.
  • Collaborating with Development, Operations, and SRE Teams:
  • Working closely with other teams to ensure observability practices are integrated throughout the software development lifecycle.
  • Educating and Mentoring Teams on Observability Best Practices:
  • Promoting a culture of observability within the organization.
  • Managing and Optimizing Observability Infrastructure Costs:
  • Ensuring the cost-effectiveness of observability tools and platforms.
  • Staying Up to Date with Observability Trends and Technologies:
  • Continuously learning about new tools, techniques, and best practices.




Key Skills:

  • Strong Understanding of Observability Principles:
  • Deep knowledge of logs, metrics, and traces and how they contribute to understanding system behavior.
  • Proficiency with Observability Tools and Platforms:
  • Experience with tools like:
  • Logging: Elasticsearch, Splunk, Fluentd, Logstash, etc.,
  • Metrics: Prometheus, Grafana, InfluxDB, Graphite, etc.,
  • Tracing: OpenTelemetry, DataDog APM, etc.,
  • APM (Application Performance Monitoring): DataDog, New Relic, AppDynamics, etc,
  • Programming and Scripting Skills:
  • Proficiency in languages like Python, Go, Java, or scripting languages like Bash for automation and tool integration.
  • Experience with Cloud Platforms:
  • Familiarity with cloud providers like AWS, Azure, or GCP and their monitoring and logging services.
  • Understanding of Distributed Systems:
  • Knowledge of how distributed systems work and the challenges of monitoring and troubleshooting them.
  • Troubleshooting and Problem-Solving Skills:
  • Strong analytical skills to identify and resolve complex issues.
  • Communication and Collaboration Skills:
  • Ability to effectively communicate technical concepts to different audiences and work collaboratively with other teams.
  • Knowledge of DevOps and SRE Practices:
  • Understanding of continuous integration/continuous delivery (CI/CD), infrastructure as code, and site reliability engineering principles.
  • Data Analysis and Visualization Skills:
  • Ability to analyze telemetry data and create meaningful dashboards and reports.
  • Experience with Containerization and Orchestration:
  • Familiarity with Docker, Kubernetes, and related technologies.


Kind Regards,

Priyankha M

This advertiser has chosen not to accept applicants from your region.

TCS Hiring for Observability Tools Tech Lead_PAN India

Gurgaon, Haryana Tata Consultancy Services

Posted 7 days ago

Job Viewed

Tap Again To Close

Job Description

TCS Hiring for Observability Tools Tech Lead_PAN India

Experience: 8 to 12 Years Only

Job Location: PAN India


TCS Hiring for Observability Tools Tech Lead_PAN India


Required Technical Skill Set:


Core Responsibilities:

  • Designing and Implementing Observability Solutions:
  • This involves selecting, configuring, and deploying tools and platforms for collecting, processing, and analyzing telemetry data (logs, metrics, traces).
  • Developing and Maintaining Monitoring and Alerting Systems:
  • Creating dashboards, setting up alerts based on key performance indicators (KPIs), and ensuring timely notification of issues.
  • Instrumenting Applications and Infrastructure:
  • Working with development teams to add instrumentation code to applications to generate meaningful telemetry data. This often involves using open standards like Open Telemetry.
  • Analyzing and Troubleshooting System Performance:
  • Investigating performance bottlenecks, identifying root causes of issues, and collaborating with development teams to resolve them.
  • Defining and Tracking Service Level Objectives (SLOs) and Service Level Indicators (SLIs):
  • Working with stakeholders to define acceptable levels of performance and reliability and tracking these metrics.
  • Improving Incident Response and Post-Mortem Processes:
  • Using observability data to understand incidents, identify contributing factors, and implement preventative measures.
  • Collaborating with Development, Operations, and SRE Teams:
  • Working closely with other teams to ensure observability practices are integrated throughout the software development lifecycle.
  • Educating and Mentoring Teams on Observability Best Practices:
  • Promoting a culture of observability within the organization.
  • Managing and Optimizing Observability Infrastructure Costs:
  • Ensuring the cost-effectiveness of observability tools and platforms.
  • Staying Up to Date with Observability Trends and Technologies:
  • Continuously learning about new tools, techniques, and best practices.




Key Skills:

  • Strong Understanding of Observability Principles:
  • Deep knowledge of logs, metrics, and traces and how they contribute to understanding system behavior.
  • Proficiency with Observability Tools and Platforms:
  • Experience with tools like:
  • Logging: Elasticsearch, Splunk, Fluentd, Logstash, etc.,
  • Metrics: Prometheus, Grafana, InfluxDB, Graphite, etc.,
  • Tracing: OpenTelemetry, DataDog APM, etc.,
  • APM (Application Performance Monitoring): DataDog, New Relic, AppDynamics, etc,
  • Programming and Scripting Skills:
  • Proficiency in languages like Python, Go, Java, or scripting languages like Bash for automation and tool integration.
  • Experience with Cloud Platforms:
  • Familiarity with cloud providers like AWS, Azure, or GCP and their monitoring and logging services.
  • Understanding of Distributed Systems:
  • Knowledge of how distributed systems work and the challenges of monitoring and troubleshooting them.
  • Troubleshooting and Problem-Solving Skills:
  • Strong analytical skills to identify and resolve complex issues.
  • Communication and Collaboration Skills:
  • Ability to effectively communicate technical concepts to different audiences and work collaboratively with other teams.
  • Knowledge of DevOps and SRE Practices:
  • Understanding of continuous integration/continuous delivery (CI/CD), infrastructure as code, and site reliability engineering principles.
  • Data Analysis and Visualization Skills:
  • Ability to analyze telemetry data and create meaningful dashboards and reports.
  • Experience with Containerization and Orchestration:
  • Familiarity with Docker, Kubernetes, and related technologies.


Kind Regards,

Priyankha M

This advertiser has chosen not to accept applicants from your region.

TCS Hiring for Observability Tools Tech Lead_PAN India

New Delhi, Delhi Tata Consultancy Services

Posted 7 days ago

Job Viewed

Tap Again To Close

Job Description

TCS Hiring for Observability Tools Tech Lead_PAN India

Experience: 8 to 12 Years Only

Job Location: PAN India


TCS Hiring for Observability Tools Tech Lead_PAN India


Required Technical Skill Set:


Core Responsibilities:

  • Designing and Implementing Observability Solutions:
  • This involves selecting, configuring, and deploying tools and platforms for collecting, processing, and analyzing telemetry data (logs, metrics, traces).
  • Developing and Maintaining Monitoring and Alerting Systems:
  • Creating dashboards, setting up alerts based on key performance indicators (KPIs), and ensuring timely notification of issues.
  • Instrumenting Applications and Infrastructure:
  • Working with development teams to add instrumentation code to applications to generate meaningful telemetry data. This often involves using open standards like Open Telemetry.
  • Analyzing and Troubleshooting System Performance:
  • Investigating performance bottlenecks, identifying root causes of issues, and collaborating with development teams to resolve them.
  • Defining and Tracking Service Level Objectives (SLOs) and Service Level Indicators (SLIs):
  • Working with stakeholders to define acceptable levels of performance and reliability and tracking these metrics.
  • Improving Incident Response and Post-Mortem Processes:
  • Using observability data to understand incidents, identify contributing factors, and implement preventative measures.
  • Collaborating with Development, Operations, and SRE Teams:
  • Working closely with other teams to ensure observability practices are integrated throughout the software development lifecycle.
  • Educating and Mentoring Teams on Observability Best Practices:
  • Promoting a culture of observability within the organization.
  • Managing and Optimizing Observability Infrastructure Costs:
  • Ensuring the cost-effectiveness of observability tools and platforms.
  • Staying Up to Date with Observability Trends and Technologies:
  • Continuously learning about new tools, techniques, and best practices.




Key Skills:

  • Strong Understanding of Observability Principles:
  • Deep knowledge of logs, metrics, and traces and how they contribute to understanding system behavior.
  • Proficiency with Observability Tools and Platforms:
  • Experience with tools like:
  • Logging: Elasticsearch, Splunk, Fluentd, Logstash, etc.,
  • Metrics: Prometheus, Grafana, InfluxDB, Graphite, etc.,
  • Tracing: OpenTelemetry, DataDog APM, etc.,
  • APM (Application Performance Monitoring): DataDog, New Relic, AppDynamics, etc,
  • Programming and Scripting Skills:
  • Proficiency in languages like Python, Go, Java, or scripting languages like Bash for automation and tool integration.
  • Experience with Cloud Platforms:
  • Familiarity with cloud providers like AWS, Azure, or GCP and their monitoring and logging services.
  • Understanding of Distributed Systems:
  • Knowledge of how distributed systems work and the challenges of monitoring and troubleshooting them.
  • Troubleshooting and Problem-Solving Skills:
  • Strong analytical skills to identify and resolve complex issues.
  • Communication and Collaboration Skills:
  • Ability to effectively communicate technical concepts to different audiences and work collaboratively with other teams.
  • Knowledge of DevOps and SRE Practices:
  • Understanding of continuous integration/continuous delivery (CI/CD), infrastructure as code, and site reliability engineering principles.
  • Data Analysis and Visualization Skills:
  • Ability to analyze telemetry data and create meaningful dashboards and reports.
  • Experience with Containerization and Orchestration:
  • Familiarity with Docker, Kubernetes, and related technologies.


Kind Regards,

Priyankha M

This advertiser has chosen not to accept applicants from your region.
Be The First To Know

About the latest Tcs Jobs in New Delhi !

TCS Hiring for Observability Tools Tech Lead_PAN India

Faridabad, Haryana Tata Consultancy Services

Posted 7 days ago

Job Viewed

Tap Again To Close

Job Description

TCS Hiring for Observability Tools Tech Lead_PAN India

Experience: 8 to 12 Years Only

Job Location: PAN India


TCS Hiring for Observability Tools Tech Lead_PAN India


Required Technical Skill Set:


Core Responsibilities:

  • Designing and Implementing Observability Solutions:
  • This involves selecting, configuring, and deploying tools and platforms for collecting, processing, and analyzing telemetry data (logs, metrics, traces).
  • Developing and Maintaining Monitoring and Alerting Systems:
  • Creating dashboards, setting up alerts based on key performance indicators (KPIs), and ensuring timely notification of issues.
  • Instrumenting Applications and Infrastructure:
  • Working with development teams to add instrumentation code to applications to generate meaningful telemetry data. This often involves using open standards like Open Telemetry.
  • Analyzing and Troubleshooting System Performance:
  • Investigating performance bottlenecks, identifying root causes of issues, and collaborating with development teams to resolve them.
  • Defining and Tracking Service Level Objectives (SLOs) and Service Level Indicators (SLIs):
  • Working with stakeholders to define acceptable levels of performance and reliability and tracking these metrics.
  • Improving Incident Response and Post-Mortem Processes:
  • Using observability data to understand incidents, identify contributing factors, and implement preventative measures.
  • Collaborating with Development, Operations, and SRE Teams:
  • Working closely with other teams to ensure observability practices are integrated throughout the software development lifecycle.
  • Educating and Mentoring Teams on Observability Best Practices:
  • Promoting a culture of observability within the organization.
  • Managing and Optimizing Observability Infrastructure Costs:
  • Ensuring the cost-effectiveness of observability tools and platforms.
  • Staying Up to Date with Observability Trends and Technologies:
  • Continuously learning about new tools, techniques, and best practices.




Key Skills:

  • Strong Understanding of Observability Principles:
  • Deep knowledge of logs, metrics, and traces and how they contribute to understanding system behavior.
  • Proficiency with Observability Tools and Platforms:
  • Experience with tools like:
  • Logging: Elasticsearch, Splunk, Fluentd, Logstash, etc.,
  • Metrics: Prometheus, Grafana, InfluxDB, Graphite, etc.,
  • Tracing: OpenTelemetry, DataDog APM, etc.,
  • APM (Application Performance Monitoring): DataDog, New Relic, AppDynamics, etc,
  • Programming and Scripting Skills:
  • Proficiency in languages like Python, Go, Java, or scripting languages like Bash for automation and tool integration.
  • Experience with Cloud Platforms:
  • Familiarity with cloud providers like AWS, Azure, or GCP and their monitoring and logging services.
  • Understanding of Distributed Systems:
  • Knowledge of how distributed systems work and the challenges of monitoring and troubleshooting them.
  • Troubleshooting and Problem-Solving Skills:
  • Strong analytical skills to identify and resolve complex issues.
  • Communication and Collaboration Skills:
  • Ability to effectively communicate technical concepts to different audiences and work collaboratively with other teams.
  • Knowledge of DevOps and SRE Practices:
  • Understanding of continuous integration/continuous delivery (CI/CD), infrastructure as code, and site reliability engineering principles.
  • Data Analysis and Visualization Skills:
  • Ability to analyze telemetry data and create meaningful dashboards and reports.
  • Experience with Containerization and Orchestration:
  • Familiarity with Docker, Kubernetes, and related technologies.


Kind Regards,

Priyankha M

This advertiser has chosen not to accept applicants from your region.
 

Nearby Locations

Other Jobs Near Me

Industry

  1. request_quote Accounting
  2. work Administrative
  3. eco Agriculture Forestry
  4. smart_toy AI & Emerging Technologies
  5. school Apprenticeships & Trainee
  6. apartment Architecture
  7. palette Arts & Entertainment
  8. directions_car Automotive
  9. flight_takeoff Aviation
  10. account_balance Banking & Finance
  11. local_florist Beauty & Wellness
  12. restaurant Catering
  13. volunteer_activism Charity & Voluntary
  14. science Chemical Engineering
  15. child_friendly Childcare
  16. foundation Civil Engineering
  17. clean_hands Cleaning & Sanitation
  18. diversity_3 Community & Social Care
  19. construction Construction
  20. brush Creative & Digital
  21. currency_bitcoin Crypto & Blockchain
  22. support_agent Customer Service & Helpdesk
  23. medical_services Dental
  24. medical_services Driving & Transport
  25. medical_services E Commerce & Social Media
  26. school Education & Teaching
  27. electrical_services Electrical Engineering
  28. bolt Energy
  29. local_mall Fmcg
  30. gavel Government & Non Profit
  31. emoji_events Graduate
  32. health_and_safety Healthcare
  33. beach_access Hospitality & Tourism
  34. groups Human Resources
  35. precision_manufacturing Industrial Engineering
  36. security Information Security
  37. handyman Installation & Maintenance
  38. policy Insurance
  39. code IT & Software
  40. gavel Legal
  41. sports_soccer Leisure & Sports
  42. inventory_2 Logistics & Warehousing
  43. supervisor_account Management
  44. supervisor_account Management Consultancy
  45. supervisor_account Manufacturing & Production
  46. campaign Marketing
  47. build Mechanical Engineering
  48. perm_media Media & PR
  49. local_hospital Medical
  50. local_hospital Military & Public Safety
  51. local_hospital Mining
  52. medical_services Nursing
  53. local_gas_station Oil & Gas
  54. biotech Pharmaceutical
  55. checklist_rtl Project Management
  56. shopping_bag Purchasing
  57. home_work Real Estate
  58. person_search Recruitment Consultancy
  59. store Retail
  60. point_of_sale Sales
  61. science Scientific Research & Development
  62. wifi Telecoms
  63. psychology Therapy
  64. pets Veterinary
View All Tcs Jobs View All Jobs in New Delhi