2 Neural Networks jobs in India

Staff Engineer/Tech Lead - AI/ML [ Natural Language Processing, Transformers, Gen AI, LLM, Neural...

Bangalore, Karnataka Nutanix

Posted 5 days ago

Job Viewed

Tap Again To Close

Job Description

**Hungry, Humble, Honest, with Heart.**
**The Opportunity**
We are reimagining observability at Nutanix with **Panacea.ai** , our next-gen AI-driven log and metrics analyzer. In version 1.0, we leveraged regex-based filters to surface anomalies. Now, we're building **Panacea.ai** -powered by **AI/ML, ModernBERT, and LLMs** -to deliver intelligent, context-rich anomaly detection, automated root cause analysis (Auto-RCA), and continuous learning from user feedback.As a **Staff Engineer (MTS-6)** , you will **own the architecture and AI/ML systems that power both log and metrics analysis** , enabling automated diagnostics and reducing triage time for QA failures, regression runs, and customer issues. You'll also help define and drive the central AI charter at Nutanix, building reusable components, model infrastructure, and scalable ML services.
**About the Team**
The **Panacea** team has a passionate set of engineers across India and US office. We move fast, collaborate closely, and care deeply about quality and ownership. Our mission is to deliver **AI/ML-powered developer productivity tools** that solve real engineering and support pain points at scale.
Why Join Us
+ Build **AI-first observability tools** that redefine how engineers triage and troubleshoot.
+ Own systems that reduce hours of manual work in **engineering and SRE workflows** .
+ Collaborate with a **tight-knit team of high-ownership engineers** who are passionate about impact and innovation.
+ Hybrid work model that supports flexibility and deep focus.
+ Help shape the **central AI charter** at Nutanix and influence future AI products across the company.
**Your Role**
+ **AI-Powered Observability Platform** : Own the vision, architecture, and delivery of Panacea's ML-based log and metrics analyzer that reduces triage time and improves engineering efficiency.
+ **AI/ML-powered Log Analyzer Tool** : Use deep learning (e.g., **ModernBERT** ) to represent log messages, detect anomalies, group patterns, and surface actionable insights to users.
+ **Metrics Anomaly Detection Engine** : Build robust ML models to detect anomalies in time-series metrics like **CPU, memory, disk I/O, network traffic, service health** , and more-automatically identifying performance degradation or system regressions across distributed environments.
+ **Auto-RCA Engine** : Combine log and metrics signals with graph-based correlation and LLM-powered summarization to automatically diagnose the root cause of system failures.
+ **Feedback Loop & Continuous Learning** : Build infrastructure for incorporating user feedback to continuously retrain and improve anomaly detection systems.
+ **LLM Integration** : Integrate LLMs for user queries, problem summarization, anomaly explanation, and contextual recommendations.
+ **Central AI Charter** : Contribute to Nutanix's foundational AI platform by defining shared tooling, datasets, governance, and reusable ML components across products.
Responsibilities
+ Architect and scale ML pipelines for **real-time and batch-based anomaly detection** in both logs and time-series metrics.
+ Build and fine-tune **ModernBERT** and other transformer-based models for log understanding, anomaly classification, and summarization.
+ Develop unsupervised and semi-supervised ML models for **detecting anomalies in system metrics** (CPU, memory, network throughput, latency, etc.).
+ Implement correlation models to connect anomalies across logs and metrics to form a cohesive RCA narrative.
+ Own the entire ML lifecycle: data ingestion, feature extraction, model training, evaluation, deployment, and monitoring.
+ Build explainable AI systems that increase adoption and trust within engineering, QA, and support teams.
+ Collaborate with cross-functional stakeholders (SRE, QA, Dev) to deeply understand pain points and translate them into intelligent tooling.
+ Drive technical excellence through code and design reviews, mentoring, and setting engineering best practices.
**What You Will Bring**
+ **Educational Background** : B.Tech/M.Tech in Computer Science, Machine Learning, AI, or related fields.
+ **Experience** : 12+ years of engineering experience , including designing , developing and deploying AI/ML systems at scale.
+ **ML Expertise** :
+ Strong in time-series anomaly detection, statistical modeling, supervised/unsupervised learning.
+ Experience building ML models for **metrics data** (CPU, memory, IOPS, network, etc.) using models like Isolation Forest, Prophet, LSTM, or deep autoencoders.
+ Expertise in NLP using **ModernBERT, BERT, or** log classification, clustering, and summarization.
+ Experience with LLMs for downstream tasks like summarization, root cause reasoning, or intelligent Q&A.
+ **Engineering Skills** : Strong Python background, hands-on with ML libraries (PyTorch, TensorFlow, Scikit-learn), time-series frameworks, and MLOps tools. Familiar with data pipelines and serving models.
+ **Observability Knowledge** : Hands-on with logs, metrics, traces, and popular monitoring tools (e.g., Prometheus, Grafana, ELK).
+ **Leadership** : Ability to independently drive projects from requirements to delivery, mentor junior engineers, and deliver business impact.
**Work Arrangement**
Hybrid: This role operates in a hybrid capacity, blending the benefits of remote work with the advantages of in-person collaboration. For most roles, that will mean coming into an office a minimum of 2 - 3 days per week, however certain roles and/or teams may require more frequent in-office presence. Additional team-specific guidance and norms will be provided by your manager.
We're an Equal Opportunity Employer Nutanix is an Equal Employment Opportunity and (in the U.S.) an Affirmative Action employer. Qualified applicants are considered for employment opportunities without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, marital status, protected veteran status, disability status or any other category protected by applicable law. We hire and promote individuals solely on the basis of qualifications for the job to be filled. We strive to foster an inclusive working environment that enables all our Nutants to be themselves and to do great work in a safe and welcoming environment, free of unlawful discrimination, intimidation or harassment. As part of this commitment, we will ensure that persons with disabilities are provided reasonable accommodations. If you need a reasonable accommodation, please let us know by contacting
This advertiser has chosen not to accept applicants from your region.

Staff Engineer/Tech Lead – AI/ML [ Natural Language Processing, Transformers, Gen AI, LLM, Neural...

Bengaluru, Karnataka Nutanix

Posted today

Job Viewed

Tap Again To Close

Job Description

The Opportunity

We are reimagining observability at Nutanix with , our next-gen AI-driven log and metrics analyzer. In version 1.0, we leveraged regex-based filters to surface anomalies. Now, we’re building —powered by AI/ML, ModernBERT, and LLMs —to deliver intelligent, context-rich anomaly detection, automated root cause analysis (Auto-RCA), and continuous learning from user a Staff Engineer (MTS-6) , you will own the architecture and AI/ML systems that power both log and metrics analysis , enabling automated diagnostics and reducing triage time for QA failures, regression runs, and customer issues. You’ll also help define and drive the central AI charter at Nutanix, building reusable components, model infrastructure, and scalable ML services.


About the Team

The Panacea team has a passionate set of engineers across India and US office. We move fast, collaborate closely, and care deeply about quality and ownership. Our mission is to deliver AI/ML-powered developer productivity tools that solve real engineering and support pain points at scale.

Why Join Us
 

  • Build AI-first observability tools that redefine how engineers triage and troubleshoot.
  • Own systems that reduce hours of manual work in engineering and SRE workflows .
  • Collaborate with a tight-knit team of high-ownership engineers who are passionate about impact and innovation.
  • Hybrid work model that supports flexibility and deep focus.
  • Help shape the central AI charter at Nutanix and influence future AI products across the company.

  • Your Role

  • AI-Powered Observability Platform : Own the vision, architecture, and delivery of Panacea’s ML-based log and metrics analyzer that reduces triage time and improves engineering efficiency.
  • AI/ML-powered Log Analyzer Tool : Use deep learning (., ModernBERT ) to represent log messages, detect anomalies, group patterns, and surface actionable insights to users.
  • Metrics Anomaly Detection Engine : Build robust ML models to detect anomalies in time-series metrics like CPU, memory, disk I/O, network traffic, service health , and more—automatically identifying performance degradation or system regressions across distributed environments.
  • Auto-RCA Engine : Combine log and metrics signals with graph-based correlation and LLM-powered summarization to automatically diagnose the root cause of system failures.
  • Feedback Loop & Continuous Learning : Build infrastructure for incorporating user feedback to continuously retrain and improve anomaly detection systems.
  • LLM Integration : Integrate LLMs for user queries, problem summarization, anomaly explanation, and contextual recommendations.
  • Central AI Charter : Contribute to Nutanix’s foundational AI platform by defining shared tooling, datasets, governance, and reusable ML components across products.
  • Responsibilities
     

  • Architect and scale ML pipelines for real-time and batch-based anomaly detection in both logs and time-series metrics.
  • Build and fine-tune ModernBERT and other transformer-based models for log understanding, anomaly classification, and summarization.
  • Develop unsupervised and semi-supervised ML models for detecting anomalies in system metrics (CPU, memory, network throughput, latency, .
  • Implement correlation models to connect anomalies across logs and metrics to form a cohesive RCA narrative.
  • Own the entire ML lifecycle: data ingestion, feature extraction, model training, evaluation, deployment, and monitoring.
  • Build explainable AI systems that increase adoption and trust within engineering, QA, and support teams.
  • Collaborate with cross-functional stakeholders (SRE, QA, Dev) to deeply understand pain points and translate them into intelligent tooling.
  • Drive technical excellence through code and design reviews, mentoring, and setting engineering best practices.

  • What You Will Bring

  • Educational Background : / in Computer Science, Machine Learning, AI, or related fields.
  • Experience : 12+ years of engineering experience , including designing , developing and deploying AI/ML systems at scale.
  • ML Expertise :Strong in time-series anomaly detection, statistical modeling, supervised/unsupervised learning.Experience building ML models for metrics data (CPU, memory, IOPS, network, using models like Isolation Forest, Prophet, LSTM, or deep autoencoders.Expertise in NLP using ModernBERT, BERT, or log classification, clustering, and summarization.Experience with LLMs for downstream tasks like summarization, root cause reasoning, or intelligent Q&A.
  • Engineering Skills : Strong Python background, hands-on with ML libraries (PyTorch, TensorFlow, Scikit-learn), time-series frameworks, and MLOps tools. Familiar with data pipelines and serving models.
  • Observability Knowledge : Hands-on with logs, metrics, traces, and popular monitoring tools (., Prometheus, Grafana, ELK).
  • Leadership : Ability to independently drive projects from requirements to delivery, mentor junior engineers, and deliver business impact.

  • Work Arrangement

    Hybrid: This role operates in a hybrid capacity, blending the benefits of remote work with the advantages of in-person collaboration. For most roles, that will mean coming into an office a minimum of 2 - 3 days per week, however certain roles and/or teams may require more frequent in-office presence. Additional team-specific guidance and norms will be provided by your manager.


    --

    This advertiser has chosen not to accept applicants from your region.
    Be The First To Know

    About the latest Neural networks Jobs in India !

     

    Nearby Locations

    Other Jobs Near Me

    Industry

    1. request_quote Accounting
    2. work Administrative
    3. eco Agriculture Forestry
    4. smart_toy AI & Emerging Technologies
    5. school Apprenticeships & Trainee
    6. apartment Architecture
    7. palette Arts & Entertainment
    8. directions_car Automotive
    9. flight_takeoff Aviation
    10. account_balance Banking & Finance
    11. local_florist Beauty & Wellness
    12. restaurant Catering
    13. volunteer_activism Charity & Voluntary
    14. science Chemical Engineering
    15. child_friendly Childcare
    16. foundation Civil Engineering
    17. clean_hands Cleaning & Sanitation
    18. diversity_3 Community & Social Care
    19. construction Construction
    20. brush Creative & Digital
    21. currency_bitcoin Crypto & Blockchain
    22. support_agent Customer Service & Helpdesk
    23. medical_services Dental
    24. medical_services Driving & Transport
    25. medical_services E Commerce & Social Media
    26. school Education & Teaching
    27. electrical_services Electrical Engineering
    28. bolt Energy
    29. local_mall Fmcg
    30. gavel Government & Non Profit
    31. emoji_events Graduate
    32. health_and_safety Healthcare
    33. beach_access Hospitality & Tourism
    34. groups Human Resources
    35. precision_manufacturing Industrial Engineering
    36. security Information Security
    37. handyman Installation & Maintenance
    38. policy Insurance
    39. code IT & Software
    40. gavel Legal
    41. sports_soccer Leisure & Sports
    42. inventory_2 Logistics & Warehousing
    43. supervisor_account Management
    44. supervisor_account Management Consultancy
    45. supervisor_account Manufacturing & Production
    46. campaign Marketing
    47. build Mechanical Engineering
    48. perm_media Media & PR
    49. local_hospital Medical
    50. local_hospital Military & Public Safety
    51. local_hospital Mining
    52. medical_services Nursing
    53. local_gas_station Oil & Gas
    54. biotech Pharmaceutical
    55. checklist_rtl Project Management
    56. shopping_bag Purchasing
    57. home_work Real Estate
    58. person_search Recruitment Consultancy
    59. store Retail
    60. point_of_sale Sales
    61. science Scientific Research & Development
    62. wifi Telecoms
    63. psychology Therapy
    64. pets Veterinary
    View All Neural Networks Jobs