1 Statistical Modeling jobs in India

Data Scientist (machine learning, statistical modeling and NLP applications)

Bengaluru, Karnataka AT&T

Posted 4 days ago

Job Viewed

Tap Again To Close

Job Description

**Job Description:**
**Key Responsibilities**
**Text Embeddings & NLP**
-Must have 3+ Years of experience on traditional ML.
-Must have 3+ Years of experience working as Data scientist.
Design and implement pipelines leveraging text embeddings for semantic search, classification, clustering, and document retrieval.
- Work with embedding techniques such as TF-IDF, Word2Vec, GloVe, FastText, and transformer-based models including BERT, Sentence-BERT, OpenAI, and Azure OpenAI embeddings.
- Apply dimensionality reduction methods (PCA, t-SNE, UMAP) to analyze and visualize embedding spaces.
- Use cosine similarity, Euclidean distance, and approximate nearest neighbor algorithms like FAISS and ScaNN for similarity search and clustering.
- Integrate embedding outputs into downstream applications such as intent detection, topic modeling, semantic deduplication, document ranking, and retrieval systems.
***Traditional Machine Learning & Statistical Modeling***
- Build and deploy predictive models with logistic/linear regression, random forests, gradient boosting techniques (XGBoost, LightGBM), SVM, Naive Bayes, k-means, and hierarchical clustering.
- Employ statistical inference techniques including hypothesis testing, confidence intervals, bootstrapping, Bayesian inference, multicollinearity diagnostics, residual analysis, and time series forecasting (ARIMA, SARIMA).
- Evaluate model performance using ROC/Precision-Recall curves, AUC, confusion matrices, F1-score, lift/gain charts, and KS statistics.
- Conduct feature selection via Lasso/Ridge regression, recursive feature elimination (RFE), and SHAP values for interpretability.
**Experimentation & Causal Inference**
- Design and analyze A/B and multivariate tests, DOE experiments, and sophisticated causal inference methods including propensity score matching, causal forests, and difference-in-differences.
- Translate experimental results into clear, actionable business insights that drive measurable outcomes.
**Data Engineering & Productionization**
- Develop scalable data pipelines using PySpark, SQL, and Azure Data Factory on platforms including Azure Data Lake, Databricks, MongoDB, and Cosmos DB.
- Deploy machine learning solutions with FastAPI, Docker containers, and Azure App Services endpoints, while monitoring model health with MLflow and model drift.
***Collaboration & Leadership***
- Partner effectively with engineering, product, and business teams to define problem statements and deliver impactful solutions.
- Lead technical discussions, perform code reviews, and mentor junior data scientists to foster technical growth.
- Communicate complex analytical insights clearly to both technical and non-technical stakeholders.
**Required Skills and Qualifications**
Hands-on experience in machine learning, statistical modeling, and NLP applications.
- Deep expertise in text embeddings and their real-world applications.
- Proficiency in Python, PySpark, and SQL.
- Strong foundation in statistical inference, model diagnostics, and evaluation metrics.
- Experience working with Azure cloud ecosystem, Databricks, and production deployment of ML models.
- Proven ability to design, execute, and interpret experiments with statistical rigor.
**Preferred (Good-to-Have) Skills**
- Familiarity with transformer-based large language models (LLMs), LangChain, or OpenAI APIs.
- Experience with MLOps tools such as MLflow and Github Actions CI/CD pipelines with Azure App Services.
- Exposure to graph analytics, retrieval-augmented generation (RAG) pipelines, or agent-based systems.
**Day-to-Day Responsibilities**
You will architect and implement advanced NLP and machine learning pipelines leveraging diverse text embeddings for semantic search, classification, and clustering tasks. Applying sound statistical modeling and causal inference techniques, you will lead experimentation efforts and build scalable data workflows using PySpark, SQL, and Azure services. Cross-functional collaboration will be a core part of your role as you translate analytical insights into strategic business outcomes.
**Weekly Hours:**
40
**Time Type:**
Regular
**Location:**
IND:KA:Bengaluru / Innovator Building, Itpb, Whitefield Rd - Adm: Intl Tech Park, Innovator Bldg
It is the policy of AT&T to provide equal employment opportunity (EEO) to all persons regardless of age, color, national origin, citizenship status, physical or mental disability, race, religion, creed, gender, sex, sexual orientation, gender identity and/or expression, genetic information, marital status, status with regard to public assistance, veteran status, or any other characteristic protected by federal, state or local law. In addition, AT&T will provide reasonable accommodations for qualified individuals with disabilities. AT&T is a fair chance employer and does not initiate a background check until an offer is made.
AT&T will consider for employment qualified applicants in a manner consistent with the requirements of federal, state and local laws
We expect employees to be honest, trustworthy, and operate with integrity. Discrimination and all unlawful harassment (including sexual harassment) in employment is not tolerated. We encourage success based on our individual merits and abilities without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, age, disability, marital status, citizenship status, military status, protected veteran status or employment status
This advertiser has chosen not to accept applicants from your region.
Be The First To Know

About the latest Statistical modeling Jobs in India !

 

Nearby Locations

Other Jobs Near Me

Industry

  1. request_quote Accounting
  2. work Administrative
  3. eco Agriculture Forestry
  4. smart_toy AI & Emerging Technologies
  5. school Apprenticeships & Trainee
  6. apartment Architecture
  7. palette Arts & Entertainment
  8. directions_car Automotive
  9. flight_takeoff Aviation
  10. account_balance Banking & Finance
  11. local_florist Beauty & Wellness
  12. restaurant Catering
  13. volunteer_activism Charity & Voluntary
  14. science Chemical Engineering
  15. child_friendly Childcare
  16. foundation Civil Engineering
  17. clean_hands Cleaning & Sanitation
  18. diversity_3 Community & Social Care
  19. construction Construction
  20. brush Creative & Digital
  21. currency_bitcoin Crypto & Blockchain
  22. support_agent Customer Service & Helpdesk
  23. medical_services Dental
  24. medical_services Driving & Transport
  25. medical_services E Commerce & Social Media
  26. school Education & Teaching
  27. electrical_services Electrical Engineering
  28. bolt Energy
  29. local_mall Fmcg
  30. gavel Government & Non Profit
  31. emoji_events Graduate
  32. health_and_safety Healthcare
  33. beach_access Hospitality & Tourism
  34. groups Human Resources
  35. precision_manufacturing Industrial Engineering
  36. security Information Security
  37. handyman Installation & Maintenance
  38. policy Insurance
  39. code IT & Software
  40. gavel Legal
  41. sports_soccer Leisure & Sports
  42. inventory_2 Logistics & Warehousing
  43. supervisor_account Management
  44. supervisor_account Management Consultancy
  45. supervisor_account Manufacturing & Production
  46. campaign Marketing
  47. build Mechanical Engineering
  48. perm_media Media & PR
  49. local_hospital Medical
  50. local_hospital Military & Public Safety
  51. local_hospital Mining
  52. medical_services Nursing
  53. local_gas_station Oil & Gas
  54. biotech Pharmaceutical
  55. checklist_rtl Project Management
  56. shopping_bag Purchasing
  57. home_work Real Estate
  58. person_search Recruitment Consultancy
  59. store Retail
  60. point_of_sale Sales
  61. science Scientific Research & Development
  62. wifi Telecoms
  63. psychology Therapy
  64. pets Veterinary
View All Statistical Modeling Jobs