37,149 Spark jobs in India
Spark Developer
Posted today
Job Viewed
Job Description
ETL Developer will be responsible for designing, implementing, and optimizing distributed data processing jobs to handle large-scale data in Hadoop Distributed File System(HDFS) using Apache Spark and Python. This role required deep understanding of data engineering principles, proficiency in Python and hands-on experience with Spark and Hadoop ecosystems. Developer will collaborate with data engineers, analysts, and business stakeholders to process, transform and drive insights and data driven decisions.
Responsibilities:
- Data Processing and Transformation:
Design and Implement of Spark applications to process and transform large datasets in HDFS.
Develop ETL Pipelines in Spark using Python for data Ingestion, cleaning, aggregation, and transformations.
Performance Optimization:
Optimize Spark jobs for efficiency, reducing run time and resource usage.
Finetune memory management, caching, and partitioning strategies for Optimal performance
Data Engineering with Hadoop and Spark:
Load data from different sources into HDFS, ensuring data accuracy and integrity.
Integrate Spark Applications with Hadoop frameworks like Hive, Sqoop etc.
Testing and debugging:
Troubleshoot and debug Spark Job failures, monitor job logs, and Spark UI to Identify Issues.
Qualifications:
- 2-5 years of relevant experience
- Experience in programming/debugging used in business applications
- Working knowledge of industry practice and standards
- Comprehensive knowledge of specific business area for application development
- Working knowledge of program languages
- Consistently demonstrates clear and concise written and verbal communication
- Expertise in handling complex large-scale Warehouse environments
- Hands-on experience writing complex SQL queries, exporting and importing large amounts of data using utilities
Education:
- Bachelor's degree in a quantitative field (such as Engineering, Computer Science) or equivalent experience
This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.
-
Job Family Group:
Technology
-
Job Family:
Applications Development
-
Time Type:
Full time
-
Most Relevant Skills
Please see the requirements listed above.
-
Other Relevant Skills
For complementary skills, please see above and/or contact the recruiter.
-
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi .
View Citi's EEO Policy Statement and the Know Your Rights poster.
Spark Developer
Posted today
Job Viewed
Job Description
Educational Requirements
Bachelor of Engineering
Service Line
Data & Analytics Unit
Responsibilities
A day in the life of an Infoscion- As part of the Infosys consulting team, your primary role would be to get to the heart of customer issues, diagnose problem areas, design innovative solutions and facilitate deployment resulting in client delight.
- You will develop a proposal by owning parts of the proposal document and by giving inputs in solution design based on areas of expertise.
- You will plan the activities of configuration, configure the product as per the design, conduct conference room pilots and will assist in resolving any queries related to requirements and solution design
- You will conduct solution/product demonstrations, POC/Proof of Technology workshops and prepare effort estimates which suit the customer budgetary requirements and are in line with organizations financial guidelines
- Actively lead small projects and contribute to unit-level and organizational initiatives with an objective of providing high quality value adding solutions to customers. If you think you fit right in to help our clients navigate their next in their digital transformation journey, this is the place for you
Additional Responsibilities:
- Ability to develop value-creating strategies and models that enable clients to innovate, drive growth and increase their business profitability
- Good knowledge on software configuration management systems
- Awareness of latest technologies and Industry trends
- Logical thinking and problem solving skills along with an ability to collaborate
- Understanding of the financial processes for various types of projects and the various pricing models available
- Ability to assess the current processes, identify improvement areas and suggest the technology solutions
- One or two industry domain knowledge
- Client Interfacing skills
- Project and Team management
Technical and Professional Requirements:
- Primary skills:Technology->Big Data - Data Processing->Spark
Preferred Skills:
Technology->Big Data - Data Processing->Spark
Spark Developer
Posted today
Job Viewed
Job Description
We are seeking a Big Data Engineer with strong hands-on experience in Spark and AWS technologies. The ideal candidate should demonstrate a deep understanding of big data concepts, programming fundamentals, and the ability to solve complex problems related to scalability, failure handling, and optimization.
Key Responsibilities:
- Design, develop, and optimize big data pipelines using Spark on AWS .
- Implement scalable and fault-tolerant data processing solutions.
- Troubleshoot and resolve performance bottlenecks in big data workflows.
- Collaborate with cross-functional teams to understand business requirements and translate them into technical solutions.
- Write clean, efficient, and well-documented code following core programming principles.
- Continuously improve existing data infrastructure for better reliability and performance.
Required Skills & Experience:
- Strong practical experience with Apache Spark and big data ecosystems.
- Hands-on experience with AWS services relevant to big data (e.G., EMR, S3, Lambda).
- Solid understanding of core programming fundamentals, including Object-Oriented Programming (OOP) concepts.
- Proven problem-solving skills related to scaling, failure handling, and performance optimization in big data environments.
- Ability to explain not just what technologies are used, but why and how they work.
- Familiarity with common big data terms and best practices.
Python Spark Developer
Posted today
Job Viewed
Job Description
Well versed with:
- Very good proficiency in Python and Spark programming.
- Pandas: Experience with data manipulation and analysis using Pandas.
- Implementation experience of Spark Core, Spark SQL and Spark Streaming
- Working with Spark in combination Hadoop Ecosystem
- Design and implementation of low-latency, high-availability, and performance applications.
- Should lead and guide a team of junior Python developers.
- As Sr Python Developer responsibilities include coding, testing, debugging, and troubleshooting throughout the application development process.
- Performance tuning, improvement, balancing, usability, and automation.
- Collaborate with other team members and stakeholders.
Educational Qualification
- Proficiency in Python programming.
- Experience with Pandas for data manipulation and analysis.
- Knowledge of Polars for efficient data processing.
- Strong problem-solving skills and attention to detail.
- Ability to work collaboratively in a team environment
Java Spark Developer
Posted today
Job Viewed
Job Description
The Applications Development Intermediate Programmer Analyst is an intermediate level position responsible for participation in the establishment and implementation of new or revised application systems and programs in coordination with the Technology team. The overall objective of this role is to contribute to applications systems analysis and programming activities.
Responsibilities:
- Utilize knowledge of applications development procedures and concepts, and basic knowledge of other technical areas to identify and define necessary system enhancements, including using script tools and analyzing/interpreting code
- Consult with users, clients, and other technology groups on issues, and recommend programming solutions, install, and support customer exposure systems
- Apply fundamental knowledge of programming languages for design specifications.
- Analyze applications to identify vulnerabilities and security issues, as well as conduct testing and debugging
- Serve as advisor or coach to new or lower level analysts
- Identify problems, analyze information, and make evaluative judgements to recommend and implement solutions
- Resolve issues by identifying and selecting solutions through the applications of acquired technical experience and guided by precedents
- Has the ability to operate with a limited level of direct supervision.
- Can exercise independence of judgement and autonomy.
- Acts as SME to senior stakeholders and /or other team members.
- Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency.
Qualifications:
- 4-6 years of relevant experience in the Financial Service industry
- Intermediate level experience in Applications Development role
- Consistently demonstrates clear and concise written and verbal communication
- Demonstrated problem-solving and decision-making skills
- Ability to work under pressure and manage deadlines or unexpected changes in expectations or requirements
Education:
- Bachelor's degree/University degree or equivalent experience
This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.
- Experience in ETL processes
- Java Spark, Big data skills, Ab Initio
-
Job Family Group:
Technology
-
Job Family:
Applications Development
-
Time Type:
Full time
-
Most Relevant Skills
Please see the requirements listed above.
-
Other Relevant Skills
For complementary skills, please see above and/or contact the recruiter.
-
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi .
View Citi's EEO Policy Statement and the Know Your Rights poster.
java + spark developer
Posted today
Job Viewed
Job Description
Hiring Java + Spark Developer with experience range 5 to 15 Years of experience
Mandatory Skills: Java /Spark/ Pyspark/ Hadoop
Education: Btech/BE, BCA/MCA,Bsc/MSc
Java/ Spark Developer
Posted today
Job Viewed
Job Description
Translate use cases into functional apps; design, build, and optimize Java/Spark code; identify bottlenecks and fix bugs; process data with Hive, Impala, HBASE. Skills: 8+ yrs Java, Big Data (HDFS, Spark, Kafka), SQL, multithreading, Agile, Git.
Required Candidate profile
Java/Spark Developer with 5+ years' experience in Big Data, HDFS, Hive, Kafka, SQL, and multithreading. Proficient in Agile, Git, and performance optimization with strong problem-solving skills.
Be The First To Know
About the latest Spark Jobs in India !
Java Spark Developer
Posted today
Job Viewed
Job Description
Java+Spark
- Primary skill - Apache Spark Secondary skill - Java
- Strong knowledge in Apache Spark framework Core Spark, Spark Data Frames, Spark streaming
- Hands-on experience in any one of the programming languages (Java)
- Good understanding of distributed programming concepts.
- Experience in optimizing Spark DAG, and Hive queries on Tez
- Experience using tools like Git, Autosys, Bitbucket, Jira
- Ability to apply DWH principles within Hadoop environment and NoSQL databases.
Mandatory Skills: Apache Spark.Experience: 5-8 Years.
Scala Spark Developer
Posted today
Job Viewed
Job Description
Scala/Spark Developer: 5+ yrs experience in designing scalable data pipelines with Scala, Spark, Hadoop, Kafka. Skilled in data formats(Avro,Parquet),HDFS, SQL, NoSQL. Proficient in troubleshooting, optimization, delivering reliable bigdata solutions
Required Candidate profile
Scala/Spark Developer with 5+ years' experience in building scalable data pipelines using Scala, Spark, Hadoop, Kafka. Skilled in SQL, NoSQL, HDFS, and optimizing big data processes for reliability.
Scala Spark Developer
Posted today
Job Viewed
Job Description
Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. Join us and help the world's leading organizations unlock the value of technology and build a more sustainable, more inclusive world.
Your Role- As a senior software engineer with Capgemini, you will have 3 + years of experience in Scala with strong project track record.
- Hands-on expertise in Scala and Spark, strong SQL skills on DB2, and proficiency in handling diverse file formats including JSON, Parquet, AVRO, ORC, and XML.
- Proven experience in HDFS platform development, data analysis, profiling, and lineage. Effective communicator with a solid background in Agile project environments
- This position offers an exciting opportunity to contribute to high-impact data engineering projects within a dynamic and innovative environment.
- As a senior software engineer with Capgemini, you will have 3 + years of experience in Scala with strong project track record.
- Hands On experience in Scala/Spark developer
- Hands on SQL writing skills on RDBMS (DB2) databases
- Experience in working with different file formats like JSON, Parquet, AVRO, ORC and XML.
- Must have worked in a HDFS platform development project.
- Proficiency in data analysis, data profiling, and data lineage
- Strong oral and written communication skills
- Experience working in Agile projects.
•You can shape your career with us. We offer a range of career paths and internal opportunities within Capgemini group. You will also get personalized career guidance from our leaders.
•You will get comprehensive wellness benefits including health checks, telemedicine, insurance with top-ups, elder care, partner coverage or new parent support via flexible work.
•You will have the opportunity to learn on one of the industry's largest digital learning platforms, with access to 250,000+ courses and numerous certifications.
•We're committed to ensure that people of all backgrounds feel encouraged and have a sense of belonging at Capgemini. You are valued for who you are, and you can bring your original self to work .
Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem.