Citi 2917 jobs openings
Citi 2917 jobs openings

Pyspark/Python ETL Developer

Onsite Chennai, India Posted 2 hours ago
Save Job

Job details

Title: Pyspark/Python ETL Developer with strong SQL Knowledge

 

Primary Responsibilities:-
Design, develop, and implement efficient and scalable data pipelines and ETL processes using Pyspark and Python.
Develop, optimize, and maintain complex SQL queries and stored procedures within Oracle database environments.
Collaborate with data architects, data scientists, and other stakeholders to understand data requirements and translate them into technical solutions.
Perform data analysis, profiling, and quality checks to ensure data accuracy and integrity.
Optimize Pyspark and Python code for performance and efficiency on large datasets.
Troubleshoot and resolve data-related issues, ensuring data availability and reliability.
Participate in code reviews, testing, and deployment processes.
Stay up-to-date with emerging technologies and best practices in data engineering and big data.
Document technical designs, data flows, and code.

 

Qualifications:-
Required Skills
----------------
Education: Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field.
Experience: 3+ years of experience in data engineering or software development with a focus on Pyspark/Python.
Pyspark/Python: Proven expertise in developing data processing applications using Pyspark and Python.
Oracle SQL: Strong proficiency in writing complex SQL queries, stored procedures, and understanding database schema in Oracle.
Big Data: Experience with Apache Spark and its ecosystem for big data processing.
ETL: Solid understanding and experience with ETL methodologies and tools.
Automation Tools: Experience with job scheduling and automation tools, such as Autosys.
Shell Scripting: Proficiency in Unix/Linux shell scripting for automation and system tasks.
Version Control: Experience with Git or similar version control systems.
Problem-Solving: Excellent analytical and problem-solving skills with attention to detail.

 

Preferred Skills
----------------
Experience with other database systems (e.g., PostgreSQL, SQL Server).
Knowledge of cloud platforms (AWS, Azure, GCP) and their data services (e.g., AWS S3, EMR, Glue).
Familiarity with data orchestration tools (e.g., Apache Airflow).
Experience with data visualization tools (e.g., Tableau, Power BI).
Understanding of data warehousing concepts (e.g., Kimball, Inmon).

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Applications Development

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

 

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View Citi’s EEO Policy Statement and the Know Your Rights poster.

Get Weekly Job Offers

Be first to know when jobs open.

Pyspark/Python ETL Developer
Onsite Chennai, India Posted 2 hours ago
Save Job