Software Engineer III - ETL, PySpark and AWS

Company:  myGwork - LGBTQ+ professionals & allies
Location: Houston
Closing Date: 03/11/2024
Salary: £100 - £125 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

This job is with JPMorganChase, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.

We have an exciting and rewarding opportunity for you to take your software engineering career to the next level.

As a Software Engineer III at JPMorgan Chase within the Corporate Technology, Legal Reporting team you serve as a seasoned member of an agile team to design and deliver trusted market-leading technology products in a secure, stable, and scalable way. We are looking for an experienced Data Engineer to join our dynamic team. In this role, you will be responsible for designing, developing, and optimizing data pipelines using AWS services like Glue, Redshift, and Lambda. The ideal candidate should have hands-on experience with ETL processes, performance tuning, and a strong understanding of cloud-based data platforms.

Job Responsibilities

  1. Develop and Maintain ETL Pipelines: Design, develop, and implement scalable ETL workflows using PySpark, Python, and AWS Glue.
  2. Data Transformation and Integration: Extract, transform, and load data from various sources to AWS S3 and Redshift.
  3. Performance Optimization: Identify and resolve performance bottlenecks in ETL processes, ensuring optimal performance across large datasets.
  4. Automation and Monitoring: Implement automation scripts using AWS Lambda to schedule and monitor data pipelines.
  5. Data Quality: Ensure data integrity and quality across all stages of the ETL pipeline.
  6. Collaboration: Work closely with data architects, analysts, and stakeholders to understand requirements and provide clear communication throughout the project lifecycle.
  7. Documentation: Create and maintain technical documentation, including data mapping, workflow designs, and ETL processes.
  8. Proactively identify hidden problems and patterns in data and use these insights to drive improvements to coding hygiene and system architecture.
  9. Contribute to software engineering communities of practice and events that explore new and emerging technologies.
  10. Add to team culture of diversity, equity, inclusion, and respect.

Required Qualifications, Capabilities, And Skills

  1. Formal training or certification on software engineering concepts and 3+ years of applied experience.
  2. Hands-on experience in ETL development using PySpark, Python, and AWS services (Glue, Lambda, S3, and Redshift).
  3. Experience in optimizing data pipelines and troubleshooting performance issues.
  4. Strong understanding of SQL and relational databases.
  5. Familiarity with data warehousing concepts and design patterns.
  6. Excellent problem-solving skills and attention to detail.
  7. Strong communication skills, with the ability to explain technical concepts to non-technical stakeholders.

Preferred Qualifications, Capabilities, And Skills

  1. Experience with other AWS services like Athena, Step Functions, and CloudWatch.
  2. Knowledge of CI/CD pipelines and best practices in deployment automation.
  3. Experience working with large-scale distributed systems and big data environments.
#J-18808-Ljbffr
Apply Now
An error has occurred. This application may no longer respond until reloaded. Reload 🗙