Job ID: 2742083 | Amazon Development Center U.S., Inc.
The AWS Center for Quantum Computing (CQC) is looking to hire a Systems Development Engineer to develop and maintain high-performance computing (HPC) systems on AWS that CQC scientists and engineers use for quantum computing hardware design and simulation. These systems run large-scale HPC applications with MPI- and OpenMP-based parallelism on EC2 instances and provide graphical environments for computer-aided engineering applications.
Key job responsibilities
- Administer cloud-based HPC environments, including automating the deployment and maintenance of clusters, automating application testing, provisioning user accounts, managing and updating operating systems and software environments, building and deploying containers, debugging problems, and securing these systems.
- Support computer-aided engineering end users by improving the robustness and performance of the HPC environment. Collaborate with partner teams and software developers to increase availability across instance types, manage data spread across clusters in different regions, and improve integration with other services.
- Support computational scientists in developing state-of-the-art applications, for example with Palace. You may investigate the impact of build and environment configurations on performance, implement CI/CD and benchmarking strategies to test across heterogeneous compute environments, and keep library dependencies up to date, maintaining patch files as needed.
- Anticipate computational resource needs within, and expand the capabilities of, the CQC to support developing a scalable fault-tolerant quantum computer. Improvements to computational infrastructure will allow the CQC to expand physical modeling capabilities used within design, utilizing the scalable compute power of AWS to accelerate the design cycle.
BASIC QUALIFICATIONS
- 4+ years of systems design, software development, operations, automation, and process improvement experience
- Experience leading the design, automation, deployment, and support of large-scale infrastructure
- Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, Rust
- Experience with Linux/Unix
- Experience with CI/CD pipelines and build processes
PREFERRED QUALIFICATIONS
- Experience with high-performance computing schedulers (OpenPBS, Slurm) and package managers (Spack or similar)
- Experience with technologies for heterogeneous large-scale computing (CUDA-aware MPI, etc.)
- Proficient in Python, C++, CMake, and Bash scripting
- Proficient with GitHub Actions and GitLab CI/CD
- Experience with computer-aided engineering and visualization applications
- Experience with AWS compute, storage, networking, and security services
- Bonus: experience with Prefect, ECS, EKS, Julia
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.