Sandia National Laboratories Jobs

Sandia National Laboratories Career Site

Job Information

Sandia National Laboratories High Performance Computing Information Systems Architect (Experienced) in Albuquerque, New Mexico

:

This posting will be open for application submissions for a minimum of seven (7) calendar days, including the ‘posting date’. Sandia reserves the right to extend the posting date at any time.

:

Sandia demonstrates its commitment to public safety in the national interest by requiring that all new hires be fully vaccinated or have an approved medical or religious accommodation before commencing employment. The requirement also applies to those who are telecommuting and working virtually.

Any concerns about the ability to meet this requirement should be directed to HR Solutions at (505) 284-4700.

:

Passionate about your work and dream of joining a dynamic team that solves complicated issues for our nation's security? Join us and unleash your potential!

We are seeking an energetic and innovative individual for the position of Information Systems Architect to join our team. The team manages both emerging and next-generation High Performance Computing (HPC) architectures as part of a collaborative partnership with staff to explore the potentials for such architectures to meet large-scale HPC needs. If you want to use your HPC systems management experience to determine and deploy new approaches for operations and efficient utilization of advanced and exploratory computing technologies, this is the opportunity for you!

On any given day, the selected candidate may be asked to:

  • Initiate, lead, and complete the development of new operational methodologies and design of infrastructure to enable efficient operations of multiple, concurrent, emerging technology and prototype HPC Clusters.

  • Collaborate with research and development staff, colleagues, and vendors to deliver functional platforms for pre-production systems running research software in exploratory configurations.

  • Identify complex technical issues and devise solutions on wide variety of HPC platforms.

  • Participate in all aspects of the HPC system lifecycle including facility integration, standup, acceptance testing, performance benchmarking, operational support, and reclamation.

  • Maintain all system aspects of security, networks, filesystems, system software installation, and user support.

    Required:

  • Bachelor’s degree in Computer Science, Computer Engineering, Information Systems Engineering (CIS/MIS), or relevant STEM field plus eight more years of relevant IT experience

  • Five years’ experience administering Linux /Unix Cluster systems, including hardware setup, installation, upgrades, and diagnostic troubleshooting

  • Experience administering multiple Linux/Unix clusters

  • Can obtain a DOE Q clearance

    Desired:

  • Experience customizing complex Linux container (docker, kubernetes, etc.) solutions for HPC workflows

  • Experience with automation tools for configuration management (e.g. Ansible, Puppet, Chef)

  • Experience with complex programming environments typical in HPC platforms, including use of MPI and other tools for system troubleshooting and benchmarking

  • Experience configuring/building/installing scheduling software (e.g. LSF, Moab, or SLURM), parallel filesystems (e.g. GPFS, Lustre), and high-speed networks (e.g., Infiniband, high-speed ethernet, Cray Aries)

  • Experience configuring storage administration, fiber channel SAN, LUN provisioning, NFS filesystem management, and related processes and technologies for multiple HPC resources and architectures

  • Experience administering heterogenous clusters consisting of GPU-based, ARM-based, x86_64-based, and next-generation architectures

  • Advanced scripting experience (Shell, Python, PERL, or any other system-level scripting)

  • Knowledge of and experience with security and authentication components, such as ssh, Kerberos, LDAP, SSL, nmap, public and private key encryption, and other third party security products

  • Knowledge of and experience with complex networking infrastructure, firewalls, routing, bonding, and VLANs

    Department Description:

We support and develop innovative solutions for the operation and efficient utilization of leading and next-generation computing systems. Our Heterogeneous Advanced Architecture Platforms (HAAPs) comprise small instances of the latest and prototype technology in computing to enable code developers and computer science researchers to test and evaluate candidate advanced processors, accelerators, networks, etc. to determine their potential to meet large-scale HPC needs. Our Advanced Technology Systems (ATS) testbeds enable porting of codes within Sandia's network environment in preparation to run production calculations on the extreme-scale platforms of the DOE. The HAAPs team uses its expertise and exposure to new architectural features to advance administration and operations of both sets of platforms.

About Sandia:

Sandia National Laboratories is the nation’s premier science and engineering lab for national security and technology innovation, with teams of specialists focused on cutting-edge work in a broad array of areas. Some of the main reasons we love our jobs:

  • Challenging work with amazing impact that contributes to security, peace, and freedom worldwide

  • Extraordinary co-workers

  • Some of the best tools, equipment, and research facilities in the world

  • Career advancement and enrichment opportunities

  • Flexible work arrangements for many positions include 9/80 (work 80 hours every two weeks, with every other Friday off) and 4/10 (work 4 ten-hour days each week) compressed workweeks, part-time work, and telecommuting (a mix of onsite work and working from home)

  • Generous vacations, strong medical and other benefits, competitive 401k, learning opportunities, relocation assistance and amenities aimed at creating a solid work/life balance*

World-changing technologies. Life-changing careers. Learn more about Sandia at: http://www.sandia.gov*These benefits vary by job classification.

Security Clearance:

Sandia is required by DOE to conduct a pre-employment drug test and background review that includes checks of personal references, credit, law enforcement records, and employment/education verifications. Applicants for employment need to be able to obtain and maintain a DOE Q-level security clearance, which requires U.S. citizenship. If you hold more than one citizenship (i.e., of the U.S. and another country), your ability to obtain a security clearance may be impacted.

Applicants offered employment with Sandia are subject to a federal background investigation to meet the requirements for access to classified information or matter if the duties of the position require a DOE security clearance. Substance abuse or illegal drug use, falsification of information, criminal activity, serious misconduct or other indicators of untrustworthiness can cause a clearance to be denied or terminated by DOE, resulting in the inability to perform the duties assigned and subsequent termination of employment.

EEO Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, or veteran status and any other protected class under state or federal law.

Job ID: 679500

DirectEmployers