Access Existing Account/Create New Account

Deep Learning Platform System Reliability Engineer (Experienced)

Livermore, CA

Job ID: 689145

Apply Now

Posting Duration:

This posting will be open for application submissions for a minimum of seven (7) calendar days, including the ‘posting date’. Sandia reserves the right to extend the posting date at any time.

NNSA Requirements for MedPEDs:

If you have a Medical Portable Electronic Device (MedPED), such as a pacemaker, defibrillator, drug-releasing pump, hearing aids, or diagnostic equipment and other equipment for measuring, monitoring, and recording body functions such as heartbeat and brain waves, if employed by Sandia National Laboratories you may be required to comply with NNSA security requirements for MedPEDs.

If you have a MedPED and you are selected for an on-site interview at Sandia National Laboratories, there may be additional steps necessary to ensure compliance with NNSA security requirements prior to the interview date.

Salary Range:

$100,900 - $195,400 *Salary range is estimated, and actual salary will be determined after consideration of the selected candidate¿s experience and qualifications, and application of any approved geographic salary differential.

What Your Job Will Be Like:

We are seeking Computer Engineers that are interested in shaping the direction of future computing platforms!

On any given day, you may be called on to:

  • Collaborate with other projects to help map scientific problems to hardware resources in a way that improves performance or efficiency.

  • Investigate new computing/storage technologies (GPUs, SmartNICs, DAs) and conduct performance studies to evaluate their value to Sandia applications (deep learning DL, artificial intelligence AI, data science and engineering)

  • Develop and maintain software tools that improve efficiency of system management

  • Make recommendations on architectural improvements to hardware/software and assist in procuring/deploying changes

  • Share operation responsibilities, including networking, security, system administration, and monitoring concerns

  • Build expertise in new computing architectures with potential to participate in and lead research projects to deploy them for our unique application

Qualifications We Require:

  • Bachelor’s degree in STEM field; or eight years' of relevant engineering or scientific experience

  • Demonstrated proficiency with code development and software engineering

  • Proficiency with Linux, Ubuntu, Unix, and/or RedHat-like Operating Systems

  • Ability to obtain and maintain a U.S. DOE Q-level security clearance

  • Due to the nature of the work, the selected applicant must be able to work onsite in Livermore, CA at least 50% and be available to come onsite as needed while telecommuting

Qualifications We Desire:

  • Proficiency with Python, Bash, and at least one compiled language (e.g., C/C++, go, rust, java)

  • Experience in one or more of the following areas: computer system performance modeling and analysis, HPC system architecture, containerization, and multi-core/GPU/parallel computing

  • Understanding of Linux boot process and system services

  • Proficiency with high-performance computing platforms and tools (e.g., Spack or environment modules)

  • Experience with networked storage systems (e.g., Ceph, DAOS, Lustre, or NVMeoF)

  • Development experience with C++ (11,14) and C

About Our Team:

The Computer Sciences and Information Systems Center is the home of computer science and information systems work and capabilities at Sandia's California site. We have capabilities that span across multiple computer science and information systems disciplines. Our activities include computational science and mathematics research, high performance computing, visualization systems research and development, problem solving environments, information security research and operations, and network operations.

The Scalable Modeling and Analysis department is home to a diverse community of researchers that leverage high-performance computing (HPC) technologies to solve challenging problems for multiple National Security mission partners. Researchers in this department are involved in a wide variety of projects, including accelerating scientific simulations with GPUs and asynchronous programming models; designing software that automatically dispatches parallel simulation and analysis tasks to different HPC platforms; and comparing architecture simulation results to real-world hardware measurements.

A unique capability of this department is the design, procurement, and management of multiple compute platforms for Sandia’s data-intensive workloads. These platforms feature advanced hardware technologies (e.g., tensor accelerators, persistent memory, and SmartNICs) and are the proving grounds for new computing concepts (e.g., user-managed containers, distributed processing frameworks, and in-situ deep-learning methods).

About Sandia:

Sandia National Laboratories is the nation’s premier science and engineering lab for national security and technology innovation, with teams of specialists focused on cutting-edge work in a broad array of areas. Some of the main reasons we love our jobs:

  • Challenging work with amazing impact that contributes to security, peace, and freedom worldwide

  • Extraordinary co-workers

  • Some of the best tools, equipment, and research facilities in the world

  • Career advancement and enrichment opportunities

  • Flexible work arrangements for many positions include 9/80 (work 80 hours every two weeks, with every other Friday off) and 4/10 (work 4 ten-hour days each week) compressed workweeks, part-time work, and telecommuting (a mix of onsite work and working from home)

  • Generous vacations, strong medical and other benefits, competitive 401k, learning opportunities, relocation assistance and amenities aimed at creating a solid work/life balance*

World-changing technologies. Life-changing careers. Learn more about Sandia at:*These benefits vary by job classification.

Security Clearance:

Sandia is required by DOE to conduct a pre-employment drug test and background review that includes checks of personal references, credit, law enforcement records, and employment/education verifications. Applicants for employment need to be able to obtain and maintain a DOE Q-level security clearance, which requires U.S. citizenship. If you hold more than one citizenship (i.e., of the U.S. and another country), your ability to obtain a security clearance may be impacted.

Applicants offered employment with Sandia are subject to a federal background investigation to meet the requirements for access to classified information or matter if the duties of the position require a DOE security clearance. Substance abuse or illegal drug use, falsification of information, criminal activity, serious misconduct or other indicators of untrustworthiness can cause a clearance to be denied or terminated by DOE, resulting in the inability to perform the duties assigned and subsequent termination of employment.


All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, or veteran status and any other protected class under state or federal law.

Job ID: 689145

Apply Now


  • Worklife Balance
  • Flexible Work Schedules
  • Generous Paid Time Off
  • Exceptional 401K Savings Plan
  • Medical/Dental/Vision Insurance
  • Wellness Programs
  • On-site Amenities
  • Vacation Buy Plan
  • Telecommuting Arrangements*

*with management approval

Life in California

  • Close proximity to first-tier universities, Silicon Valley companies, and other top research laboratories and facilitiesM
  • Access to California’s finest public and private schoolsM
  • VineyardsM
  • BeachesM
  • State ParksM
  • Sports – Nearby major league franchisesM
  • Art havenM
  • Proximity to SF Bay AreaM

Learn more about Life in Livermore, California

Sandia invites you to review the Equal Employment Opportunity posters which include EEO is the Law, EEO is the Law Poster Supplement, and Pay Transparency Nondiscrimination Provision.

Sandia is a drug-free workplace. As a national laboratory funded by a U.S. government agency, we are subject to federal laws regarding illegal drug use. Illegal use of a controlled substance, including marijuana even in places where it does not violate state law, may impact your ability to obtain and/or maintain a Department of Energy security clearance, and may result in the withdrawal of an employment offer or termination of employment.

Sandia is committed to Equal Employment Opportunity and providing reasonable accommodation in its application process for qualified individuals with disabilities. If you have difficulty using our online system due to a disability and need special assistance or accommodation, please send an email with your request to the Job Accommodation Specialist in (NM) . Determinations on requests for reasonable accommodation are made on a case-by-case basis.