Overview
The Institute for Defense Analyses (IDA) has an immediate career opening for a Lead Systems Engineer at its Center for Communications Research in Princeton, New Jersey (CCR-P). IDA offers a competitive salary, an excellent benefits package and a superior professional working environment. For the right individual, IDA offers the opportunity to have a major impact on key national programs while working in support of technical issues and projects.
IDA is seeking a highly qualified individual to manage its High Performance Computing (HPC) resources, including compute clusters, parallel file systems and high-speed networks. The successful applicant will be an expert in the Linux operating system and have significant experience with CPU/GPU-based systems, high-performance storage technologies (e.g. Lustre), HPC or High Throughput Computing (HTC) job allocation technologies (e.g. Slurm, HTCondor), parallel computing environments such as MPI and CUDA, and high-performance network technologies (e.g. InfiniBand, GigE). The incumbent will recommend technologies; work with vendors to specify equipment; supervise or participate in installation; maintain, administer and troubleshoot systems; install software to support research; ensure compliance with DOD and sponsoring agency requirements; and help researchers get the most out of the systems. The individual will also work as part of a team to maintain the environment in which the HPC systems operate and to support the mission of IDA/CCR-P and its sponsor.
Responsibilities
Takes the primary role as project leader and designer of new IT technology initiatives.
Develops test and integration plans for new systems and software in order to ensure compatibility with current infrastructure.
Provides operational support and maintenance, when necessary, to ensure systems functionality, availability, security and performance.
Ensures all systems meet or exceed the business and security requirements in accordance with IDA, DOD, NSA, DISA and DSS directives and guidelines.
Mentors junior staff and coordinates work assignments of junior administrators to ensure project schedules are maintained.
Prepares technical documentation, to include standard operating procedures and processes.
Develops resolutions to complex problems that require the frequent use of creativity. Coordinates resources to resolve problems when necessary.
Administers and maintains classified and unclassified systems and services to ensure optimum performance and availability.
Maintains technical proficiency in new IT technologies, keeps abreast of industry trends and makes recommendations to improve or advance services and system performance.
Communicates all computer network, system and service problems and outages immediately to the appropriate supervisors and/or managers.
Responds to critical after-hours support issues.
Performs other duties as assigned.
Qualifications
Bachelor of Science degree in Computer Science or equivalent experience in related field.
Minimum of eight years of experience in Information Technology, including at least six years in systems administration.
Advanced subject matter expertise in the design, administration and support of servers, systems and software using Linux/Unix. Experience in more than one of the following areas is required:
High Performance Computing (HPC) systems or large cluster computing, including GPU-based systems.
High-performance storage technologies such as Lustre or Hadoop.
HPC or High Throughput Computing (HTC) job allocation technologies such as Slurm or HTCondor.
Parallel computing libraries and environments such as MPI and CUDA.
High-performance network technologies such as InfiniBand and GigE.
Authentication, access control, compliance and security in a DOD environment.
Open Source software installation and support.
Must be organized, self-motivated and able to work with moderate supervision.
Ability to communicate effectively, both orally and in writing, with employees at all levels; good interpersonal skills.
Must be willing to work hours outside of a regular schedule, including periodic on-call support.
Position requires the ability to obtain and maintain a Top Secret/SCI security clearance with full scope polygraph. Current TS/SCI clearance with full scope polygraph preferred.
Ability to obtain and maintain DOD 8570 IAT II certification.