Senior Linux Engineer needed to install, configure and maintain Linux HPC and HA clusters. Will be testing and integrating third party applications in Linux HPC and HA clusters. Need to troubleshoot software/hardware/OS/application compatibility and configuration issues including onsite installation, integration and verification of customer systems.
Experience with SME-like knowledge of Linux server environment and Configuration of high performance GPFS file systems using RHEL 6.1, IBM Series X servers, and IBM DS Storage. Cluster set-up and configuration using xCAT. Documenting and executing test cases using Rational Quality Manager, using Rational Team Concert as a source code repository. Proactive problems solving skills. Bachelor’s Degree in Computer Science, Math, or Electrical Engineering and 5 years of relevant experience required along with outstanding verbal, written and interpersonal communication skills.
Technical Environment:
GPFS installed on RHEL6 and System X Servers IBM DS Storage Manager IBM advance settings utility IBM Rational Team Concert, IBM Rational Quality Manager Third Party Products: Cisco Nexus and Catalyst switches Brocade Fiber Channel Switches and 8 Gb Fiber Channel adapters Mellanox 10 GbEMellanox Ethernet adapters and QDR Infiniband adapters Intel C/C++/F90 Compliers supporting LP64 and Math Kernel Library Oracle MySQL Enterprise Other Tools and Utilities: xCAT – Cluster installation and management slurm, openmpi – support the IOR and CPU Suite test environment Compilers – Intel C/C++/F90 and GNU C/C++/F90 w/Intel Math Kernel Library Debugging utilities – gdb, idb Performance Analysis Tools – IOR, nsdperf, nmon Linux HA – setting up highly available services Be familiar with OpenMP, Bash, Python