Systems/Applications Engineer

Johns Hopkins University

East Baltimore, MD

ID: 7075394
Posted: February 23, 2021
Application Deadline: Open Until Filled

Job Description

General Summary/Purpose:

We are seeking a self-motivated and creative HPC system administrator with strong application and user support skills to join a small but highly skilled research computing team that develops and operates a dynamic and rapidly growing High Performance Computing (HPC) environment for Public Health and Biomedical research. The facility has doubled its storage capacity every 12-18 months for over 10 years and currently has over 10 PB of custom-built Lustre and ZFS storage.

Specific Duties & Responsibilities:

Administer, troubleshoot, implement and extend an existing High Performance Computing (HPC) system in a top-flight academic research institution.
Participate in architecture development, strategic decision-making, user training and outreach.
Deploy x86-64 and GPU nodes.
Participate in development, implementation and management of custom-designed multi-petabyte storage and backup systems based on ZFS and Lustre.
Install and administer application and development environments.
Share rotations in after-hours help-desk support with other members of the team.
Prepare web-based documentation.
Maintain security and authentication systems within a HIPAA-compliant system.

Minimum Qualifications (mandatory):

Bachelor's degree.
Five years of related experience.
Additional education may substitute for required experience and additional related experience may substitute for required education, to the extent permitted by the JHU equivalency formula.

Preferred Qualifications:

M.S. or Ph.D. degree in STEM field
Seven years of experience managing Linux servers and applications or five years of experience managing Linux HPC clusters.

Special Knowledge, Skills, and Abilities:

Ability to work independently with minimal supervision
Experience with R, SAS, Stata, MATLAB, image manipulation software, genomic software
Experience administering SGE, Hadoop, Spark, Lua, Lmod, Git, Anaconda
Experience with ZFS, Lustre, EXT3/4 and NFS file system administration
Familiarity with security and authentication technologies such as SSH, VPN, LDAP, Shibboleth, SAML, SELinux, OpenSSL, etc.
Database administration (MySQL, PostgreSQL, MariaDB)
Experience with Unix tools such as awk, grep, sed, vi, emacs
Familiarity with lower-level Linux tools and fundamentals, such as strace and the /proc filesystem, in a RHEL or CentOS environment; experience with other flavors of Linux is also helpful.

Technical Qualifications or Specialized Certifications:

Hands-on Linux system administration in an HPC or large enterprise environment
Solid understanding of IP network fundamentals
Solid understanding of large storage array fundamentals
Solid understanding of POSIX and networked file system concepts and administration (e.g. NFS)
Solid scripting language skills with Perl, Python and Bash
Excellent documentation and writing skills
Ability to share off-hours user and system support responsibility
Ability to operate a motor vehicle and possession of a valid U.S. driver’s license

Any specific physical requirements for the job:

Ability to lift equipment, e.g. a 1U server (40 lbs)

