Data Center Engineer

Lehigh University

Bethlehem, PA

ID: 7146645
Posted: May 23, 2023
Application Deadline: Open Until Filled

Job Description

rovide Data Center Infrastructure Management for Lehigh University data centers. Provide input to and execute the design, implementation, maintenance, enhancement and monitoring of the data center infrastructure composed of power, cooling, fire, security while planning for future growth and changes in technology. Implement, perform and maintain hardware and software installations in the data centers. Collaborate with Library and Technology Services Architects to support their designs. The Lehigh community takes seriously our commitment to antiracism and The Principles of our Equitable Community.

1. Data Center Infrastructure Management (DCIM) - Design, monitor, measure, optimize, manage and/or control data center utilization and energy consumption of all IT-related equipment (such as servers, storage and network switches) and facility infrastructure components (such as power distribution units [PDUs] and computer room air conditioners [CRACs])
*Research, design and implement data center improvements to support Disaster Recovery and Business Continuity Planning (DR/BCP) and continued growth
*Design and document data center processes and procedures
*Responsible for upkeeping of the data center environment, ensuring all guests comply while working in the data center operates within approved processes
*Conducting capacity assessments of existing infrastructure to ensure that it can support future growth
*Manage and lead DC projects to include capacity planning for power and cooling
*Plan, research, formulate and prepare project analyses for replacement of outdated older technology

2. Leverage and support existing monitoring and metrics; define new ones to improve efficiency, scalability and proactively analyze data to quantify risks and opportunities for efficiency gains
*Understand process for diagnostic methods of troubleshooting data center equipment
*Produce periodic status reports on utilization, progress, downtime, performance and costs
*Focus on on-premise data centers but incorporate cloud infrastructures
*Monitoring energy and cooling usage across the data center to ensure efficient operation

3. Support Network Engineering, Research/HPC and System Engineering to perform required work within the data center environment
*Installing and configuring new servers, storage devices, routers, switches and other network equipment as designated in the data centers (Linux, Windows OS, firmware updates, repurposing hardware by removing and/or installing additional components)
*Repair servers (replace hard drives; replace memory, GPUs, Fiber SFP, cabling, etc.)
*Configure IPMI, update firmware and OS installation (Linux, Windows, etc.)
*Implement automation via scripting (shell, python)
*Self-perform server diagnostic and troubleshooting
*Assist with HPC grants by reviewing quotes to ensure conformity and to prepare for installation
*Review design and quotes for recommended equipment to ensure conformity within DC environments

4. Disaster Recovery and Business Continuity Planning (DR/BCP)
*Lead improvement initiatives to DR to restore normal business-critical operations after a disaster
*Lead improvement initiatives to BCP to maintain essential functions during and after a natural or man-made disruption
*Participate in exercises
*Assist with documentation of processes

Position Number: S97920

Grade: 11 - 40; $80,240 - $97,620

This is an approximate range and is subject to change based on experience, skills and qualifications.

Special Considerations

Initially, the employee in this position will be required to work on campus where they can be fully accessible to the Lehigh community. However, this position will be eligible to work partially remote after 6 months to a year in the position.

This position has contact with minors

Will sometimes need to climb or balance, pull or push, see, stand, use hands to touch, handle or feel, reach with hands and arms, sit, stoop, kneel, crouch or crawl, walk, work near moving mechanical parts, lift up to 25-50 pounds and be subject to electrical hazards and frequent loud noise(s)

Qualifications

Bachelor’s Degree in Electrical/Mechanical Engineering, Computer Science, Information Technology or related field or equivalent combination of education and experience

Five to eight years related work experience

Understanding of hardware and software and installation

Ability to read and interpret data center diagrams and schematics

Problem-solving, troubleshooting and analytical skills

Strong attention to detail

Scripting experience

Customer-focused mindset, with demonstrated skill in managing expectations, providing proactive status updates and producing high-quality work product

Ability to use independent judgment to make sound, justifiable decisions and take action to solve problems

Ability to communicate effectively with customers and internal staff and effectively work in team environment

A strong desire and aptitude for solving problems and performing deep technical dives to resolve issues quickly and efficiently

Electrical and HVAC knowledge

Professional Networking

Successful completion of standard background checks including but not limited to: social security verification, education verification, national criminal background checks, motor vehicle checks, PATCH, FBI fingerprinting, Child Abuse Clearance and credit history based upon the requirements of the position