High Performance Computing System Administrator 3

Louisiana State University and Agricultural and Mechanical College
Baltimore, MD
United States
Category
Job Description

All Job Postings will close at 12:01 a.m. CST (1:01 a.m. EST) on the specified Closing Date (if designated).

If you close the browser or exit your application prior to submitting, the application progress will be saved as a draft. You will be able to access and complete the application through “My Draft Applications” located on your Candidate Home page.

Job Posting Title:

High Performance Computing System Administrator 3

   

Position Type:

Professional / Unclassified

   

Department:

LSUAM FA - ITS - TA - RETS - HPC - Systems (Eric Bryan Wiggins (00002603))

   

Work Location:

0100 Fred C. Frey Computing Services Building

   

Pay Grade:

Professional

   

Job Description:

This position is for an IT Analyst 3 in the High Performance Computing System Administration group in the Information Technology Services Department at LSU. The HPC IT Analyst 3 is responsible for maintaining High Performance Computing storage and infrastructure so that we can provide a high quality of service and reliable uptime to our service community. This position also helps implement new technology both on the clusters and in the infrastructure used to maintain them. All Information Technology Services employees are expected to demonstrate a commitment to exemplary customer service in all facets of their work.

Job Responsibilities:

Operations: Expertise in verifying the quality of operations of Linux supercomputers and storage systems including, but not limited to, monitoring and analyzing storage/infrastructure/job performance, performing daily system checks, analyzing system logs, helping users recognize performance problems, writing scripts to enhance monitoring, and responding to unplanned system events such as power outages. This may require travel to various HPC sites to maintain the physical installation of systems located there.

Proactively perform hardware maintenance on both storage systems and cluster infrastructure as needed. This includes diagnosing and fixing problems, which includes, but is not limited to, running diagnostics, reseating DIMMs, replacing hard disks, calling vendors for RMA support, replacing motherboards, and return shipping replacement parts.

Plan and perform software maintenance on both storage systems and the cluster infrastructure as needed. This includes, but is not limited to, installing operating systems, installing security patches, installing or upgrading drivers, upgrading firmware, installing or upgrading software licenses, and installing or upgrading software specific to HPC cluster management. (50%)


Research: Investigates, architects and implements new technology as appropriate to add new features to the user, management, and infrastructure environments.  This requires the ability to work without training to take a new technology through installation to production. This also includes the ability to develop and document procedures related to that technology and to train other members of the group. (15%)

Customer Support: Respond to tickets which include complaints, requests, troubleshooting, assessing storage options, etc. Provide training to groups or individuals as needed. (25%)

Other duties as assigned. (10%)

Minimum Qualifications:

Bachelor's Degree with 3 years of experience.

Minimum requirement: 2 years of experience with Unix operating systems; Red Hat Linux preferred.
A Ph.D. in Computational Science, Engineering, or another computationally intensive discipline will substitute for two years of experience.
Ideal candidates will have a good understanding of system hardware and software processes (development, configuration, testing, and deployment).

Desired Qualifications:

Bachelor's Degree in Computational Science, Engineering, or a related computationally intensive discipline.

Scripting skills in bash or a similar language.
Experience with filesystems and hardware such as Lustre, GPFS, NAS, DDN, Panasas.
Experience with large-scale Linux deployments; RHEL, Fedora, or CentOS preferred.
Experience with Grid Computing systems and software such as XSEDE, TeraGrid, Open Science Grid.
Knowledge of HPC cluster resource managers such as Torque, SLURM, Condor.
Experience with scientific application portals.
Well versed in computer fundamentals and protocols.
Experience with virtualization technologies (KVM).
Experience with containerization such as Docker and Singularity.
Experience working with large, complex HPC systems.

   

Additional Job Description:

Special Instructions:

A copy of your transcript(s) may be attached to your application (if available). However, original transcripts are required prior to hire.
Please provide three professional references including name, title, phone number and e-mail address.
An offer of employment is contingent on a satisfactory pre-employment background check.

   

Posting Date:

February 22, 2023

   

Closing Date (Open Until Filled if No Date Specified):

  

Additional Position Information:

Background Check - An offer of employment is contingent on a satisfactory pre-employment background check.

Benefits - LSU offers outstanding benefits to eligible employees and their dependents including health, life, dental, and vision insurance; flexible spending accounts; retirement options; various leave options; paid holidays; wellness benefits; tuition exemption for qualified positions; training and development opportunities; employee discounts; and more!

Remote Work - Positions approved to work remotely outside the State of Louisiana shall be employed through Louisiana State University’s partner, nextSource Workforce Solutions, for Employer of Record Services including but not limited to employment, benefits, payroll, and tax compliance. Positions employed through Employer of Record Services will be offered benefits and retirement as applicable through their provider and will not be eligible for State of Louisiana benefits and retirement.

   

Essential Position (Y/N):

   

LSU is an Equal Opportunity Employer:

LSU believes diversity, equity, and inclusion enrich the educational experience of our students, faculty, and staff, and are necessary to prepare all people to thrive personally and professionally in a global society. We celebrate diversity and are committed to the principles of diversity and inclusion. We actively seek and encourage qualified applications from persons with diverse backgrounds, cultures and experiences. To learn more about how LSU is committed to diversity and inclusivity, please see LSU’s Diversity Statement and Roadmap. Persons needing accommodations or assistance with the accessibility of materials related to this search are encouraged to contact the Office of Human Resource Management (hr@lsu.edu).

   

HCM Contact Information:

Questions or concerns can be directed to the LSU Human Resources Management Office at 225-578-8200 or by email at HR@lsu.edu.