Senior Systems Engineer - Storage
Company: Dana-Farber Cancer Institute
Location: Boston
Posted on: November 6, 2024
Job Description:
Located in Boston and the surrounding communities, Dana-Farber
Cancer Institute is a leader in life changing breakthroughs in
cancer research and patient care. We are united in our mission of
conquering cancer, HIV/AIDS and related diseases. We strive to
create an inclusive, diverse, and equitable environment where we
provide compassionate and comprehensive care to patients of all
backgrounds, and design programs to promote public health
particularly among high-risk and underserved populations. We
conduct groundbreaking research that advances treatment, we educate
tomorrow's physician/researchers, and we work with amazing
partners, including other Harvard Medical School-affiliated
hospitals.The HPC Sr. Engineer, Storage will serve the Dana-Farber
Cancer Institute (DFCI) and its patients within the Computational
Solutions team under the Chief Health Information Office. The
successful candidate will support research storage offerings at a
production level being provided to the research community and other
services.The role requires a strong technical background with
hands-on experience in bringing a research computational pipeline
from prototype to production with regards to error handling,
stability, reliability, usability, scalability and performance. The
ideal candidate understands how hardware and software go hand in
hand and only the ideal combination of compute resources and
software components deliver the optimal
result.Responsibilities:
- Serves as subject matter expert and scientific computing
solutions designer, leveraging deep understanding of both the
scientific domain as well as the computing technology.
- Overall responsible for the productionization and the reliable
operation of HPC workloads (NGS pipelines and others).
- Acts as primary contact for the research community with respect
to high performance computing applications and resources.
- Directly performs and coordinates monitoring and
troubleshooting of production HPC workloads and implementation of
fixes when necessary.
- Researches, finds and implements optimal runtime conditions for
HPC workloads.
- Acts as a cluster scheduler power user and works with system
administration to align configuration of compute resources with
computational workloads.
- Collects metrics and provides input on how to optimally utilize
available compute resources.
- Is fully knowledgeable of informatics services and resources
offered by Research Computing or Partners HealthCare and proposes
solutions to optimally address concrete computational needs through
a combination of different services.
- Maintains the flow of information for end-to-end service
delivery and provides updates where needed.
- Performs other related duties as assigned / needed.Knowledge,
Skills and Abilities required:
- Good working knowledge of Java or C/C++ programming or
equivalent.
- Can build software from sources including knowledge of build
systems (Make, CMake, Maven, Gradle etc.).
- Solid knowledge of computer architectures and
multi-threaded/parallel processing applications.
- Strong knowledge of local and distributed I/O performance
tuning.
- Hands-on knowledge of network- and distributed filesystems
(e.g. NFS, BeeGFS) and knowledge of ZFS.
- Extensive experience with HPC or cloud scheduling, such as
GridEngine, SLURM or LSF.
- Fluency in at least one scripting language (Python preferred),
bash and knowledge of parallel shell.
- Working knowledge of containerization (e.g., Docker).
- Hands-on experience with system administrative tasks in Linux
environments.
- Knowledge of collaboration tools such as Jira, Confluence,
SharePoint and others.
- Exceptional service orientation, excellent problem-solving
abilities; keen attention to detail.
- Excellent analytical, organizational and time management
skills.
- Ability to work under pressure with minimal supervision in a
complex environment.
- Demonstrated ability to work effectively in a highly
collaborative technical team.
- Strong interpersonal skills - ability to interact productively
with users and colleagues of diverse seniority levels and
professional backgrounds.
- Ability to communicate technical topics to technical and
non-technical audiences appropriately.
- Ability to serve both hands-on and in a management capacity, as
needed.Minimum Requirements:
- Bachelor's degree in Computer Science, Software Engineering or
a related field. Master's degree may substitute for
experience.
- 5 years in a similar role; experience may substitute for
degree.At Dana-Farber Cancer Institute, we work every day to create
an innovative, caring, and inclusive environment where every
patient, family, and staff member feels they belong. As relentless
as we are in our mission to reduce the burden of cancer for all, we
are equally committed to diversifying our faculty and staff. Cancer
knows no boundaries and when it comes to hiring the most dedicated
and diverse professionals, neither do we. If working in this kind
of organization inspires you, we encourage you to apply.Dana-Farber
Cancer Institute is an equal opportunity employer and affirms the
right of every qualified applicant to receive consideration for
employment without regard to race, color, religion, sex, gender
identity or expression, national origin, sexual orientation,
genetic information, disability, age, ancestry, military service,
protected veteran status, or other characteristics protected by
law.
#J-18808-Ljbffr
Keywords: Dana-Farber Cancer Institute, Newton , Senior Systems Engineer - Storage, Other , Boston, Massachusetts
Didn't find what you're looking for? Search again!
Loading more jobs...