Senior Manager Professional Services Hpc Deployment (Remote) at NVIDIA #vacancy #remote

Senior Manager Professional Services Hpc Deployment (Remote) Senior Manager Professional Services Hpc Deployment | NVIDIA |United States NVIDIA is in search of an HPC Deployment Manager to bolster itsProfessional Services division. Across academia and industry,NVIDIA’s products are driving ground-breaking advancements in deep… Login to continue NVIDIA is in search of an HPC Deployment Manager to bolster itsProfessional Services division. Across academia and industry,NVIDIA’s products are driving ground-breaking advancements in deeplearning, data analytics, and the optimization of data centers. Join ourteam, where we are at the forefront of constructing some of theglobe’s most expansive and rapid data centers! We seek an individualcapable of supervising the deployment of cutting-edge InfiniBand andEthernet technologies with a team comprising AI and HPC experts. This roledemands dynamic interpersonal abilities and a customer-centricapproach.The chosen candidate will engage with clients, collaborators, andinternal units to assess, delineate, and complete large-scale AI/HPCinitiatives. They will orchestrate the day-to-day operations, guidance, andcultivation of a multi-layered team of HPC service professionals. Thisentails ensuring the timely delivery of a varied spectrum of AI HPC datacenter projects. Furthermore, this role offers an opportunity to thrivewithin a fast-paced, inventive, and technologically sophisticatedatmosphere, emphasizing unparalleled performance and the exploration of anarray of novel hardware and software technologies in AI supercomputing.What You Will Be DoingDirects and supervises the service HPC engineering functions indesigning, developing, installing, and validating hardware and software forthe Customer AI High-Performance Computing (HPC) systems.Leads, handles, mentors, and builds a very hardworking HPC serviceengineering team to deliver innovative advances in high-performancecomputing AI systems.Responsible for leading our HPC projects’ planning,implementation, and performance. Improves the integrity of system servicesbring-up and related by applying groundbreaking technical and operationalknowledge to configure and maintain HPC AI network and serverplatforms.Drives HPC team hardware and software deployment, plans, develops, anddeploys procedures for system validation.Lead team activities and drive tests and plans for Customer’s HPCAI systems implementations, custom scripts, and testing procedures toensure operational reliability for the system.Supports the HPC Engineering team, working with other internalcollaborators to develop and run a well-rounded strategy for deliveringservice quality and continuous service improvement. Supports governance forsoftware engineering through the implementation of standards and qualitymeasures.Leads team member development, helping them set and achieve goals fortheir career growth. Develop an inclusive environment that values teammember differences, creating a sense of belonging and appreciation. Chipsin to a culture of trust and clarity.Build strong relationships with INVIDIA leaders, customers, partners,and collaborators. Works closely to identify, implement, and supportleading NVIDIA’s AI solutions engineering, maintaining currency withindustry standards and innovations. Provides input around processoptimization, department budgeting, and the monitoring and management ofresources.Be the domain authority with customers during planning calls throughimplementation.What We Need To See8+ overall years’ experience in IT, high-performance computing,or other related field; 3+ years of experience in a management orleadership roleDemonstrated expertise in HPC systems design configuration andplanning.Proficiency with low latency/high-bandwidth interconnect infrastructure(Infiniband and Ethernet).Expertise with HPC system software cluster management/provisioningtools, including job schedulers (Slurm, salt, xCAT).Proficiency with shared and distributed memory parallelism (OpenMP,MPI, NCCL and HPL) and accelerators (GPUs).Strong scripting ability (Bash, Perl, Python, etc.) and experience withprogramming fundamentals.Expertise with administration, supervising and maintaining secureLinux/Unix operating systems (CentOS, Solaris).Experience establishing processes for maintaining system performance,managing best-in-class standards, and familiarity with cloud computing andcontainer technologies.Ability to understand and work with large, sophisticated systems,identify and resolve problems, handle performance, and troubleshoot networkissues related to infrastructure.Expertise with multi-vendor hardware/software management, security, andnetwork/Internet protocols. Strong communication and social skills, withthe ability to provide detailed information and high-level summaries tomanagement-level individuals and groups, present the business side oftechnical topics to non-technical audiences, and develop positive workingrelationships and strong rapport with team members.Bachelor’s degree in computer science, information systems, or arelated field or equivalent experienceSolid knowledge of HPC storageExemplary communication and interpersonal skills, with the ability topresent the business side of technical topics to non-technical audiencesand persuasively and optimally get along with relationships with variousstakeholders and diverse individuals and groupsWays To Stand Out From The CrowdInfiniBand experience.Experience with GPU-focused hardware/software.Experience with MPI.Automation tooling background (Ansible, Salt, Puppet, etc.).Ethernet and Storage technologies such as Lustre or GPFS.The base salary range is 208,000 USD – 327,750 USD. Your basesalary will be determined based on your location, experience, and the payof employees in similar positions.You will also be eligible for equity and benefits . NVIDIA acceptsapplications on an ongoing basis. NVIDIA is committed to fostering a diverse work environment andproud to be an equal opportunity employer. As we highly value diversity inour current and future employees, we do not discriminate (including in ourhiring and promotion practices) on the basis of race, religion, color,national origin, gender, gender expression, sexual orientation, age,marital status, veteran status, disability status or any othercharacteristic protected by law. Show more Show lessTagged as: remote, remote job, virtual, Virtual Job,virtual position, Work at Home, work from home When applying state you found this job on Pangian.com Remote Network. #J-18808-Ljbffr

puppet Artificial intelligence (AI) Establishing interpersonal relationships Computer Science solaris lustre remote work Communication ethernet cloud-computing scripting Security hpc salt hardware configuration Bachelor’s Degree centos Information Systems mpi infiniband Leadership openmp low-latency nvidia Ansible

Leave a Reply