工作內容
Job Description:
We are seeking a skilled and motivated AI/HPC cluster engineer to design, develop, and maintain a virtualized HPC environment and then deploy to the lab environment and the factory. This role is crucial for enabling our team to develop and test AI/HPC tools in a virtualized environment first, and after verification the complete process, to deploy it on the physical infrastructure.
[Key Responsibilities]
Virtual Environment:
- Build and manage a virtual AI/HPC cluster for development and testing.
- Configure and optimize networking between virtual nodes to emulate real-world HPC environments.
- Collaborate with teams to ensure the virtual cluster meets testing and performance requirements.
- Troubleshoot and resolve networking and system-level issues within the virtual environment.
- Document system configurations, workflows, and best practices for maintaining the virtual cluster.
Physical Envrionment:
- realize and impelemnt the validation test cases into the cluster DUTs.
工作說明
-
工作縣市:高雄市
- 上班地點:高雄市前鎮區
-
工作待遇:面議
-
上班時段:日班,
-
需求人數:1 ~ 2
條件要求
-
工作經歷:
2年以上
-
學歷要求:碩士
-
科系要求:
電機電子工程相關
-
專長需求:
-
擅長工具:
- 具備駕照:
-
其他條件:
[Required Skills]
1. Strong understanding of computer networking concepts and protocols.
2. Proficiency in Linux system administration and troubleshooting.
[Preferred Skills]
1. Experience with Ansible for automation and configuration management.
2. Experience with Shell script, Python, or Node.js.
3. Familiarity with BMC for remote system administration.
4. Understanding of AI/HPC workloads and cluster configurations.