Job Description
FII USA, Inc., a Foxconn Technology Group Company, is seeking a Reliability Manager to design, implement, and oversee reliability testing plans for enterprise-class server and storage systems. As part of the team, you will handle a wide variety of tasks within the Operations Department in a lab and production-support environment, with the opportunity to apply strong technical and leadership skills and grow your career in smart manufacturing.
The Reliability Manager will lead reliability testing activities, manage a team of engineers and technicians, and develop automation and data analysis processes while assisting the Operations Department as needed.
Responsibilities
- Design and implement ORT (ongoing reliability testing) plans for new and existing server and storage platforms, covering thermal cycling, power cycling, HTOL (high-temperature operating life), and burn-in conditions.
- Configure and maintain test infrastructure for 24/7 system-level reliability testing, including server racks, power cycling systems, and functional stress diagnostic tools.
- Manage ORT equipment setup and operation (developing profiles/programs, building fixtures, configuring servers, switches, AOC cables, etc.).
- Oversee and drive the execution of ORT testing using temperature/humidity chambers, burn-in environments, and related equipment.
- Collect and analyze large datasets from test conditions, applying statistical methods to identify early-life failure mechanisms.
- Develop, automate, and maintain test scripts and procedures for environmental screening and stress testing.
- Collaborate with design, validation, manufacturing, and quality teams to provide feedback on reliability issues and improvements based on ORT results.
- Engage in root cause analysis (RCA) and escalate reliability concerns as needed.
- Generate and present detailed reports and dashboards on ORT outcomes, highlighting areas for improvement and recommending design or process changes.
- Continuously improve ORT methodologies, test coverage, and automation frameworks to meet evolving server architectures and customer expectations.
- Provide leadership, training, and support to engineers and technicians performing ORT activities.
- Other duties as assigned.
Qualifications
- Education required: Advanced degree in Electrical, Mechanical, or Materials Engineering, Chemistry, or a related technical field.
- Experience required:
  - 5+ years of experience in reliability or test engineering roles, preferably with enterprise/AI server and storage platforms.
  - Experience running ORT testing, including use of environmental temperature/humidity chambers and burn-in rooms.
- Technical skills required:
  - Strong understanding of server architecture (CPU, memory, power supplies, I/O, fans, storage).
  - Proficiency with environmental test equipment and methodologies.
  - Experience with scripting or automation tools (Python, Bash, LabVIEW, etc.).
  - Familiarity with data analysis tools (MATLAB, Excel, Power BI, or similar).
- Leadership and soft skills:
  - Strong problem-solving, organizational, and project management skills.
  - Ability to manage multiple projects, plan and schedule test activities, and lead a cross-functional team.
  - Excellent written and verbal communication skills with strong report writing ability.
  - Ability to work independently with minimal supervision while managing a team to achieve results.
  - High integrity, professionalism, and motivation to drive continuous improvement.
- Physical requirements:
  - Ability to sit at a desk and computer for extended periods.
  - Ability to travel by air or car for extended periods.
  - Ability to lift up to 40 lbs.
Reasons you should work for us:
- Comprehensive benefits package including medical, dental, and vision insurance coverage.
- Basic life insurance and short-term disability coverage provided by employer.
- Supplemental life insurance and long-term disability coverage options available.
- 401(k) with employer contribution.
- Personal, vacation, and holiday paid time off for all full-time employees.
- Onsite Aurora Health & Wellness Center available for all employees.
- Employees are continuously encouraged to learn and grow their careers in smart manufacturing.