Senior Researcher – LLM Systems & Inference Architecture
Position Overview
A research-driven technology organization is seeking a Senior Researcher specializing in large language model (LLM) systems and inference architecture.
This role focuses on advancing the efficiency, scalability, and performance of LLM inference systems through system-level optimization and hardware–software co-design. The position operates at the intersection of AI systems, compilers, and heterogeneous computing platforms.
Key Responsibilities
LLM Inference Engine Optimization
- Design and implement high-performance inference engines for large language models.
- Improve the efficiency of both open-source and internal inference frameworks.
- Develop optimization techniques such as:
  - Model quantization
  - Sparse attention mechanisms
  - Key-value cache reuse and memory optimization
- Improve throughput and reduce latency for transformer-based and generative AI workloads.
Hardware–Software Co-Design
- Design and implement optimized compute kernels for heterogeneous accelerators.
- Enable efficient execution of LLM workloads across specialized hardware platforms.
- Profile end-to-end AI workflows to identify system bottlenecks across runtime, framework, and hardware layers.
- Develop low-latency, high-throughput solutions bridging Python-based frameworks and accelerator backends.
Performance Engineering & System Optimization
- Analyze and optimize large-scale AI pipelines, focusing on:
  - Memory bandwidth utilization
  - Compute efficiency
  - Scheduling and execution pipelines
- Improve system-level performance across distributed or heterogeneous environments.
Research & Ecosystem Contribution
- Publish research in leading systems and machine learning conferences.
- Contribute to open-source AI systems and tooling ecosystems.
- Support adoption of AI infrastructure through:
  - Developer tools
  - Documentation
  - Collaboration with external partners
Required Qualifications
- PhD or Master’s degree in Computer Science, Computer Engineering, or a related field.
- Strong programming skills in Python and C/C++ for system-level development.
- Solid experience with AI frameworks such as PyTorch or TensorFlow.
- Deep understanding of model optimization techniques for large-scale neural networks.
- Experience with system-level performance profiling and optimization.
Preferred Qualifications
- Experience with heterogeneous or accelerator programming environments (e.g., GPU, NPU, or other xPU architectures).
- Familiarity with kernel development, custom operators, or compiler-level optimization.
- Experience with large multimodal or vision-language models.
- Contributions to open-source AI infrastructure or research publications in top-tier venues.
Personal Attributes
- Strong systems thinking with the ability to bridge AI models and hardware platforms.
- Research-oriented mindset combined with practical engineering skills.
- Ability to work independently while collaborating across multidisciplinary teams.
- Clear technical communication skills.
Apply Now
By applying to this role, you acknowledge that we may collect, store, and process your personal data on our systems.
For more information, please refer to our Privacy Notice.