Role: Senior Hybrid Evaluation / Test Environment Lead (CXR VLM/LLM)
Location: Eindhoven, Netherlands (High Tech Campus 52, 5656 AG Eindhoven)
Is it Permanent/ Contract: Contract – 6-12 months
Is it Onsite/Remote/Hybrid: Hybrid – 3 days a week)
Language: English
This role focuses on independent evaluation and controlled experimentation, separate from core model development, to support evidence generation and safe iteration.
Purpose of the role:
Design, build, and operate a hybrid evaluation and test environment for CXR VLM/LLM models, enabling systematic testing of model functionality, edge cases, and performance across findings without interfering with the main development pipeline.
Key capabilities
- Experience setting up model evaluation sandboxes or test harnesses for AI/ML systems, ideally in medical imaging.
- Ability to test and compare multiple CXR VLM/LLM variants across findings (e.g. pneumothorax, cardiomegaly, fractures) using consistent protocols.
- AWS experience and familiarity with cloud based dev environment.
- Familiarity with report-level evaluation, discrepancy analysis, and structured comparison between AI-generated outputs and clinician-validated references.
- Comfortable working at the intersection of engineering, clinical logic, and governance, feeding findings into monitoring, change management, and validation processes.
- Senior judgement to distinguish exploratory testing from evidence that is eligible for regulated use.