ALICE-SHARK User Meeting 2026

ALICE-SHARK User Meeting 2026

Announcement

We are excited to announce that the ALICE-SHARK User Meeting 2026 will take place on Tuesday, 2 June 2026 from 10:00 - 14:00 in the Agora building room 5A.42.

Program is now available (see Schedule below)

If you are a user of the ALICE HPC cluster at Leiden University or the SHARK HPC cluster at the Leiden University Medical Center (or both), this meeting is for you. Our goal is to bring the user communities of both clusters together to connect with each other and the support teams of both clusters. The meeting is also of interest to users who are not yet actively using one of the two clusters.

The meeting will include an update about both clusters, a selection of talks from users and an interactive Q&A session with the support teams. 

Attendance is possible in person and remotely. For in-person attendees, the meeting will be held in the Agora building room 5A.42 and lunch will be provided. For remote attendees, all sessions will be streamed live with the possibility to ask questions remotely (details will follow). However, we recommend the attendance of the meeting in-person because it will be easier to interact and connect with each other.

Registration for the meeting is mandatory. The number of people that can attend in-person is limited by the space of the room and will be filled on a first-come-first-serve basis.

Timeline

  • Registration open: 20 Mar 2026

  • Abstract Submission deadline: 4 May 2026 at 23:59 CEST

  • Deadline for registration: 31 May 2026 at 23:59 CEST.

  • Meeting: 02 Jun 2026, 10:00-14:00 CEST

Registration

If you want to register for the meeting, please use the following link: Registration Form

Schedule

The following table shows a tentative schedule for the meeting. User talks will be published after the abstract submission deadline has passed and the program has been finalized.

Start Time

End Time

Title

Speaker

Abstract

Start Time

End Time

Title

Speaker

Abstract

10:00

10:15

Arrival Participants

 

 

10:15

10:20

Welcome

 

 

10:20

10:40

Overview/Update SHARK

 SHARK Team

 

10:40

11:00

Overview/Update ALICE

 ALICE Team

 

11:00

11:05

Handover speaker

 

 

11:05

11:20

Social and Neurobiological Mechanisms of Risk and Resilience in Young People

E. Buimer (LEI)

Many young people experience adversity while growing up, such as bullying, neglect, or abuse, which increases the risk of mental health problems later in life. Yet, some remain resilient. Resilience refers to the ability to maintain or regain mental health and well-being despite these challenges. In the THRIVE study, we investigate why some individuals are more resilient than others.

We follow young adults (18–24 years) over time and combine questionnaires, behavioral tasks, hormonal measures, and brain imaging to understand how social interactions, cognitive processes, and stress responses relate to mental health. For example, we study how individuals process emotions, learn from feedback, and respond to social stress.

From a data perspective, this project involves integrating complex, multimodal, and longitudinal datasets. Imaging analyses are performed on the ALICE high-performance computing cluster, while other sensitive data (e.g., behavioral and clinical measures) are stored and processed in separate, secure environments. This setup reflects both computational demands and strict data governance requirements. The project also involves methodological challenges, such as reliably de-identifying anatomical MRI scans to prevent participant recognition while preserving data quality for analysis.

By uncovering how social support and cognitive functioning contribute to resilience, the THRIVE study aims to inform interventions that strengthen mental health in young people.

11:20

11:25

Handover speaker

 

 

11:25

11:40

Modelling Job Resources for Improved Scheduling

H. Rasche (LUMC)

Scheduling jobs is the core of how we interface with our clusters. But how do we schedule jobs optimally? How do we ensure we're not wasting resources unnecessarily? This talk will look at some prior art for optimising cluster scheduling as well as the author's recent work towards improving job runtime and memory requirement with linear models based on historical job performance data, in the context of WDL pipelines executed within Klinische Genetica.

11:40

11:45

Handover speaker

 

 

11:45

12:00

Working with a lot of small images on ALICE: Practical Lessons

A. van de Pol (LEI)

I will describe image pipelines I ran on ALICE for a dissertation on colonial Korean print
culture. Two pipelines sit at the centre. One uses a YOLO detector to extract 600,000
advertisement cutouts from digitised Korean newspapers, then embeds them with a DINOv3
ViT-B backbone fine-tuned with rank-8 DoRA adapters. The other starts from 57,583 page
images of four colonial printshops and produces several million attention-weighted patches
through a multi-instance-learning model trained from scratch on ALICE.

 

The hardest questions were not about input rates but about storage. Do we keep raw image
files on disk, or move everything into a database? Each pipeline started from raw files and
ended with a database of cutouts and embeddings that downstream code could query without
touching the scans again. For the advertisement pipeline, I ran an entire round of experiments
on 200-pixel thumbnails before committing ALICE time to full-resolution crops.

After training, the pipelines run YOLO, fine-tuned DINOv3, and the MIL model over the
full corpus to populate the database. The downstream clustering first used scikit-learn UMAP
and HDBSCAN on CPU nodes, which collapsed past the million-patch mark. Swapping in
NVIDIA cuML on a single GPU required essentially no code changes and turned days into
hours, cheap enough to iterate on rather than run once.

I will close with a few small best practices I have picked up along the way.

12:00

12:45

Lunch

 

 

12:45

13:00

slurm-board: A pre-release tool for monitoring Slurm & configuring Slurm jobs using interactive, real-time resource insights

V. van der Sluis (LUMC)

In this talk, we introduce slurm-board, an interactive dashboard designed to simplify the configuration, monitoring and submitting of Slurm jobs. Users often find it difficult to configure a job due to complex settings and limited knowledge of available resources, resulting in inefficient cluster usage and longer queue times.

Slurm-board addresses these issues by providing a user-friendly SBATCH-generator that allows users to easily configure jobs. Combined with real-time resource availability information, it enables users to efficiently configure their jobs

13:00

13:05

Handover speaker

 

 

13:05

13:50

Q&A with ALICE - SHARK Team

 ALICE & SHARK Teams

 

13:50

13:55

Closing