Research

ECAI 2023, Santiago, Spain

I study reinforcement learning with deep neural networks, focusing on transformer-based agents, representation learning, and interpretability. My work is part of the 3AI Project between hessian.AI and the Artificial Intelligence and Machine Learning Lab at TU Darmstadt.

Quick navigation:

Research Topics

A snapshot of the themes that guide my current work.

Representation Learning

Learning compact and semantically meaningful state representations that make decision-making more data-efficient and robust.

Transformer-based Agents

Exploring attention-based architectures for long-horizon reasoning and generalization in RL.

Interpretability

Understanding why agents act the way they do by emphasizing object-centric views and causal cues.

Robustness

Designing agents that remain reliable under distribution shifts and noisy, imperfect environments.

Publications

Grouped by venue type for quick scanning.

Journals

  • Quentin Delfosse*, Jannis Blüml*, Bjarne Gregori, Sebastian Sztwiertnia, Kristian Kersting (2024): OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments. Reinforcement Learning Journal Vol. 1 (RLJ). More Information
  • Jannis Blüml, Johannes Czech, Kristian Kersting (2023): AlphaZe**: AlphaZero-like baselines for imperfect information games are surprisingly strong. Frontiers in Artificial intelligence 6. More Information

Conferences

  • Johannes Czech, Jannis Blüml, Kristian Kersting, Hedinn Steingrimsson (2024): Representation Matters for Mastering Chess: Improved Feature Representation in AlphaZero Outperforms Changing to Transformers. In Proceedings of the 27th European Conference on Artificial Intelligence (ECAI). More Information

Workshops

  • Timo Kaufmann*, Jannis Blüml*, Antonia Wüst*, Quentin Delfosse*, Kristian Kersting, Eyke Hüllermeier (2024): OCALM: Object-Centric Assessment with Language Models. At RLC 2024 Workshop on Reinforcement Learning Beyond Rewards. More Information
  • Quentin Delfosse*, Jannis Blüml*, Bjarne Gregori, Kristian Kersting (2024): HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning. At RLC 2024 Workshop on Interpretable Policies in Reinforcement Learning. More Information

Preprint

  • Yannik Keller, Jannis Blüml, Gopika Sudhakaran, Kristian Kersting (2023): From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex?. arXiv preprint arXiv:2311.13414

Reviewing

Serving as a reviewer at RLC, AAAI, IEEE ToG, NeurIPS

Projects

Selected ongoing work and recently completed projects.

Object-centric Representation for Interpretability and Robustness

The Arcade Learning Environments platform is widely used to train deep RL agents. Object-centric environments help build interpretable agents and compare against object-aware baselines. The results can be found here. We also develop new methods to adapt environments to search for misalignments and shortcut learning, enabling more robust agents while focusing on object-centric input representations.

Feature Representation in AlphaZero.

While transformers have gained the reputation as the "Swiss army knife of AI", no one has challenged them to master the game of chess, one of the classical AI benchmarks. Simply using vision transformers (ViTs) within AlphaZero does not master the game of chess, mainly because ViTs are too slow. Even making them more efficient using a combination of MobileNet and NextViT does not beat what actually matters: a simple change of the input representation and value loss, resulting in a greater boost of up to 180 Elo points over AlphaZero. The results can be found here. Further we researched how information in games like Stratego (Information Sets) or Hex (Graph-based Inputs) can be represented to enable methods like AlphaZero to work.